Gene Gdia_2000 details


Gene Information

Locus tag: Gdia_2000
Symbol:
ID: 6975426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2222502
End bp: 2223605
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 75%
IMG OID: 643391529
Product: protein of unknown function DUF1058
Protein accession: YP_002276375
Protein GI: 209544146
COG category: [S] Function unknown
COG ID: [COG3807] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
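
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2222502
end_bp = 2223605
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
derived_protein_length = codons - 1

print(span, remainder, derived_protein_length)  # prints: 1104 0 367
```

Under these assumptions the derived values agree with the listed Gene Length and Protein Length.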


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTTC CCTCGCCCGT CGTCCCTTCC GTTGGCACCT GCCGGCCCAG CGCCGCGCCG 
GCCGGCGGAA CCGTCCGCCG TGGCTGGCTG GTCTGCGCCG CCATCCTGGC CCCGGCCGCG
CTGGGCGCCC CGGGCGCGGC CGCGCAGCAG ACCGCCACGC ATCACCGCCA CCACCACCAT
CATGCGCCGG CAAACGCACC CGTGCCGGCC GCAACCGCCG CGCGCCATCA TCATCACGCG
GCCGCCCCCG CGCCCCACGG GCACCACCCC GCCCCCCACC ATGCCGGGGC GATTCACGGG
GGAACGCCGC CTCATCACCA TCATGTGATC GCGCCCCGCC ATGCCGCCCT TCCCCCTGCC
GCGACGGGTG TGGCGGCCGG CGCGGCAGCG GGGGTGGCGG CCGGGACGGC GGGACCGGCG
CAGGCCGAGA CGGCCATCCC GTCCCCCCCC GGACCGGCGG ATGCCGCCGC GATAGACAAG
GGCACCGTGA CCGGCCTGCC GCTGCCGCGC TTCGCGGCGC TGCGCGCGGA CGAGGTGAAC
ATGCGTTCGG GCCCTGGCCA GCGCTATCCT ATCGCCTGGG TCTATCACCG CCGCGACCTG
CCGGTGAAAA TCGAGCGGGA ATTCGACGTC TGGCGCCTGG TCGAGGATTC CGACGGCCAG
AAAGGCTGGG TCCATCAGGC GACGCTGGTC GGCGCACGCA CCTTCGTGGT GCCCGGGCTG
CCGCCGGTCG ACCCCGCATC AGATGCTGCT GCCCAGGGTG CTTCTGCCCA GGGCGCCCCT
GCCCGGAGCG GGACCGCCCC GGCCGGCGGC AAGCCCGCCG CCCCGACGCC GCAACCGGGG
CCGGGCGGCC ATTTCGACAC CACCGTCGTC GGCCACCTTG CGGACCCGGC GGCGGCGGCC
ACGATCCCGG GCGCCGTCAT CCTGCGCGCG GCGGCCGATG CCGCATCGGC GGTCGTCGCG
GTGCTGAAGC CGGGTTCCGT CGGCACATTT CGCACGTGCG CCGCCGGCAC AACCTGGTGC
AGGGTCAGCG TGCAGCATTA TTCGGGCTGG CTGGACCGGT CGTCGGTCTG GGGTCTTCTG
CCGCAGGAGA CCATCCAGCC GTAG
 
Protein sequence
MTLPSPVVPS VGTCRPSAAP AGGTVRRGWL VCAAILAPAA LGAPGAAAQQ TATHHRHHHH 
HAPANAPVPA ATAARHHHHA AAPAPHGHHP APHHAGAIHG GTPPHHHHVI APRHAALPPA
ATGVAAGAAA GVAAGTAGPA QAETAIPSPP GPADAAAIDK GTVTGLPLPR FAALRADEVN
MRSGPGQRYP IAWVYHRRDL PVKIEREFDV WRLVEDSDGQ KGWVHQATLV GARTFVVPGL
PPVDPASDAA AQGASAQGAP ARSGTAPAGG KPAAPTPQPG PGGHFDTTVV GHLADPAAAA
TIPGAVILRA AADAASAVVA VLKPGSVGTF RTCAAGTTWC RVSVQHYSGW LDRSSVWGLL
PQETIQP
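
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration rather than the method used to produce this record. It builds the codon table from the standard NCBI ordering of bases (TCAG), whose amino acid assignments are the same for translation table 11 as for the standard code, and translates only the first and last lines of the gene sequence. The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence, copied from this record.
first_block = "ATGACGCTTC CCTCGCCCGT CGTCCCTTCC"
last_block = "CCGCAGGAGA CCATCCAGCC GTAG"

print(translate(first_block))  # prints: MTLPSPVVPS
print(translate(last_block))   # prints: PQETIQP
```

Both results match the start and end of the listed protein sequence. The last line can be translated on its own because every preceding line holds 60 bases, so it begins in frame.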