Gene Gdia_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2160 
Symbol 
ID6975588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2390820 
End bp2392946 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content74% 
IMG OID643391689 
Producthypothetical protein 
Protein accessionYP_002276533 
Protein GI209544304 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein
[COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCCTAT CCTGGCGCCT GCCCCCGTTC CTGCGTCTGT CATTGCTGTT TCTGGCGCCG 
CTTGTCTTCC TCCTGGCGCT GCCGTCCGGC CGCGCGGGGG CGGCTGAGAG TGCGGCGCTG
ACCACCACGC GCGACAGCGC GACGCTGGTG ACGGACACCG ACCGGTACCG GCCCGGCATG
CCGCTGCATG TCGGGCTGCG CCTGCGGCTG CAGCCGGGGT GGCATACCTA CTGGATCAAT
CCCGGCGACG CGGGCGAGGC CCCGACCCTG GCGGTGCAGG CCGGTGGCGG GGCCGCCGGC
GCATCCGGCA CCATTGCCTG GCCGGTGCCC CGGCGCCTGC CCGACGGGCC GCTGATGTCC
TACGGCTATA CCGGCGAGGT GCTGCTGCCG GCCGAACTGC GCCCGACCGG ACCGGGGGAC
GGGCCGATGG TGTTGCAGGC CACGGCCGAA TGGCTGGTCT GCGCCGCCGT CTGCGTGCCC
GAACAGGGGC GGTTCACCCT GACGCTGCCG CCGGGCGACC CCGCGCCCTC GGACCAGGCG
CCGCTGTTCG CGCGCGCCGC CGCCGCGCGG CCGGTTGCCT CGCCCTTCGC CGCGATGATC
GCGTCCGATG GCGTGTTGGC GATCACGGGC GCGGGGCTGT CGTCCGATAC GGTGCGTGAT
GCCTGGTTCA TGCCCACGAT CTCCGGCGCC ATCGACCAGG ATGCGGCGCA GGCCCTGTCG
GTCCATGACG GCGGGCTGAC CCTGGCGCTG AAGCCCGGCG ACGCCTTCCA CGCCGCCCCC
ACGCTGGACG GCGTGCTGGT GCTGAAGGAT GCGGCGGGGC AGGAACGCGG CCTGTCGGTT
TCGGCCCGTC CCGGTGCGGT GCCGACCGTG ACCCCTGGGG CGGTCCCCGG GCCGGCCCAT
AGCGGCGCGG CGCGCATGCT GCTGCTGGCT TTCGCCGGCG GGCTGATCCT GAATTTGATG
CCGTGCGTGT TTCCGGTGCT GGTGATGAAG GCGATGGCCC TGACCCGCCT GGCCGGCGGG
GGCGCGTCGG CGCGGCTGGC CGGCGCCGGG GCCTATGCGG GCGGCGTGAT CGGCGCGTTT
GTGATGCTGG GCGGCGTGAT CCTGGCGCTG CGCGCGGCCG GGGCCGTCGC GGGCTGGGGC
TTCCAGTTCC AGTCGGCGGC CTTTGTCGCC GCCATGTGCT GGCTGCTGTT CGCGGTGGCG
CTGAACCTGC TGGGCGTGTT CGGCATGGGC AACGGCATCG CGGCGCTGGG CGGCGTGGGC
GTGTCGCGCA CCGGCTATCT GGGCGATGCG ATGACCGGGG TGCTGGCGGT GCTGGTGGCC
ACCCCCTGCA CCGCGCCGTT CATGGGGGCG GCGATCGCGG GCGCGCTGGC GGCCTCGCCG
GTCGTGGCGA TGGCGGTGTT CGTCGCGATG GGGGCGGGGC TGGCTGCGCC CTACCTGCTG
CTGGCCGGCG TGCCGGGGCT GGCGCGCCAC CTGCCGCGCC CCGGCGCGTG GATGGATATC
CTGCGCCAGG GCCTGGCCTT TCCGGTGCTG GGCACCTGCG TATGGCTGAT GTGGGTGCTG
GCGTTGCAGG CCGGGGCCGG CGCGATCGCA GTGCTGGGCA TCGGGCTGGT GCTGATCGGG
CTGGCGGCGT GGCTGACCGG CCTGGCGCAG GGCATGGCCA TGCGCGACGG CCCGGCGCGG
CGCATTCGCC TGTGCCGGGC GGTGGCGCTG GCCGCGCTGG CGCTGGCCAT GGCGCTGCTG
CCCGGGCTGG GCGGCACCGT ATCCGGGGGC GCGGCGCGTC TGGCCGCCGA CGGGTCGGAA
CCGTTTTCGG CCAGCCGGCT GGCCACGTTG CGGGCGCAGG GCCGCCCGGT GTTCGTGGAC
ATGACGGCGG CGTGGTGCAT TTCCTGCCTG GTGAACGAAC GGGTGGCCCT GTCCTCGGCC
CCGGTACAGG CGGCGTTCCA TGACCGGCAC GTGGTCTACA TGAAGGGCGA CTGGACCAAC
CGGAACGCGG CGATCAGCGC CTTCCTGGAC GCCCATGGGC GGGCGGGGGT GCCGTTCTAC
CTCTATTACC CGCCGGGCAG CGGCGATGGC CGGGCCCTGC CGCAGATCCT GACGACCGGA
CTGGTGCTGG ACGCGCTGAA AGGATAG
 
Protein sequence
MVLSWRLPPF LRLSLLFLAP LVFLLALPSG RAGAAESAAL TTTRDSATLV TDTDRYRPGM 
PLHVGLRLRL QPGWHTYWIN PGDAGEAPTL AVQAGGGAAG ASGTIAWPVP RRLPDGPLMS
YGYTGEVLLP AELRPTGPGD GPMVLQATAE WLVCAAVCVP EQGRFTLTLP PGDPAPSDQA
PLFARAAAAR PVASPFAAMI ASDGVLAITG AGLSSDTVRD AWFMPTISGA IDQDAAQALS
VHDGGLTLAL KPGDAFHAAP TLDGVLVLKD AAGQERGLSV SARPGAVPTV TPGAVPGPAH
SGAARMLLLA FAGGLILNLM PCVFPVLVMK AMALTRLAGG GASARLAGAG AYAGGVIGAF
VMLGGVILAL RAAGAVAGWG FQFQSAAFVA AMCWLLFAVA LNLLGVFGMG NGIAALGGVG
VSRTGYLGDA MTGVLAVLVA TPCTAPFMGA AIAGALAASP VVAMAVFVAM GAGLAAPYLL
LAGVPGLARH LPRPGAWMDI LRQGLAFPVL GTCVWLMWVL ALQAGAGAIA VLGIGLVLIG
LAAWLTGLAQ GMAMRDGPAR RIRLCRAVAL AALALAMALL PGLGGTVSGG AARLAADGSE
PFSASRLATL RAQGRPVFVD MTAAWCISCL VNERVALSSA PVQAAFHDRH VVYMKGDWTN
RNAAISAFLD AHGRAGVPFY LYYPPGSGDG RALPQILTTG LVLDALKG