Gene Gdia_0115 details


Gene Information

Locus tag: Gdia_0115
Symbol:
ID: 6973506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 130556
End bp: 132457
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 69%
IMG OID: 643389648
Product: cobalt chelatase, pCobT subunit
Protein accession: YP_002274530
Protein GI: 209542301
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase)
TIGRFAM ID: [TIGR01651] cobaltochelatase, CobT subunit
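
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function name `check_gene_record` is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return simple consistency checks for a CDS record.

    Assumes 1-based inclusive coordinates and a protein length
    that does not count the terminal stop codon.
    """
    span_bp = end_bp - start_bp + 1  # inclusive span on the replicon
    return {
        "span_matches_length": span_bp == gene_length_bp,
        "length_is_whole_codons": gene_length_bp % 3 == 0,
        # one codon is the stop codon, so it is not part of the protein
        "protein_matches_codons": gene_length_bp // 3 - 1 == protein_length_aa,
    }


# Values taken from the Gdia_0115 record above.
checks = check_gene_record(130556, 132457, 1902, 633)
print(checks)
```

For this record all three checks hold: the span is 1902 bp, which is 634 codons, or 633 amino acids plus one stop codon.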


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.102217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0675463
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGACC GCAAGGACAC CACCCAGTCC GCACGATTGG CCGCCGCCGA GCGGGCCGAC 
GTCTTCAAGC GCGCGACCGT TGGCGCGCTG CGCGCGCTGG GCGGCCGGGC GACGGCCGAG
GTCACGTTCC AGACCGGCCC GATTCCGCCT GCGGCGGCGG TCAGCGGCGA TCACGTCCGC
CTGCCGCAGC CCGCCCTGCA ACTGGCCGAG GCCGATATCC GGCGCGTGCG CGGCGCGGCG
GATGCCGTGG CGCTGCAACT GCGCCATCAT GACGTGACGA TCCACAACGC GACCCGGCCG
GAGCAGGCCG ATGCCCGGCT GGCCTATGAC GCGCTGGAAC AGGCGCGGGT CGAAAGCTTC
GGCGCGCGCC ACATGGCGGG AGTCGCCGCC AATCTGCGGA ATCAGGCCGA GCGGGACTAT
CACGACAGGG GTTATGACCG GGCGCAGGCG CGCGACCAGA TTCCCGTGCA GGTTGCGCTG
TCGCTGCTGG CGCGCGAACG CATGACGGGC GAGCCCGTGC CCGAGAGCAT GCGCGCGATG
GCCGAACAGT GGCGTGCGCA TCTGGGCCCC TCGGCACTGC GCGCGCTGGA CGACATGGCC
GCCCATCAGG ACGACCAGAT GGCGTTTTCG CGTGCCGCGA AGCGGCTGCT GGTCGCCTGC
GAACTGATCG AGGGCGAGGC CGAGATCGAG GAGGACGAGG ACGGCGACGA CAGCGCCCCC
TCGGACGAGA CCGAGGAAGA ACCGGGCGAA GCGCCCGAGA AGCCGCAGCC GCAGGACGAG
GACGCCAGCG GCCAGCAGGA AGACGAGACC GGCCTGCAGC CCCAGTTGGC GCAAGGTGCG
GGAGCCGGCG ACGACAACCC CGACGAGTCC GAACCCGGCG GTACAGCGGG GTCCGAGGAA
GCGGGCGGCC CACGCGGCAC GGACGATCAG GAAGCGACCG ATCCGGCGTC CCTGTATCAT
GCCTTCACCA CCGCGTTCGA CGAGGAAATC GCGGCCGAGG ATCTGTGCGA CGCCGACGAA
CTGGCCCGCC TGCGCCAGCA GTTGGACCAC CAGTTGCTCA GCCTGCAGGG CGTGGTGTCG
CGCCTGGCCA ACCGGCTGCA ACGACGCCTG CTGGCACAGC AGACGCGGGC GTGGGAGTTC
GACCTGGAGG AAGGCATCCT GGATGCCGGC CGGCTGTCGC GGGTGGTGGT CAACCCGACG
CTGTCGCTGT CCTACAAGCA CGAACGCGAC ACCGATTTCC GCGACACGGT CGTGACCCTG
CTGATCGACA ATTCCGGATC GATGCGCGGC CGGCCGATTT CGGTGGCCGC GATGTGCGGC
GACATCCTGG CCCGCACGCT GGAACGCTGC GCGGTGAAGG TCGAGGTCCT GGGCTTCACC
ACCCGGGCCT GGAAGGGCGG GCAGAGCCGC GAGCGCTGGG TGGCGCAGGG CAAGCCGGCC
AATCCGGGGC GGCTGAACGA TCTGCGGCAC ATCATCTACA AATCGGCGGA CATGCCGTGG
CGCCGGGCGC GGAAGAATCT GGGCCTGATG CTGCGCGAGG GGCTGCTGAA GGAAAATATC
GACGGCGAGG CCCTGCTGTG GGCCTGGCGG CGCCTGCAGG GCCGGCCGGA AAGCCGGAAG
ATCCTGATGG TGATCTCGGA CGGCGCGCCG GTGGATGACA GCACGCTGTC GGTCAATGCC
GGGTCGTATC TGGAAACGCA CCTGCGGCAG GTGATCGCCC AGATCGAAAA CCGCAGCGGC
GTCGAACTGG TGGCCATCGG GATCGGCCAT GACGTGACGC GCTATTACCG CCGCGCGGTC
ACGATCTCCG ACGCCGAGGA ACTGGGCGGC ACGATGATGC AGAAGCTCTC CGAACTTTTC
GATGAAAAGG TCGCTGTCGC GGGTCGCCGC CGAATCGCCT GA
 
Protein sequence
MRDRKDTTQS ARLAAAERAD VFKRATVGAL RALGGRATAE VTFQTGPIPP AAAVSGDHVR 
LPQPALQLAE ADIRRVRGAA DAVALQLRHH DVTIHNATRP EQADARLAYD ALEQARVESF
GARHMAGVAA NLRNQAERDY HDRGYDRAQA RDQIPVQVAL SLLARERMTG EPVPESMRAM
AEQWRAHLGP SALRALDDMA AHQDDQMAFS RAAKRLLVAC ELIEGEAEIE EDEDGDDSAP
SDETEEEPGE APEKPQPQDE DASGQQEDET GLQPQLAQGA GAGDDNPDES EPGGTAGSEE
AGGPRGTDDQ EATDPASLYH AFTTAFDEEI AAEDLCDADE LARLRQQLDH QLLSLQGVVS
RLANRLQRRL LAQQTRAWEF DLEEGILDAG RLSRVVVNPT LSLSYKHERD TDFRDTVVTL
LIDNSGSMRG RPISVAAMCG DILARTLERC AVKVEVLGFT TRAWKGGQSR ERWVAQGKPA
NPGRLNDLRH IIYKSADMPW RRARKNLGLM LREGLLKENI DGEALLWAWR RLQGRPESRK
ILMVISDGAP VDDSTLSVNA GSYLETHLRQ VIAQIENRSG VELVAIGIGH DVTRYYRRAV
TISDAEELGG TMMQKLSELF DEKVAVAGRR RIA
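
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they might be processed, the code below strips the block formatting, computes the GC fraction, and translates a coding sequence. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which is assumed not to matter here because this gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order (shared by tables 1 and 11).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = clean_sequence("ATGCGCGACC GCAAGGACAC CACCCAGTCC ")
print(translate(fragment))  # MRDRKDTTQS
```

Applying `gc_content` and `translate` to the full cleaned gene sequence allows a comparison with the GC content field and the protein sequence listed in the record.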