Gene Tmz1t_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0221 
Symbol 
ID7084342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp254000 
End bp255868 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content73% 
IMG OID643697263 
ProductCobaltochelatase 
Protein accessionYP_002353912 
Protein GI217968678 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) 
TIGRFAM ID[TIGR01651] cobaltochelatase, CobT subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.435346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCG AACGCCCCCG CTCCCGGCCC CTGCGCGGAG TCGCGAACAC GGCCGTCTTT 
CTGCGGGCGG TCGACAGCAC CCTGCGTGCG ATCGCCGGGT GCGCCGGCGG CGCGTCCGCC
CCGTCCGGCG TCGACGGCAT CGAAACCGCT GCACTGCCGC CCGCCCTGAC GACGGCGGAG
ATCGTCCGCC TGCGCGGCGA GGCCGACGCG CAGGCCCTGC GCCTGCGCCA TCACGACGAG
CGCCTCCATG CCCTGCACGA TCTGCAGGGC GAGCGCGCAC GCGCGGCCTA CGATGCGATC
GCACAGGCAC GCGTCGAGGC CTTGGGGGCG GGCCGGATGC CGGGCGTGGC GGCCAACATC
GCCGCGCTCA TCGAGCAGCG CTGCCGGGAC AGGCGGCTCG ACCGCGCGAC CAGCCGGGAA
CAGGTTGCGC TCGCAGAGGC CCTGCACGTC CTCGCCCGCG AACGCTTCTG CGGCGCGCCT
CCCCCGCCGG CGGCGCGGAC CATGGCCGAG CTGTGGCGAC CCTACTTCGA CGCCCACGTC
GGCCCGCGCC TCGACGGCCT ACGCGAGCAC CTGCGCGACG AACGCACGTT CGCCGCCGGA
TTGCATGAAC TGATCGAGGC GCTCGACCTG GAAAACCCCA CCAGCGAAGC CCGCGAGCCG
CGCAAGCTGG CCCGCAGCGA GGGCGAATCC GGAGACGAGG ACGCTGCGGA GCAGGCGCGG
GCGGGCGGCG CGGCTGCGAG GGACGCGCAG CCCTCGCTCC GCCACGCCTC GTCCGGTCGG
AACGAGGGCG GGCGGGCAGC GCAGGCAGAC TTGGCCGCCG CGCCGGCACG GACTGGCGCG
GCAGGAGGGG ACGCAGTCCG CGACGCCGCG TCTGCGCGGG GGCCCAGCGC GTCGTACCGT
GCCTACACCA CCGCCTTCGA CCAGACCGTG GGCGCGGAAG AGCTCTGCGA CGCGTCGGAA
CTGCACCCGC TGCGCGCCCG CCTGGACCGC GAGCTGATGC GTCTCCCGGA ACTCGTCGGG
CGACTCGCCA AGCGGTTGCA GCGACGCCTG CTGGCGAGCC AGCTGCGCGA CTGGGCCTTC
GATCTGGAGG AAGGCTGGCT CGATCCCGGG CGACTCGACC GCATCGTCGT CAGTCCCGAT
CATGGCCTGT CGTTCAGGAT GGAGAAGGCT TCCGGCTTTC GCGATACCGT GGTCGGCATG
CTGATCGACA ACTCCAGCTC GATGCGCGGC CGTCCGCTCG CCGTTGCGGC GATGAGCGCC
GACATCCTCG CCCGTACGCT GGAGCGCTGC GGCATCAAGG TCGAGATCCT CGGCTTCACC
ACGCGCACCT GGAAGGGCGG CCGTGCCCGC GAGCAGTGGC GGCGCGAGGG CGAACCCGTC
GCTCCGGGGC GGCTCGCCGA GCTGCGTCAC ATCGTATACA AGGCTGCGGA CACGCCCTGG
CGGCGTGCGC GCAAGAACCT GGGCCTGATG CTGCGTGACG GCATCCCCAA GGAGAACATC
GACGGCGAAG CCCTGCTGTG GGCCCACCGC CGCCTCGTCG CCCGCCCCGA GCAGCGTCGC
ATCCTGGTCG TCGTGTCGGA TGGCGCGCCT GCCGACGAGG CCACGCTTGC GGCCAACCCG
GGCGATTATC TGGAACGTCA CCTCCACGAG GTGATCGCCT GGATCGAGCG GCGCTCGCCC
GTCGAGCTGC TTGCCATCGG CATCGGCCAC GACGTGACCC GCCATTACCG CCGCGCGGTC
ACCCTGCGGG ATGCCGACGC GCTCGGCGCA GCCCTGCTCG AGCAGCTCGC GCGGCTGTTC
GACGAGCGGC CCCGAGCGCC CGCGGGAGCG AGGATGCGCC CGCGGGCCAG CGAAACGGCT
CCCCGATGA
 
Protein sequence
MTSERPRSRP LRGVANTAVF LRAVDSTLRA IAGCAGGASA PSGVDGIETA ALPPALTTAE 
IVRLRGEADA QALRLRHHDE RLHALHDLQG ERARAAYDAI AQARVEALGA GRMPGVAANI
AALIEQRCRD RRLDRATSRE QVALAEALHV LARERFCGAP PPPAARTMAE LWRPYFDAHV
GPRLDGLREH LRDERTFAAG LHELIEALDL ENPTSEAREP RKLARSEGES GDEDAAEQAR
AGGAAARDAQ PSLRHASSGR NEGGRAAQAD LAAAPARTGA AGGDAVRDAA SARGPSASYR
AYTTAFDQTV GAEELCDASE LHPLRARLDR ELMRLPELVG RLAKRLQRRL LASQLRDWAF
DLEEGWLDPG RLDRIVVSPD HGLSFRMEKA SGFRDTVVGM LIDNSSSMRG RPLAVAAMSA
DILARTLERC GIKVEILGFT TRTWKGGRAR EQWRREGEPV APGRLAELRH IVYKAADTPW
RRARKNLGLM LRDGIPKENI DGEALLWAHR RLVARPEQRR ILVVVSDGAP ADEATLAANP
GDYLERHLHE VIAWIERRSP VELLAIGIGH DVTRHYRRAV TLRDADALGA ALLEQLARLF
DERPRAPAGA RMRPRASETA PR