Gene Information |
Locus tag | Tmz1t_0221 |
Symbol | |
ID | 7084342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 254000 |
End bp | 255868 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697263 |
Product | Cobaltochelatase |
Protein accession | YP_002353912 |
Protein GI | 217968678 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase) |
TIGRFAM ID | [TIGR01651] cobaltochelatase, CobT subunit |
|
|
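The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the gene length counts the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 254000
END_BP = 255868
GENE_LENGTH_BP = 1869
PROTEIN_LENGTH_AA = 622


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree with each other: the span covers 1869 positions, which is 623 codons, one of which is the stop codon.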
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.435346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGCG AACGCCCCCG CTCCCGGCCC CTGCGCGGAG TCGCGAACAC GGCCGTCTTT CTGCGGGCGG TCGACAGCAC CCTGCGTGCG ATCGCCGGGT GCGCCGGCGG CGCGTCCGCC CCGTCCGGCG TCGACGGCAT CGAAACCGCT GCACTGCCGC CCGCCCTGAC GACGGCGGAG ATCGTCCGCC TGCGCGGCGA GGCCGACGCG CAGGCCCTGC GCCTGCGCCA TCACGACGAG CGCCTCCATG CCCTGCACGA TCTGCAGGGC GAGCGCGCAC GCGCGGCCTA CGATGCGATC GCACAGGCAC GCGTCGAGGC CTTGGGGGCG GGCCGGATGC CGGGCGTGGC GGCCAACATC GCCGCGCTCA TCGAGCAGCG CTGCCGGGAC AGGCGGCTCG ACCGCGCGAC CAGCCGGGAA CAGGTTGCGC TCGCAGAGGC CCTGCACGTC CTCGCCCGCG AACGCTTCTG CGGCGCGCCT CCCCCGCCGG CGGCGCGGAC CATGGCCGAG CTGTGGCGAC CCTACTTCGA CGCCCACGTC GGCCCGCGCC TCGACGGCCT ACGCGAGCAC CTGCGCGACG AACGCACGTT CGCCGCCGGA TTGCATGAAC TGATCGAGGC GCTCGACCTG GAAAACCCCA CCAGCGAAGC CCGCGAGCCG CGCAAGCTGG CCCGCAGCGA GGGCGAATCC GGAGACGAGG ACGCTGCGGA GCAGGCGCGG GCGGGCGGCG CGGCTGCGAG GGACGCGCAG CCCTCGCTCC GCCACGCCTC GTCCGGTCGG AACGAGGGCG GGCGGGCAGC GCAGGCAGAC TTGGCCGCCG CGCCGGCACG GACTGGCGCG GCAGGAGGGG ACGCAGTCCG CGACGCCGCG TCTGCGCGGG GGCCCAGCGC GTCGTACCGT GCCTACACCA CCGCCTTCGA CCAGACCGTG GGCGCGGAAG AGCTCTGCGA CGCGTCGGAA CTGCACCCGC TGCGCGCCCG CCTGGACCGC GAGCTGATGC GTCTCCCGGA ACTCGTCGGG CGACTCGCCA AGCGGTTGCA GCGACGCCTG CTGGCGAGCC AGCTGCGCGA CTGGGCCTTC GATCTGGAGG AAGGCTGGCT CGATCCCGGG CGACTCGACC GCATCGTCGT CAGTCCCGAT CATGGCCTGT CGTTCAGGAT GGAGAAGGCT TCCGGCTTTC GCGATACCGT GGTCGGCATG CTGATCGACA ACTCCAGCTC GATGCGCGGC CGTCCGCTCG CCGTTGCGGC GATGAGCGCC GACATCCTCG CCCGTACGCT GGAGCGCTGC GGCATCAAGG TCGAGATCCT CGGCTTCACC ACGCGCACCT GGAAGGGCGG CCGTGCCCGC GAGCAGTGGC GGCGCGAGGG CGAACCCGTC GCTCCGGGGC GGCTCGCCGA GCTGCGTCAC ATCGTATACA AGGCTGCGGA CACGCCCTGG CGGCGTGCGC GCAAGAACCT GGGCCTGATG CTGCGTGACG GCATCCCCAA GGAGAACATC GACGGCGAAG CCCTGCTGTG GGCCCACCGC CGCCTCGTCG CCCGCCCCGA GCAGCGTCGC ATCCTGGTCG TCGTGTCGGA TGGCGCGCCT GCCGACGAGG CCACGCTTGC GGCCAACCCG GGCGATTATC TGGAACGTCA CCTCCACGAG GTGATCGCCT GGATCGAGCG GCGCTCGCCC GTCGAGCTGC TTGCCATCGG CATCGGCCAC GACGTGACCC GCCATTACCG CCGCGCGGTC ACCCTGCGGG ATGCCGACGC GCTCGGCGCA GCCCTGCTCG AGCAGCTCGC GCGGCTGTTC 
GACGAGCGGC CCCGAGCGCC CGCGGGAGCG AGGATGCGCC CGCGGGCCAG CGAAACGGCT CCCCGATGA
|
Protein sequence | MTSERPRSRP LRGVANTAVF LRAVDSTLRA IAGCAGGASA PSGVDGIETA ALPPALTTAE IVRLRGEADA QALRLRHHDE RLHALHDLQG ERARAAYDAI AQARVEALGA GRMPGVAANI AALIEQRCRD RRLDRATSRE QVALAEALHV LARERFCGAP PPPAARTMAE LWRPYFDAHV GPRLDGLREH LRDERTFAAG LHELIEALDL ENPTSEAREP RKLARSEGES GDEDAAEQAR AGGAAARDAQ PSLRHASSGR NEGGRAAQAD LAAAPARTGA AGGDAVRDAA SARGPSASYR AYTTAFDQTV GAEELCDASE LHPLRARLDR ELMRLPELVG RLAKRLQRRL LASQLRDWAF DLEEGWLDPG RLDRIVVSPD HGLSFRMEKA SGFRDTVVGM LIDNSSSMRG RPLAVAAMSA DILARTLERC GIKVEILGFT TRTWKGGRAR EQWRREGEPV APGRLAELRH IVYKAADTPW RRARKNLGLM LRDGIPKENI DGEALLWAHR RLVARPEQRR ILVVVSDGAP ADEATLAANP GDYLERHLHE VIAWIERRSP VELLAIGIGH DVTRHYRRAV TLRDADALGA ALLEQLARLF DERPRAPAGA RMRPRASETA PR
|
| |
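The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first 30 bases and the last 9 bases of the record are used here, for brevity.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, in NCBI TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence; '*' marks a stop codon."""
    seq = clean(seq)
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


if __name__ == "__main__":
    first_30 = "ATGACAAGCG AACGCCCCCG CTCCCGGCCC"  # start of the gene sequence
    last_9 = "CCCCGATGA"                           # end of the gene sequence
    print(translate(first_30))
    print(translate(last_9))
    print(round(gc_content(first_30) * 100))
```

To check the whole record, pass the full gene sequence to `gc_content` and `translate` and compare the results with the GC content and protein sequence fields, dropping the trailing `*` from the translation.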