Gene Cthe_2528 details

Gene Information

Locus tag: Cthe_2528
Symbol:
ID: 4809284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2996125
End bp: 2997606
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 41%
IMG OID: 640107944
Product: uroporphyrinogen-III synthase / uroporphyrinogen-III C-methyltransferase
Protein accession: YP_001038923
Protein GI: 125975013
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
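
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2996125
END_BP = 2997606
REPORTED_GENE_LENGTH_BP = 1482
REPORTED_PROTEIN_LENGTH_AA = 493


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)

# Both comparisons are True when the record is internally consistent.
print(gene_len == REPORTED_GENE_LENGTH_BP)
print(protein_len == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions, 2997606 - 2996125 + 1 gives 1482 bp, and 1482 / 3 - 1 gives 493 residues, which agrees with the reported values.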


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.172889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACG GCAAAGGAAA GGTTTACCTG GTGGGAGCGG GACCGGGAGA TGAGGAACTT 
CTTACTGTTA AGGCAGCCGA TGTTATAAAA AAGGCGGATG TCATAGTTTA TGACAGACTT
ATAAGCGATG GTATTCTTTC AAAAATTCCC GACAATGCGG AGAAAATCTA TGTTGGAAAA
AATGCCGGCA ATCATCCGGT TCCTCAGCAT GAAATAAATA AAATCCTTCT GGCCAAAGCA
CTGGAAGCTA AAATTGTTGT TAGGCTTAAA GGCGGCGATC CCTTTGTATT TGGCAGAGGG
GGAGAGGAGC TGGAGTTCTT GCATCAAAAC GGCATACCTT TTGAGGTAGT GCCGGGCATT
ACTTCGGCAA TAGCCGCAGC AGCATATGCA GGCATTCCTG TCACCCATAG AGATTTTTGC
TCGTCGCTGC ATATTATAAC CGGACATGTA AAAGACGGCG GAAATCCGGA CATCGACTAT
GAATCTTTAG TGAAGCTAAA AGGAACTCTC ATCTTTATGA TGTCGGTGGC CTCATTTCCG
CAAATAGCAC AGGGCCTTAT AGAAGCCGGC ATGGAATCCG ATATGGATGC GGCAGTGATA
GAAAACGGCA CAATGCCAAA CCAAAGAAAA TTTGTTTCAA AGCTTTGTGA CATCCCGAAA
GTTATCGCAG AAAATCAGGT TAAATCCCCT GCAATAATCA TAGTCGGAAA GGTATGTTCC
CTTTCTGAAA AGCTTGATTG GTTTTCAAAA TTGCCTCTTT ACGGATGCAA TGTATTGGTA
ACAGGCCCAA AGAAAACATC GGAGAGATTG GCAGGCAAAC TCCGAGGGCT CGGAGCACAT
GTAGTTGAAT ATCCTTGCAT AGAAACAGAA CCACTCGACT TTGACATTGA TATACGAGAG
CATAGCTGGA TAATTTTCAC AAGCAGCCTG GGTGTAACAA TATTCTTTAA AAAACTGTAT
GAAAACAAGC TTGACAGCAG ATATTTGTAC AATAAAAAAA TTGCAGCGGT AGGCTCCCAG
ACCGCCGAAG AACTGTTAAA ACACGGAATA TGCGCCGACT TTGTTCCCGG TAAATTTGAC
GGCAGACATT TAGCGGAAGA GCTTTTGCAA AGCGGCAAAA TAAGCCCTGA AGACAAAGTG
GTTATTTTCA GGGCAAAGGA CGGAACCCAG GCCATTGTAG ATATTTTCGA AAACAATAAT
ATACACTACA CTGATATCTC TGTATATGAA ACTAAATGCA TAGAAAATGA GAGAATTGAC
ATAAGTGGTT TTAATTATAT TACTTTTACC AGCGAAAGCT GCGTAAAAGG CTTTGCCCAC
TCCTTTAAAA ACACAATAGA TTATTCAAAA ATCAATGCAA TATGCATTGG AGAACAAACA
GCCAAAGCTA CAAGGGCTTA TGGAATGAAC ACAATTATTT CCGATGTTGC AACGGTTTCT
TCAATGGTTG AAAAAATTAT TGAAATGCAT TGCGCTAAAT AA
 
Protein sequence
MADGKGKVYL VGAGPGDEEL LTVKAADVIK KADVIVYDRL ISDGILSKIP DNAEKIYVGK 
NAGNHPVPQH EINKILLAKA LEAKIVVRLK GGDPFVFGRG GEELEFLHQN GIPFEVVPGI
TSAIAAAAYA GIPVTHRDFC SSLHIITGHV KDGGNPDIDY ESLVKLKGTL IFMMSVASFP
QIAQGLIEAG MESDMDAAVI ENGTMPNQRK FVSKLCDIPK VIAENQVKSP AIIIVGKVCS
LSEKLDWFSK LPLYGCNVLV TGPKKTSERL AGKLRGLGAH VVEYPCIETE PLDFDIDIRE
HSWIIFTSSL GVTIFFKKLY ENKLDSRYLY NKKIAAVGSQ TAEELLKHGI CADFVPGKFD
GRHLAEELLQ SGKISPEDKV VIFRAKDGTQ AIVDIFENNN IHYTDISVYE TKCIENERID
ISGFNYITFT SESCVKGFAH SFKNTIDYSK INAICIGEQT AKATRAYGMN TIISDVATVS
SMVEKIIEMH CAK
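
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below is one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). All function names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGCTGACG GCAAAGGAAA GGTTTACCTG GTGGGAGCGG GACCGGGAGA TGAGGAACTT"
dna = clean_sequence(first_line)

print(translate(dna))  # MADGKGKVYLVGAGPGDEEL
print(gc_fraction(dna))
```

The translated fragment matches the first 20 residues of the protein sequence shown above. Running `gc_fraction` on the full cleaned gene sequence is how the reported GC content could be checked.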