Gene Athe_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1199 
Symbol 
ID7409673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1291388 
End bp1292863 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content34% 
IMG OID643715564 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_002573072 
Protein GI222529190 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAGTTG GAAAAGTATA TATTGTTGGT GCCGGTCCGT ACATGGAGGA CTTGATAACA 
GTAAAGGGTC TGAATGCTAT AAAAACTGCT GATGTAATAA TTTACGACAG GTTAATAAAC
AAAAATTTGT TGAAGCTTGC AAAAGATGAT GCAATTTATA TCTACTGTGG TAAAGAACCA
AAAAAACATG TACTTTCTCA GGACAGAATT ACAAACTTAA TGATAGAATA TGCAAAAAAA
GGTTTTAATG TTGTTCGGCT AAAAGGTGGA GACCCATATA TATTTGGAAG GGGAGCAGAA
GAGGCAGAAA AACTATTTAA AAATAACATA CCTTTTGAAA TTATTCCTGG TGTTAGTTCA
TTTTACTCTG CTTTAACCTT TGCGGGTATT CCAATCACGT ATAGAAAGCT TGCACGTGAA
TTTCATGTGT TTACTGGGCA CACGTGCAAC GATGAAGAGC TGAATTGGAA CATAATTTCT
AAACTTGATG GCACTTTAAT ATTTCTCATG TCTGCCGAAA ATGTTGAAAG TATCTCACAA
AAACTGATTT TACACGGCAA ACAACCTAAA ACCCTTGCTG CAGCAGTAAT TAATGCAACA
ACTGGTCGGC AAAAAGTAAT TAGCGGGTAT TTAGAAGACT TTGCAACTGG AAAGTTTAAA
AATCAAATTA CATCGCCGAT GGTATTTGTA ATAGGTGAGG TTATAAAATT TAGAAATAAG
CTTTCTTTTC ATGAGAGTTT GCCTTTATTT GGTAAGAGAA TTTGTATCAC ACGTCCAAAA
AGTGTCTCAA GGAATATAAA AAGTTTGCTA TTCAGTCTTG GTGCGGACGT TGTTGATGGT
TGCTGCTCAA AACTGGTCCT CCATAGGGAG GAGATTGATA AAATTTTAAA CTCTTTGCCA
GAGTATGATA TATTAGTGTT TACAAGTGTA AATGGTGTTG ATAGTTTTTT TGACTACTTG
ACTGAAAAAA ATATAGATGT GAGAAATATT AAAGGTGACT TTGCTGCAAT AGGGAAAAAA
ACTGCACTTT CACTCCAAAA GAGAGGATTT GGGGTAAAAT ATATTCCAGA TGAACATTCA
TCAGATGGCT TAATCAAGAT TTTCGAGAAT GAAGTTGATA AAAGCAAAAA GATATTGACT
GTGCAGTCAA AAAATGCAGG AGATTACCTC AAAAATTCCC TTGAAAGCTT GGGATTTGAG
GTTGATACCA TCTTTGCATA TTCAATGGAA TTTACTAAAA ACCCAAATGA TGCAGTTTAC
GATTCAGATA TATTTGTATT TACAAGCTCA GGAATGTTTA GACACTTTAT AGAATGCTAT
GGGGTTGAAG TGCTTTCTAA TAAAATAGTT ATTTCAATTG GTGAGCATAC CCAAAAGACA
TTGGAAAGTT TCGGTATTAA AAGTATCATT AGTAATGAGG CAACCGATGA GGGAATAGTA
AATAAGATTT TAGAGGTGGT CAAAAATGGA GTTTAA
 
Protein sequence
MKVGKVYIVG AGPYMEDLIT VKGLNAIKTA DVIIYDRLIN KNLLKLAKDD AIYIYCGKEP 
KKHVLSQDRI TNLMIEYAKK GFNVVRLKGG DPYIFGRGAE EAEKLFKNNI PFEIIPGVSS
FYSALTFAGI PITYRKLARE FHVFTGHTCN DEELNWNIIS KLDGTLIFLM SAENVESISQ
KLILHGKQPK TLAAAVINAT TGRQKVISGY LEDFATGKFK NQITSPMVFV IGEVIKFRNK
LSFHESLPLF GKRICITRPK SVSRNIKSLL FSLGADVVDG CCSKLVLHRE EIDKILNSLP
EYDILVFTSV NGVDSFFDYL TEKNIDVRNI KGDFAAIGKK TALSLQKRGF GVKYIPDEHS
SDGLIKIFEN EVDKSKKILT VQSKNAGDYL KNSLESLGFE VDTIFAYSME FTKNPNDAVY
DSDIFVFTSS GMFRHFIECY GVEVLSNKIV ISIGEHTQKT LESFGIKSII SNEATDEGIV
NKILEVVKNG V