Gene Cthe_2581 details


Gene Information

Locus tag: Cthe_2581
Symbol:
ID: 4809188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3054052
End bp: 3055233
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 46%
IMG OID: 640107995
Product: dihydropteroate synthase
Protein accession: YP_001038974
Protein GI: 125975064
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0294] Dihydropteroate synthase and related enzymes
TIGRFAM ID: [TIGR01496] dihydropteroate synthase
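
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3054052, 3055233)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1182 393"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields listed above.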


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAACG CAAGAATTGT CTACATAAAT GATATGCATG AAGCAAAAGA AGAAATCCGC 
AAAATCGGTG TGGATGCGTC AGCCATAACA TGGCTTTCAC CCAAAGCATT GTCCATTGCA
ATAAAGCTTG AGAATGTAAG CTCTTATGAG GCAAACATAC TCAAGCAGGA AATGCTGGCC
AAGGGCGGAG ATGCCGCCGT AAACAGAGGT GTGGCAAATT TCAGCACGGA AAGCTCCGAT
GTTCTCCTGA TGGGCACATA CAGCCAGTTT AACAAACTTG TGTACAAGCT TCGCATGCAC
GGCGGAAGTT TCAAGGAAAT TACCGATGAA ATCCAAAGGC TTCTGGAAGG TTTGGAGAAG
GGAAAGCCGG AGTATTTTGA ATGCGGCAAG TATAGACTGC CCATAGGAGA AAAAACTTAT
GTGATGGGAA TACTCAATGT TACACCGGAT TCTTTCTCCG ACGGTGGAAA ATATCTTGAT
ATTGACAGTG CGGTAAAAAG AGCCAGGGAA ATGGTGGATG AAGGCGCTGA CATAATAGAT
GTGGGAGGAG AGTCGACGAG GCCCGGGCAT CAGCCCGTTG ATGCCCTGGA GGAAATAAAC
CGGGTGATAC CGGTTATAGA AAGGCTTTCA AAGGAGTTGA ATGTGCCCAT ATCGGTTGAC
ACCAGCAAGG CTCAGGTTGC AGAAAAGGCT CTTTGTGCGG GTGCCTGCAT TGTAAATGAC
GTTTGGGGCC TCCAGAGGGA CCCGGACATG GCGGAGGTTG TATCAAAGCA CGGTGCAGGA
GTAATTATGA TGCACAACAG TGACACCAAA GAATACAAGG ACCTAATGGG TGACATTATA
AGGTTTTTGA GAAAGAGTGT TGAAATAGCC GAAAAGGCCG GAATTACCAG GGAAAATATG
GTTATTGACC CCGGAATAGG CTTTGGAAAG ACTTTGGAGC ACAACCTGGA AGTAATGAGA
AGAATGAGGG AACTAAACAC CCTAAACCTT CCGGTTCTTC TTGGGACATC AAGAAAGTCC
ATGATAGGAA ATGTTTTGGA TTTGCCTGTA AATGAAAGGC TTGAAGGGAC TGCCGCAACC
ATTACCCTTG GTATTGCCAA CGGGGCGGAT ATAGTGAGAG TCCACGATGT AAAGGAAATG
GTGCGGGTGG CAAGGATGAC CGATGCTATG GTAAGAGTTT GA
 
Protein sequence
MINARIVYIN DMHEAKEEIR KIGVDASAIT WLSPKALSIA IKLENVSSYE ANILKQEMLA 
KGGDAAVNRG VANFSTESSD VLLMGTYSQF NKLVYKLRMH GGSFKEITDE IQRLLEGLEK
GKPEYFECGK YRLPIGEKTY VMGILNVTPD SFSDGGKYLD IDSAVKRARE MVDEGADIID
VGGESTRPGH QPVDALEEIN RVIPVIERLS KELNVPISVD TSKAQVAEKA LCAGACIVND
VWGLQRDPDM AEVVSKHGAG VIMMHNSDTK EYKDLMGDII RFLRKSVEIA EKAGITRENM
VIDPGIGFGK TLEHNLEVMR RMRELNTLNL PVLLGTSRKS MIGNVLDLPV NERLEGTAAT
ITLGIANGAD IVRVHDVKEM VRVARMTDAM VRV
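
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the method used by the database, of how one might strip the layout, compute GC content, and translate the gene sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not special-cased here; this gene
    begins with ATG, so that simplification does not matter for it.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGATAAACG CAAGAATTGT CTACATAAAT"))  # prints "MINARIVYIN"
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field of the record.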