Gene Cthe_1384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1384 
Symbol 
ID4809379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1689412 
End bp1690698 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content39% 
IMG OID640106808 
ProductFolC bifunctional protein 
Protein accessionYP_001037809 
Protein GI125973899 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTATG AACAGGCGTT GGAATATATA CACGGTACCC ATAAATTCGG AATAAAGCTT 
GGCCTTGACA ATATTCGAAA ACTTCTTGAA TTCATGGACA ATCCCCACAG GAAACTGAAA
TACGTACATG TTGCCGGAAC GAACGGAAAA GGCTCAACGG TTGCGTTTAT AAGCAGTATA
CTGAAAGAAT CCGGCTACAA AGTTGGTATA TACACCTCGC CGTACATTGA ACGTTTTACC
GAAAGAATTA AAATCAACAA TGATGAAATT TCAAAAGAAG ATTTAGCAAG AATTACCGAG
TATGTAAAAG GAAAAGTTGA ACTGATGATT TCACAGGGAG AAAATCATCC TACGGAATTT
GAAATAGTAA CGGCAATCGC TTTTCAATAT TTTTATGAAA AGGATTGCGA CATTGTTGTG
CTGGAGGTGG GACTTGGAGG CAGGTTTGAT TCCACCAATG TGATAGATTC CCCTTTGGCT
GCGGTTATAA CCACTATAAG TTACGACCAT ATGGCACAGC TGGGAGACAC TTTGGATAAA
ATAGCTTTTG AAAAGGCGGG AATCATAAAG AGAGGCACCG ATGTGGTGTT GTATCCCCAG
ACCCGGGAAG CAGACAAGGT TTTTGAAGAG GTATGTCATG AAAAAGGTGC CCGACTTCAC
AAATTGTCTT TTCATTCCAT TAACATAAAA AAATTTTCAC CAGACGGGCA GGAATTTGAT
TATGGCGAGT TTAAATCTTT AAAAATAGGA CTTTTGGGAG AGCATCAGGT AAAAAATGCG
GTGGTGGCTC TTGAAACGGC TTTGATTTTG GCACGTAAGG GTTATGACAG AATTTGCGAA
AGTTCTGTAA GAAAAGGACT GGCTGATGCA AAATGGCCCG GAAGACTTGA AATTCTTAAA
AAAGAACCGA TATTTTTGAT AGACGGTGCT CACAATGCTG AAGGTGCAAA AACTCTTTCC
GAGTTTTTAA AAACATATTT TCCCGGCAAA AAAATAGTAT TTATTATAGG TGTTTTAAAA
GATAAAGATT TTAAGTCAAT GATTGAGGTA TGTGCACCCC TTGCGGAGGA TATAATTACT
GTGACACCAA ACAGCGACAG AGCTTTGCCG GCTGAGACAC TGGCGCAGAA TCTCGAAAAC
TATTGTAAAA ATGTATCAAT AAGTGATACA ATTGTAAATG CGGTGGAAAA AAGTCTTAAG
ATTGCTCCAA AGGACGGAGT GATTTGTGCC TTTGGTTCGC TGTATTATAT TGGTGAGATA
CGAAGTGTGC TGATGAATCA TAAATGA
 
Protein sequence
MNYEQALEYI HGTHKFGIKL GLDNIRKLLE FMDNPHRKLK YVHVAGTNGK GSTVAFISSI 
LKESGYKVGI YTSPYIERFT ERIKINNDEI SKEDLARITE YVKGKVELMI SQGENHPTEF
EIVTAIAFQY FYEKDCDIVV LEVGLGGRFD STNVIDSPLA AVITTISYDH MAQLGDTLDK
IAFEKAGIIK RGTDVVLYPQ TREADKVFEE VCHEKGARLH KLSFHSINIK KFSPDGQEFD
YGEFKSLKIG LLGEHQVKNA VVALETALIL ARKGYDRICE SSVRKGLADA KWPGRLEILK
KEPIFLIDGA HNAEGAKTLS EFLKTYFPGK KIVFIIGVLK DKDFKSMIEV CAPLAEDIIT
VTPNSDRALP AETLAQNLEN YCKNVSISDT IVNAVEKSLK IAPKDGVICA FGSLYYIGEI
RSVLMNHK