Gene Cthe_2787 details

Gene Information

Locus tag: Cthe_2787
Symbol:
ID: 4810104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3287171
End bp: 3288847
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 44%
IMG OID: 640108207
Product: ferredoxin
Protein accession: YP_001039179
Protein GI: 125975269
COG category: [R] General function prediction only
COG ID: [COG3894] Uncharacterized metal-binding protein
TIGRFAM ID:
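
The length fields above are consistent with the coordinates if Start bp and End bp are 1-based and inclusive, and if the protein length excludes the stop codon. Both points are assumptions, not statements made by the record. A minimal Python check under those assumptions (variable names are illustrative):

```python
# Values copied from the Gene Information fields above.
start_bp = 3287171
end_bp = 3288847

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1677 bp
print(protein_length, "aa")  # 558 aa
```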


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.978566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAAG TAGTGTTTTA CCCGCAAAAC AAGTCCATTA ATGTAGAAGA AGGAACCACC 
ATTCTTCAGG CGGCCCGCAG TGCAGGAGTG ATAATAGAGT CCCCGTGCAA CGGTACAGGA
ACTTGCGGAA AATGCAAAGT AAGACTGGAT GAGAAATCTT TGCCAAATGT CCTGGCAAAA
AGCAGGCATT ACCTTTCCAA AGAGGAAGAG GAGCAAGGGT ATGTACTGGC CTGCGAAACG
CAAATAACCG GGGACATCAA GGTTGAACTT GGCGAAAACA AGCAAAATGG CACTCTCAAA
ATATTAAGCA GGGGTCACAG TTTTAATATA GATTTGAAGC CTTTTATAAG AAAGGAATAC
TTCGTCCACG AAGACGTTAC AAAAGTGTTT GCCGGAAAAG AACAATTGGG AATTGAAGCA
GGAGATACAA CAAAAGAAAA CTATGGAGCC GTCGTTGACA TAGGAACTAC AACCTTGGTG
GCCTCCATTG TAAATTTAAA CAACGGGGAC GAAATAGGTA CTTCTTCGGC ATTAAATCCT
CAGGCCGTCC ATGCCCAGGA TGTGTTGTCG AGAATCAAGT TTTCATCCGA TGCCGATGGG
CTTAAAGTTA TGCATAGTGA ACTGACAGAC AAAATTAACA GCATGATTGG TAAAATAGCT
TTAAGGGCCG GTATCAGCAA AGAACACATA TATGAAATTG TTTTCAGCGG CAACACATGC
ATGCTTCATT TGGCTTCAAA CACCTGCCCC GAATCCCTTG GGAAGTATCC GTATACTCCA
AAGATAAGCG GTGCCGCATA TCTGGACGCT GCCAAATACA ATATTGATAT TTCGCCGTTT
GGAATTATAT ATCTGCCTCC GATTATATCG GCTTATGTGG GCGCTGACAT CGTTTCCGGA
ATTTTGGCAT CGCAGCTTCA TGAGAAAGAT GGCGTTATTT TGTTTGTTGA CATAGGTACC
AACGGGGAAA TGGTACTGGC CTCTTGTGGA AATCTTTCGG CTACGTCCAC GGCGGCAGGA
CCGGCTTTTG AAGGAATGAA CATAACCTGC GGCATGAGGG CGGGGGAGGC GGCAATAGAG
TTTTTTGAAA TTGAGGAACA GGGGAGTATT AACATTAAGG TTATCGGTGA AACGGAAGCG
GCGGGAATTT GCGGAAGCGG GCTTTTGGAT ATGGTTGGTG AGTTTGCGGC CCATGGAGTT
ATTAAAAAGA ACGGCCAATT TATTGACCCG GAAAGCGAAA ACGTTCTGCA TCCGAAACTG
GCGGAAAGAC TTGTAAGACA GGACGGAAAA TGGATTTTTA AAGTTACCGA CAAAGTTTTC
CTTTCTCAAA AAGATATAAG GCAGGTTCAG CTTGCAAAGG GAGCTGTAAG GGCCGGAATT
GAATTTTTGC TGGAAAACAA AGGGGTAAGA GCCTCCGATG TGGATAAAGT GCTTATTGCT
GGGTCTTTTG GATATCATCT GAGGGAAAAA AGCCTTATCA ATATAGGTCT TCTTCCAAAA
GAGTTTGAGG GTAAGGTGGA GTTTGTCGGC AATACTTCGC TGTCCGGCGC AAAAGCCTTT
CTTTTGAATC AAACCTATAG GGAGAAAATG AAGGAAACGG TAAAAAGTGT CGAGGTTCTG
GAACTGGCAA ATTACAAGGA TTTTGACAGG GTCTTTGTCA GGTGCCTGAG TTTTTAG
 
Protein sequence
MPEVVFYPQN KSINVEEGTT ILQAARSAGV IIESPCNGTG TCGKCKVRLD EKSLPNVLAK 
SRHYLSKEEE EQGYVLACET QITGDIKVEL GENKQNGTLK ILSRGHSFNI DLKPFIRKEY
FVHEDVTKVF AGKEQLGIEA GDTTKENYGA VVDIGTTTLV ASIVNLNNGD EIGTSSALNP
QAVHAQDVLS RIKFSSDADG LKVMHSELTD KINSMIGKIA LRAGISKEHI YEIVFSGNTC
MLHLASNTCP ESLGKYPYTP KISGAAYLDA AKYNIDISPF GIIYLPPIIS AYVGADIVSG
ILASQLHEKD GVILFVDIGT NGEMVLASCG NLSATSTAAG PAFEGMNITC GMRAGEAAIE
FFEIEEQGSI NIKVIGETEA AGICGSGLLD MVGEFAAHGV IKKNGQFIDP ESENVLHPKL
AERLVRQDGK WIFKVTDKVF LSQKDIRQVQ LAKGAVRAGI EFLLENKGVR ASDVDKVLIA
GSFGYHLREK SLINIGLLPK EFEGKVEFVG NTSLSGAKAF LLNQTYREKM KETVKSVEVL
ELANYKDFDR VFVRCLSF
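
The sequences above are printed in space-separated blocks of 10. As a rough sketch, the Python below joins the blocks and translates the first line of the gene sequence, to show how the nucleotide and protein listings line up. The helper names `join_blocks` and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 is assumed to share.

```python
# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split())


def translate(dna):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of each listing, copied from the Sequence section above.
first_gene_line = "ATGCCAGAAG TAGTGTTTTA CCCGCAAAAC AAGTCCATTA ATGTAGAAGA AGGAACCACC "
first_protein_line = "MPEVVFYPQN KSINVEEGTT ILQAARSAGV IIESPCNGTG TCGKCKVRLD EKSLPNVLAK "

dna = join_blocks(first_gene_line)
protein = join_blocks(first_protein_line)

print(len(dna))                            # 60
print(translate(dna))                      # MPEVVFYPQNKSINVEEGTT
print(protein.startswith(translate(dna)))  # True
```

Applying the same two helpers to the full gene sequence should give the 558-residue protein followed by a single `*` for the terminal TAG codon.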