Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2787 |
Symbol | |
ID | 4810104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3287171 |
End bp | 3288847 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108207 |
Product | ferredoxin |
Protein accession | YP_001039179 |
Protein GI | 125975269 |
COG category | [R] General function prediction only |
COG ID | [COG3894] Uncharacterized metal-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.978566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAAG TAGTGTTTTA CCCGCAAAAC AAGTCCATTA ATGTAGAAGA AGGAACCACC ATTCTTCAGG CGGCCCGCAG TGCAGGAGTG ATAATAGAGT CCCCGTGCAA CGGTACAGGA ACTTGCGGAA AATGCAAAGT AAGACTGGAT GAGAAATCTT TGCCAAATGT CCTGGCAAAA AGCAGGCATT ACCTTTCCAA AGAGGAAGAG GAGCAAGGGT ATGTACTGGC CTGCGAAACG CAAATAACCG GGGACATCAA GGTTGAACTT GGCGAAAACA AGCAAAATGG CACTCTCAAA ATATTAAGCA GGGGTCACAG TTTTAATATA GATTTGAAGC CTTTTATAAG AAAGGAATAC TTCGTCCACG AAGACGTTAC AAAAGTGTTT GCCGGAAAAG AACAATTGGG AATTGAAGCA GGAGATACAA CAAAAGAAAA CTATGGAGCC GTCGTTGACA TAGGAACTAC AACCTTGGTG GCCTCCATTG TAAATTTAAA CAACGGGGAC GAAATAGGTA CTTCTTCGGC ATTAAATCCT CAGGCCGTCC ATGCCCAGGA TGTGTTGTCG AGAATCAAGT TTTCATCCGA TGCCGATGGG CTTAAAGTTA TGCATAGTGA ACTGACAGAC AAAATTAACA GCATGATTGG TAAAATAGCT TTAAGGGCCG GTATCAGCAA AGAACACATA TATGAAATTG TTTTCAGCGG CAACACATGC ATGCTTCATT TGGCTTCAAA CACCTGCCCC GAATCCCTTG GGAAGTATCC GTATACTCCA AAGATAAGCG GTGCCGCATA TCTGGACGCT GCCAAATACA ATATTGATAT TTCGCCGTTT GGAATTATAT ATCTGCCTCC GATTATATCG GCTTATGTGG GCGCTGACAT CGTTTCCGGA ATTTTGGCAT CGCAGCTTCA TGAGAAAGAT GGCGTTATTT TGTTTGTTGA CATAGGTACC AACGGGGAAA TGGTACTGGC CTCTTGTGGA AATCTTTCGG CTACGTCCAC GGCGGCAGGA CCGGCTTTTG AAGGAATGAA CATAACCTGC GGCATGAGGG CGGGGGAGGC GGCAATAGAG TTTTTTGAAA TTGAGGAACA GGGGAGTATT AACATTAAGG TTATCGGTGA AACGGAAGCG GCGGGAATTT GCGGAAGCGG GCTTTTGGAT ATGGTTGGTG AGTTTGCGGC CCATGGAGTT ATTAAAAAGA ACGGCCAATT TATTGACCCG GAAAGCGAAA ACGTTCTGCA TCCGAAACTG GCGGAAAGAC TTGTAAGACA GGACGGAAAA TGGATTTTTA AAGTTACCGA CAAAGTTTTC CTTTCTCAAA AAGATATAAG GCAGGTTCAG CTTGCAAAGG GAGCTGTAAG GGCCGGAATT GAATTTTTGC TGGAAAACAA AGGGGTAAGA GCCTCCGATG TGGATAAAGT GCTTATTGCT GGGTCTTTTG GATATCATCT GAGGGAAAAA AGCCTTATCA ATATAGGTCT TCTTCCAAAA GAGTTTGAGG GTAAGGTGGA GTTTGTCGGC AATACTTCGC TGTCCGGCGC AAAAGCCTTT CTTTTGAATC AAACCTATAG GGAGAAAATG AAGGAAACGG TAAAAAGTGT CGAGGTTCTG GAACTGGCAA ATTACAAGGA TTTTGACAGG GTCTTTGTCA GGTGCCTGAG TTTTTAG
|
Protein sequence | MPEVVFYPQN KSINVEEGTT ILQAARSAGV IIESPCNGTG TCGKCKVRLD EKSLPNVLAK SRHYLSKEEE EQGYVLACET QITGDIKVEL GENKQNGTLK ILSRGHSFNI DLKPFIRKEY FVHEDVTKVF AGKEQLGIEA GDTTKENYGA VVDIGTTTLV ASIVNLNNGD EIGTSSALNP QAVHAQDVLS RIKFSSDADG LKVMHSELTD KINSMIGKIA LRAGISKEHI YEIVFSGNTC MLHLASNTCP ESLGKYPYTP KISGAAYLDA AKYNIDISPF GIIYLPPIIS AYVGADIVSG ILASQLHEKD GVILFVDIGT NGEMVLASCG NLSATSTAAG PAFEGMNITC GMRAGEAAIE FFEIEEQGSI NIKVIGETEA AGICGSGLLD MVGEFAAHGV IKKNGQFIDP ESENVLHPKL AERLVRQDGK WIFKVTDKVF LSQKDIRQVQ LAKGAVRAGI EFLLENKGVR ASDVDKVLIA GSFGYHLREK SLINIGLLPK EFEGKVEFVG NTSLSGAKAF LLNQTYREKM KETVKSVEVL ELANYKDFDR VFVRCLSF
|
| |