Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2870 |
Symbol | |
ID | 4809150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3390246 |
End bp | 3391547 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108289 |
Product | hypothetical protein |
Protein accession | YP_001039261 |
Protein GI | 125975351 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.221651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTGTTG AGATTCTGGT TTTAATACTT CTGATAGTGC TGAACGGATT TTTTGCGGCT TCTGAGATTG CTCTGATATC TTTAAATGAC AACAAGCTCA GGTTGATGGG CAATGAAAAG AGCAAAAAGA AGATTGAAAT ATTGAAAAAA CTATTATCTG AACCGGGCAG GTTCCTGGCC ACCATCCAAA TTGGTATAAC TCTTGGCGGA ACATTGTCGA GTGCTTTCGC GTCCGAGAGT TTTTCCGACC GGTTGGCAGG ACTTATAAAA CAGACGGGTG TACCTGTTCC GGATGCAGTG CTTAAGACTC TGTCCATGAT TTTCGTATCT GTAATTTTGT CATATTTTTC ACTGGTAATT GGCGAACTGG TTCCTAAAAG ACTTGCGATG AAGAAAGAAG AAGCCATATC CATGTTTGCT GCCAGGCCGC TTTATATCCT TTCAGTTGTC ACTTATCCTG CTGTAAAGCT TCTCAATGCT TCAACCAATT TGATAGTCAG ACTTTTCGGA ATTGACCCCA ATGCGGACGA GGAAGAGGTC ACCGAGGAAG AGATTCGAAT GATGGTTGAC GTTGGAGAAG AAAAGGGGAC CATACAGGAA AACGAGAAGG AAATGATCAA TAATATTTTT GATTTTGACA ACAAGACGGT TATGGATATT ATGACCCATA GAACCGATAT TGTGGCTCTT CCGGTTGATG CAAGTCTTGA TGAAGTTATA TCCTTGTTTA ATGAGGAAAA ATACACAAGG ATACCGGTGT ATGAGGAAAG CATAGATAAC ATTGTAGGGA TACTTCACGT CAAGGATTTA ATAAAATATA TAGGCGTCGG GAGTGATACT GCAGACTTTG ATTTAAGAAA GATAATAAGA AAGCCTTATA ATGTGCCTTG GTCCAAAAAG GCAGACGAGC TTTTCAGCGA GCTGCAGAAA AACAAGGTCC ATATGGCGAT TATTATTGAC GAGTACGGAG GAACGGCCGG AATTGTCACT GTTGAAGACC TTGTGGAGGA AATTGTGGGT AATATATTTG ACGAATACGA TGAAGAAGAA AAAGATTTCG AAAAATTGGA TGAGAGCACT TATATATTCA GCGGCACCGC AGGTCTTGAT GTTTTAAATG AATGGGCGGA CGCGCAGCTG CCTGAGGACG AGTATGACAC ATTGAGCGGT TTTATTATAA GTCAGTTGGG TAGAATTCCG GAGTATGATG AAAAGCCGGA AATTGAGTTC AACGGACTTT TATTCAAAGT GGAAGAGGTA AGTGAAAAAA GGATTGAAAA AATTAAGGTG TGCAGAGCGT AA
|
Protein sequence | MFVEILVLIL LIVLNGFFAA SEIALISLND NKLRLMGNEK SKKKIEILKK LLSEPGRFLA TIQIGITLGG TLSSAFASES FSDRLAGLIK QTGVPVPDAV LKTLSMIFVS VILSYFSLVI GELVPKRLAM KKEEAISMFA ARPLYILSVV TYPAVKLLNA STNLIVRLFG IDPNADEEEV TEEEIRMMVD VGEEKGTIQE NEKEMINNIF DFDNKTVMDI MTHRTDIVAL PVDASLDEVI SLFNEEKYTR IPVYEESIDN IVGILHVKDL IKYIGVGSDT ADFDLRKIIR KPYNVPWSKK ADELFSELQK NKVHMAIIID EYGGTAGIVT VEDLVEEIVG NIFDEYDEEE KDFEKLDEST YIFSGTAGLD VLNEWADAQL PEDEYDTLSG FIISQLGRIP EYDEKPEIEF NGLLFKVEEV SEKRIEKIKV CRA
|
| |