Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2466 |
Symbol | |
ID | 4809846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2938591 |
End bp | 2939673 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107881 |
Product | hypothetical protein |
Protein accession | YP_001038861 |
Protein GI | 125974951 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00527701 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAACA GAATCATTAA AGAATCAATA TGTACAAGCG ACACCATAGA CCAATTATCT TGGTTTGAGG AAGTGTTTTT CTATCGCCTC CTGGTTAATT GCGATGATTA TGGGAGGATG GATGCAAGGC CGGCGATTCT GAAGGCGAAG CTGTTTCCTC TTAAAAGTGT TACCGAAAAG CAGATTTCTG ATGCTTTAAA TAAGTTATCG ACGGTAGGTA TTGTAGCCCT ATATGAGTAT GATGGGAGAC CGTACCTGCA ATTGGTAACT TGGGAAAAGC ATCAGCAAAT ACGTTCCCGG AAGGCAAAGT ATCCATTGCC TCCAGAGGAT ATTCCTTGTA AGCGCGAACA CATCATGCCC GAAGAAAAAG ACATCGAGGA CCTCTTGTAT GATGTCATGA GCTCAACGAA ACGATTTGAG GAACATACCT TGCTCTCGGT TGAAAGACAG GTAAGGGTCG GTGAAAGCTA TCTTGATATC GTTGCTAAAA CCGAGAGCTC CGAAACACTT GTATTTGAAT TGAAACGCGG CCGGTTGAGT AATAAGGCTA TTGACCAGAT ATCCAAGTAT TTGACCCTCA TAAATGGAAA TGGCATCTTA ATAGGCTGCG GCTTAAGCGC CAATTTCGAC ATTGAGCGGT GCAGGAGCAA CGATATAGCT GTTGTCATTT ATGATGATGA CCTGAATATG TCGCTTGTAT TAGGTAACTC CACTGTAAAC AGTATTGATT TAACGTTAAA TCACGTTAAA TCACGTTATG CAAAGTTAGC GCCTAATCCA ATCCAATCCA ATCCAATCCG AATCCAATCC GAATCCAATC CTAATCCGAA TCCAATTAGT AATAATGGCG CGAACAAGTC GCGCGGATTC ACTCCTCCTA CTCTTGAGGA AGTGGCCGCA TATTGCCAGG AGCGTAACAA CGGTGTTGAT CCACAGAAAT GGTATGACTT TTACGCCGCC AAGGGTTGGA TGATTGGGAA AAACAAGATG AAGGATTGGA AAGCAGCGGT GCGCACTTGG GAGAAGCGGC AACAAAAAGG GGGTTATACA TACAACTATG AGGATGGAGG CGATAGTCTG TGA
|
Protein sequence | MPNRIIKESI CTSDTIDQLS WFEEVFFYRL LVNCDDYGRM DARPAILKAK LFPLKSVTEK QISDALNKLS TVGIVALYEY DGRPYLQLVT WEKHQQIRSR KAKYPLPPED IPCKREHIMP EEKDIEDLLY DVMSSTKRFE EHTLLSVERQ VRVGESYLDI VAKTESSETL VFELKRGRLS NKAIDQISKY LTLINGNGIL IGCGLSANFD IERCRSNDIA VVIYDDDLNM SLVLGNSTVN SIDLTLNHVK SRYAKLAPNP IQSNPIRIQS ESNPNPNPIS NNGANKSRGF TPPTLEEVAA YCQERNNGVD PQKWYDFYAA KGWMIGKNKM KDWKAAVRTW EKRQQKGGYT YNYEDGGDSL
|
| |