Gene Information |
Locus tag | Cthe_1172 |
Symbol | |
ID | 4810124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1399482 |
End bp | 1400630 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106594 |
Product | hypothetical protein |
Protein accession | YP_001037597 |
Protein GI | 125973687 |
COG category | [S] Function unknown |
COG ID | [COG2006] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
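The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    protein_len = codons - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the record: Start bp 1399482, End bp 1400630
print(cds_lengths(1399482, 1400630))  # (1149, 382)
```

The result matches the listed Gene Length (1149 bp) and Protein Length (382 aa).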
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAG TGGCTCTAAT CAGATGTGAA AGTTATGACT ATGATGCCGT CAAATCAGCC GTAAAAAGGG GGCTTGACCT TATTGGAGGC CCTCACCGGT TTGCCGCTCC CAATGAAAAA ATACTCTTAA AACCCAATCT TCTTTCGGCA GACCCGCCGG AAAGATGCAG CACAACGCAC CCTTCCGTAT TTAAAGCCGT GGCGGAAATA TTCATGGAGG CAGGAATAAC CAATCTTTCC TACGGCGACT CCCCCGGCAT TCACAAGCCC ATAACCGCGG CGAGAAAAAA CGGAATTGAA AAAGCTGCAA ATGAGCTTGG AATCAAACTT GCCGATTTCC TGGAAGGAAA GGAAGTGTTC TTTGAAAACG CAATCCAAAA TAAAAAGTTT ATAATAGCAA ACGGTGTTCT TGAAAGCGAC GGCATTATCA GTCTGCCCAA GCTAAAAACC CATGGATTTG CCAGAATGAC GGGTTGTGTG AAAAACCAGT TTGGATGCAT ACCCGGACCT CTGAAAGGAG AATTTCACGT TCGGATTCCC AGTATAATCG ATTTTTCAAA AATGCTGGTG GATCTTAACG TTTATTTAAA ACCTCGTCTT TTTGTCATGG ACGGTATCAT AGCAATGGAA GGCAACGGAC CCAGAGGCGG GACTCCCAGA AAAATAAATG CGATACTTCT TTCCGAAGAT CCAATTGCCC TGGATGCCAC TGTATGCAGA ATGATAAATT TAAACCCCGA GTTTGTGCCT ACCATAGTAT TTGGAAAAGA AGCCGGCCTT GGAACTTATG ACGAAAATGA AATTGAAATT CTCGGAGATG ATATTCAAAG CTTCATAACT TATGACTTTG ATGTAAGAAG AGAACCTGTA AAGCCCTTCA AGCCCGGCGG AGCCATCCAG TTTTTCAGAA ATTTCATCGT TCCAAAGCCC TACATTTTAA AAAACAAATG TATTAAATGC GGAGTATGTG TAAATGCGTG TCCGGTAAAA CCAAAAGCAG TAGATTGGCA CAACGGAAAT AAAAAAGAAC CTCCTACATA TATTTACAAA AGATGTATAA GATGTTACTG CTGTCAGGAA CTTTGTCCGG AAAGCGCAAT CCACCTCAAG GTTCCTTTTA TTCGCAAATT TTTTTATAAT CCGAAATAA
|
Protein sequence | MSKVALIRCE SYDYDAVKSA VKRGLDLIGG PHRFAAPNEK ILLKPNLLSA DPPERCSTTH PSVFKAVAEI FMEAGITNLS YGDSPGIHKP ITAARKNGIE KAANELGIKL ADFLEGKEVF FENAIQNKKF IIANGVLESD GIISLPKLKT HGFARMTGCV KNQFGCIPGP LKGEFHVRIP SIIDFSKMLV DLNVYLKPRL FVMDGIIAME GNGPRGGTPR KINAILLSED PIALDATVCR MINLNPEFVP TIVFGKEAGL GTYDENEIEI LGDDIQSFIT YDFDVRREPV KPFKPGGAIQ FFRNFIVPKP YILKNKCIKC GVCVNACPVK PKAVDWHNGN KKEPPTYIYK RCIRCYCCQE LCPESAIHLK VPFIRKFFYN PK
|
| |
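The protein sequence follows from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand, ignores the spaces that group the bases in blocks of ten, and uses the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ in which start codons they allow). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record
print(translate("ATGTCAAAAG TGGCTCTAAT CAGATGTGAA"))  # MSKVALIRCE
print(translate("CCGAAATAA"))                         # PK
```

The two outputs agree with the first ten residues and the last two residues of the listed protein sequence.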