Gene Information |
Locus tag | Cthe_1431 |
Symbol | |
ID | 4810581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1751588 |
End bp | 1753543 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106854 |
Product | hypothetical protein |
Protein accession | YP_001037855 |
Protein GI | 125973945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
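The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1751588
end_bp = 1753543

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1956
print(protein_length)  # 651
```

Under these assumptions the results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.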
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGTTT TTCTCTTGAT TGCATCATTC TTAACGGGAT TTTGCCTGCT TAAAAAATTA ACTCAAATAA AATCCCCGAT AATGATTATT TCTGGTTCTT TTCTAATAGG GTCCGTTTTT TCCGGCACAC TGCTATACTG GCTTGATATA CTTTTCGTTA AAACCCTGAG CAACTACTAT CTCAGCAATA TAGCATACCT CGTAATATCT TTTGCTTTTA TAGCATACGT ATACCGAACA AACAGCAAAA TATTCAAAGA GCTTCTTGAC ACGATAAAAG AGTTCTGCCG TGACAGAGTT GCAATAATTT GCTTTATCGC TTTTGTTTTA TTTTCAACGT GGTTCAATTA TGATACCTTC AGTTTGTCAA ACGGAAGCAT CACCATGTCC GGGGGAGCAT GGAGCGATTT AACTCTTCAC AACGGATTTG TACGTTCCAT AAGCATTGGC CAAAACATTC CGGTTGAGCA TATCTTTTAC GCCAATACTC CGGCAAAATA TCACTTTCTG TTTGACTACT ATGTGGGTAA AATAACCCAA ACCGGCCTGC ATTCGGTTCA CGCCTTGAAT CTTATGTGTA TTTTGAGTCT TTCTTTCCTG CTGTTAATGA TATTTCAGTT TGGAAGAACA GTATTTAAAA ATGATGCCGT GGGAATTCTC GGCGCCCTCT TTTTATTGTT CCACAGCTCA ATAAGCGGTT TAAAGTGGGT TGCCGAAAAC TGGGGCTGGG ATATATTCAA GAAAATGTAT GAAAAAACCG GCTGGCTGGC TTCAACCATG TTTGAAGGCT GGGGGCTTTT TAACCTGAAT GTTTTCGTAA ATCAGCGGCA CTTTGCCTTC TCCTTGGCAT TTCTTGTTTT CATCGTTACA TATGTGATTT CTATGTATGA AAATGAAAAA GAGACAAAAA ATCCGGAACC GGGAAGCGGT GAAATAATCC CATTGAACCT GCGTAACGAT TATTCATGGC TTTTATCAAG CTGTCTTATA GGAATTGCTG TAGGTATCAT GCCTTACTGG AACTCTGTTG TAAACACGGC TCTGCTTTCA TTTTTGGGAC TGTATACAAT TATAAATATA AGAAAAAAAG ATGTCTTTAT TCCAATGTTT ATCTCAACCG CCATAGCCGG ACTTGTTTCA CTGCCGCAGC TTCTGAGATT TAAATCAGGT GCTTCGAGCC TGACAGAATA TCCAAAGTTT CATATCGGCT ATGAAGTGGG ACGCTTTGAC ATATTGGATT TAACAGAATT TTACTTCAAA GTCCTGGGAT TAAAATTAAT TATAATTGTT ATAGCATTTT TGATTGTACC CAACAGAAAA AAAATACTGT TCCTTATCCT TTCAGTGCCG TTTGTACTGG CCAATTTACT GCAGCTTGGC GTGGTGCTGT ATGACAATAA CAAACTCATG ATTTCATCAC TCATATTTAT AAACTGCCTG GCAGCTTACT ACTTAGTAGA ACTTTTCCGC CAAAAACATG TAATACTTAA ATTCATATCC GTTATCCTTT GTCTCTGCCT TATGATTGCC GGAGTGCTTG ACTTAATGTC AGTCAAGAAT CTTCCCAAAG TCAATGTGGC AGACAAATCA GACTTCACAC AGTGGATTAT TGAAAACACC GAACCGGGCT CAACTTTCCT TACTCTGCCG ACCATACAAT ATAATGACAA CGCAGTATCC AACATACTGA TGGCAGGCGG TAAAATGTAT GTACATAATG CAGCCGACTC GGCATACAAG CTCGCCGAAC GCTTCAACAT TCTAAACACC ATATTAAGGG GCGAAGAAAG TTTTGAAAAA 
ATAAAAAGCA TTATTGAGCA GGAAGGTATT GACTATATCG TGGTCAGTCC GGAACTCAGA CAAAGCCAGG AATACCCGGT AAACGAGGAA TTTCTCAAAC AAAACTTTGT AACAAAATAT GATTTTAACG GCATTACCGT TTACTCAATT TATTAG
|
Protein sequence | MFVFLLIASF LTGFCLLKKL TQIKSPIMII SGSFLIGSVF SGTLLYWLDI LFVKTLSNYY LSNIAYLVIS FAFIAYVYRT NSKIFKELLD TIKEFCRDRV AIICFIAFVL FSTWFNYDTF SLSNGSITMS GGAWSDLTLH NGFVRSISIG QNIPVEHIFY ANTPAKYHFL FDYYVGKITQ TGLHSVHALN LMCILSLSFL LLMIFQFGRT VFKNDAVGIL GALFLLFHSS ISGLKWVAEN WGWDIFKKMY EKTGWLASTM FEGWGLFNLN VFVNQRHFAF SLAFLVFIVT YVISMYENEK ETKNPEPGSG EIIPLNLRND YSWLLSSCLI GIAVGIMPYW NSVVNTALLS FLGLYTIINI RKKDVFIPMF ISTAIAGLVS LPQLLRFKSG ASSLTEYPKF HIGYEVGRFD ILDLTEFYFK VLGLKLIIIV IAFLIVPNRK KILFLILSVP FVLANLLQLG VVLYDNNKLM ISSLIFINCL AAYYLVELFR QKHVILKFIS VILCLCLMIA GVLDLMSVKN LPKVNVADKS DFTQWIIENT EPGSTFLTLP TIQYNDNAVS NILMAGGKMY VHNAADSAYK LAERFNILNT ILRGEESFEK IKSIIEQEGI DYIVVSPELR QSQEYPVNEE FLKQNFVTKY DFNGITVYSI Y
|
| |
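The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is an illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the two fragments are the first 30 and last 18 bases copied from the gene sequence above.

```python
# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard table; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons from the first base onward."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


first_30 = "ATGTTTGTTT TTCTCTTGAT TGCATCATTC"  # start of the gene
last_18 = "GT TTACTCAATT TATTAG"               # end of the gene

print(translate(first_30))  # MFVFLLIASF
print(translate(last_18))   # VYSIY*
```

The two outputs match the first ten and last five residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field (39%). A short fragment such as `first_30` will generally differ from the whole-gene figure.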