Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2749 |
Symbol | |
ID | 4810252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3243871 |
End bp | 3245376 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108169 |
Product | hypothetical protein |
Protein accession | YP_001039141 |
Protein GI | 125975231 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.596289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACTA TGACTGATAT AAAGTATATC AAAGATTTAT TCGAAAAGAA AGGCCTATCT CTTAGGGAAA TTACAAGAGT AACCGGACAT AATTTTAGAA CAGTCCGGAA ATATATTGAT AAAGAGGATT GGTCACAACC TCTGGTTAAT AGAACAAGGG AATCTTTGAT TAATAAATAT AAAGCAGATA TTGATGAATG GCTGGAGAGT GACGTTGATG CACCAAGAAA ACAGAGACAT ACGGCAAAAA GAATTTTTAA CAAACTGAAG CATAAATACA ATAATGAATT TAACTTGTCC TACCGAACTG TTGCAAGGTA TGTAAGCCTT AAAAAGAAAG CTTTGTATCA AGACACTGAT GGATATATAC CTTTGGAACA CCCTACTGGT GAGGCACAGG TTGATTTTGG CAGAGCTGCT TTTTTTGAAA ACGGTATTAG GTATGAAGGG TATTATGTTA CCATGTCGTT TCCATACAGC AATGGAGGGT ATATACAGCT TTTCAAAGGT GCTAATATAG AATGCCTATT ACAAGGAATG AAAAAGATTT TTGAACACAT GGGAAAAGTA CCGACATGTA TCTGGTTTGA CAATGATAAA ACAATTGTCA AAAAAATATT TGCTAATGGA GAAAGAAAAG TTACTGAAGC TTTTGCACGA TTCCGCATGC ATTACGGCTT TGAAAGTAAT TTTTGTAATC CAAGCAGTGG GCACGAAAAA GGTCATGTTG AAAACAAGGT TGGATATTCA AGAAGAAATA TGCTAGTGCC CATACCGAAG TTTAAAGATA TTGTAGAATT TAACCGTCAA TTGCTTATCC AATGTGATGA AGATATGCAG AGGGAACACT ACAAAAAGAA TGTATTCATC AATATTTTAT TTGAAGAGGA CAAAAAAGCG ATGCGGGATA TTCCAAAAGC TGAATATGAA ATATACCGCA TCGAAAAGTT AAAGTCGGAT AAATATGGCA AACTGAACTT TGACAACAGA AAATACTCTT CCGGGCCTCA ATATGCCCAG AGAGAATTAA TGATAAAAGC AGATGCCTTC TCGGTTGCAA TTATGGATGA ACAGTACAAT ACAGTTCAGG TACATAAACG TTTGTATGGA GAAGAGAAAG AGTCAATGAA GTGGGGGCCA TATCTTGAGC TAATGAGCCG TAGACCAACT GCTTTAAAGT ATACCGGTTT CTTCCGGGAG TTGCCGCAAA CTCTTCAGGA TTATCTTACA GTATGTGACT ATGAACAGAA AAAAGGTGCT TTGCGACTAT TAGTGAAGAT GTTGGAACAA AGCGAACTTG ATATAGCCAT TGAAGCCTTT AGATTCTGTA TCGAGAGGGG AATAAAAGAT TTAGACAGCA TATGGGCAAA ATATTACACT ATGGTCTGTA CACACATACA AGTTCAAGAT GTTTTACTCA ACACTAAGAC TCCTGATGTT GTGCCTTACA CTGTGGATAA TAGCATATAT GATAACCTGC TGGCTGGGGG TGTCCAATAT GTATGA
|
Protein sequence | MLTMTDIKYI KDLFEKKGLS LREITRVTGH NFRTVRKYID KEDWSQPLVN RTRESLINKY KADIDEWLES DVDAPRKQRH TAKRIFNKLK HKYNNEFNLS YRTVARYVSL KKKALYQDTD GYIPLEHPTG EAQVDFGRAA FFENGIRYEG YYVTMSFPYS NGGYIQLFKG ANIECLLQGM KKIFEHMGKV PTCIWFDNDK TIVKKIFANG ERKVTEAFAR FRMHYGFESN FCNPSSGHEK GHVENKVGYS RRNMLVPIPK FKDIVEFNRQ LLIQCDEDMQ REHYKKNVFI NILFEEDKKA MRDIPKAEYE IYRIEKLKSD KYGKLNFDNR KYSSGPQYAQ RELMIKADAF SVAIMDEQYN TVQVHKRLYG EEKESMKWGP YLELMSRRPT ALKYTGFFRE LPQTLQDYLT VCDYEQKKGA LRLLVKMLEQ SELDIAIEAF RFCIERGIKD LDSIWAKYYT MVCTHIQVQD VLLNTKTPDV VPYTVDNSIY DNLLAGGVQY V
|
| |