Gene Information |
Locus tag | Cthe_1148 |
Symbol | |
ID | 4810816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1364867 |
End bp | 1366150 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640106570 |
Product | hypothetical protein |
Protein accession | YP_001037573 |
Protein GI | 125973663 |
COG category | |
COG ID | |
TIGRFAM ID | |
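
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1364867, 1366150)
print(length, protein_length_aa(length))  # prints: 1284 427
```

The strand does not affect this calculation; it only determines which strand of the replicon is read.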
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00179115 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATTGCG TGTTTATTAC ACCAGATAAA ATGGATGGAA AGCGGATTTC TTATGATTAT GTAGTACAGC AATATGAAAT ACAGCGATTA TTACAGAGGC GTCCCCTTAT AACATTAATT GATATTTGCA AAGAAATAAC AAGTGGGATT CGGGTTAAGA AAGAGTATTA TACAGATAAA AACGGATATA AGATTATTGC TCCTGGAGAC ATAAGAAATG AAGTTATATA TATTAATGAA CTTAAAGTAG TACAGCCTGA AGTAGTAAGA GAAAAAGACA TTATAAATAA TGGAGATATA TTGATTACAG CCTCAGGTAA ATCAGGACAG GTAATTTATG TAAATGAAGT ATTAGAAGGA TGTGTAGTAA CATCGGATAT TATTAAAATT ACATTAAGGG ATAGGGATAA AGGTATAAGA TTATATAAGT TTTTAAAAAG CAGTATAGGA CAAATGCTGT TAAACTCCAT AAAAATAGGG ATTTTAAATA AAATTTTTGT GGAGGATGTT GAAAATTTAT TAATTCCTGA AGACTTTGAT ACATATCAGG AAGATTGTTC TGATGATTCT ACAGTATATG CAGAGGCTGA AAAACTATAC AGGTCTGCAG AAAACATATT TTACAGGGTA TTTGATTATA AAGGTGAAAA AAAGAATCTT AAACACTTTT ATGTGACAGA ATACCTTGAC AGTCACAGAT TAGACCCTGA GTACTACTCG AACTTTTACA CTGAACTGTA TAGGGTAATT CACAAGAATT TTGATGATGT AAAATGGGAG GAACTTGGAG AACTTGTAGA AATAAAAAAA GCAGATAAGC CCGAAATAAG TAAGAATCAA AAAGTAAAGT ATTTTTTGTT AGCGGATATA GACCCGAATT TTTCAATCAT AAAAGAAACA CATGAGGATT TTTATGGAAA TTTAAGCAAT CGGATGAGAT ATATTGTGAG GAGGGGAGAA ATAGTAACTG CCAAAGGAGG CAGTGCTACA GGGACTAAGG GACATGCAAC GGCACTTATT ACAGAAAAAT TTGATGGTTT GGTTACGACA GATGCTCTAT ATAACTTGGT CCCCAGAAGA ATTAATCCTT ACTATCTTCT GTTTTTGTTT AAACAGCCAA TAATTCTAAA CCAGGTAAAC ATGTTTACTA AAGGGACACT ATATAAACTC ATTCAAAGAA ATGACTTTGA AAAAATCAAA ATTCCAAGAC TGGAAAGTAG TTTGGAAGAA CAAATAGTAG ATAAAATGAT GAATTATTTA AGTGTGTTAC AAAACAAATT TTAA
Protein sequence | MNCVFITPDK MDGKRISYDY VVQQYEIQRL LQRRPLITLI DICKEITSGI RVKKEYYTDK NGYKIIAPGD IRNEVIYINE LKVVQPEVVR EKDIINNGDI LITASGKSGQ VIYVNEVLEG CVVTSDIIKI TLRDRDKGIR LYKFLKSSIG QMLLNSIKIG ILNKIFVEDV ENLLIPEDFD TYQEDCSDDS TVYAEAEKLY RSAENIFYRV FDYKGEKKNL KHFYVTEYLD SHRLDPEYYS NFYTELYRVI HKNFDDVKWE ELGELVEIKK ADKPEISKNQ KVKYFLLADI DPNFSIIKET HEDFYGNLSN RMRYIVRRGE IVTAKGGSAT GTKGHATALI TEKFDGLVTT DALYNLVPRR INPYYLLFLF KQPIILNQVN MFTKGTLYKL IQRNDFEKIK IPRLESSLEE QIVDKMMNYL SVLQNKF
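
The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG being an alternative start codon under translation table 11. The following is a minimal sketch, with hypothetical helper names, of how the block formatting could be removed and how the GC content and terminal stop codon could be checked. It is not the method used by the database to produce the values in this record.

```python
# Stop codons under translation table 11 (the same three as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    seq = strip_blocks(seq)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Example using only the first two blocks of the gene sequence above.
first_blocks = "GTGAATTGCG TGTTTATTAC"
print(strip_blocks(first_blocks))
print(gc_percent(first_blocks))
```

Passing the full Gene sequence string to `gc_percent` and `ends_with_stop` would allow the reported GC content and reading frame to be checked against the record.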