Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0723 |
Symbol | |
ID | 4810341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 877816 |
End bp | 879042 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106140 |
Product | tyrosyl-tRNA synthetase |
Protein accession | YP_001037151 |
Protein GI | 125973241 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0162] Tyrosyl-tRNA synthetase |
TIGRFAM ID | [TIGR00234] tyrosyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.154159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGTG TGTTTGAAAC TTTGAAAGAA AGGGGATATA TTGCGCAACT TACCCATGAA GAAAAAATAA AAGAGCTTCT AGAAAAGGAA AAAATTACAT TTTATATCGG ATTTGATCCC ACTGCCGACA GTCTTCATGT GGGACATTTC CTGCAGATGA TGGTCATGGC CCACATGCAA AAAGCGGGAC ACAGACCCAT AGCGTTAATC GGCGGCGGTA CAGCCATGGT CGGCGACCCC ACAGGGAGAA CTGATATGAG AAAAATGATG ACCCGGGAGG AAATCAAGCA TAATGCCGAT TGCTTTAAAA AGCAGCTTTC CAAATTCATT GAATTTGGAG AAGGAAAAGC CATAATGGTG GATAACGCCG ATTGGCTGCT TGATTTAAAT TATATTGAAT TTTTGAGAGA TATAGGAGTG CATTTTTCCG TCAACCGAAT GTTGACGGCA GAATGTTTTA AATCCAGACT CGAAAGAGGT CTTTCCTTTA TCGAGTTCAA CTACATGCTC ATGCAAAGCT ATGACTTCCT GAAACTGTAT AAGGAATACG GCTGTATCAT GCAGCTCGGC GGTGACGACC AGTGGTCAAA CATTCTCGGA GGAATAGATC TTATTAGAAG AAAAGAAGGA AAAGAAGTCT ATGGTATGAC CTTTACACTT CTTACCACCA GTGAAGGCAA AAAGATGGGT AAGACGGAAA AAGGTGCATT ATGGCTGGAT GCCAACAAGA CTTCTCCTTA TGAGTTCTAT CAGTACTGGA GAAATATTCA TGACGCAGAT GTGATAAAAT GCTTAAAACT CTTAACCTTT GTTCCTATGG AAGAAATCGA AGAGTATGCA AAACTTAAAG ATCAGGAAAT TAACATTGCT AAAAAACGTC TGGCCTTCGA AGTTACAAAG CTTATTCACG GTGAGGAAGA AGCATTAAAT GCCCAAAAAA CTGCAGAGGC CTTGTTTGAA AAAGGTGCAA GCACTGACAA CATGCCTACT ACTGAAGTAG CCTCCGGTGA GCTTTCCAAC GGTATAAACA TAATTGACCT GCTTTTAAAA ACAAAGCTGA TTCCTTCAAA GGGTGAAGGC CGCCGACTTA TCGAACAGGG CGGAATTTCT GTAAACGACG TTAGAGTTGA AGGTTTTGAC AGATTAGTCA CCATGGATGA TTTCAACAAC GGAGAGTTAA TTATTAAAAA GGGTAAAAAG ACATACCACA GGGTAAAACT TGTATAA
|
Protein sequence | MSSVFETLKE RGYIAQLTHE EKIKELLEKE KITFYIGFDP TADSLHVGHF LQMMVMAHMQ KAGHRPIALI GGGTAMVGDP TGRTDMRKMM TREEIKHNAD CFKKQLSKFI EFGEGKAIMV DNADWLLDLN YIEFLRDIGV HFSVNRMLTA ECFKSRLERG LSFIEFNYML MQSYDFLKLY KEYGCIMQLG GDDQWSNILG GIDLIRRKEG KEVYGMTFTL LTTSEGKKMG KTEKGALWLD ANKTSPYEFY QYWRNIHDAD VIKCLKLLTF VPMEEIEEYA KLKDQEINIA KKRLAFEVTK LIHGEEEALN AQKTAEALFE KGASTDNMPT TEVASGELSN GINIIDLLLK TKLIPSKGEG RRLIEQGGIS VNDVRVEGFD RLVTMDDFNN GELIIKKGKK TYHRVKLV
|
| |