Gene Information |
Locus tag | Tneu_0203 |
Symbol | leuS |
ID | 6165044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 178322 |
End bp | 181159 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641667367 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_001793604 |
Protein GI | 171184685 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family |
|
|
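The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(178322, 181159)
print(length, protein_length_aa(length))  # prints "2838 945"
```

The result agrees with the Gene Length (2838 bp) and Protein Length (945 aa) rows.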
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.313732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC TCTCGCGGTT CTTCATAGAG CTTGGGGAGA GGTGGCAGAG GAGGTGGAGG GAGGCCCGGG TTTTCGAGCC TGAGCCGGCC CCCGGCGTCC CGAAGTATTT CATCACGGCG GCCTACCCCT ACCCCAACGG GGCTATACAC ATCGGCCACG GGCGCACCTA CCTGGTGGCC GACGTCATGG CCAGGTTCCA GAGACACCTC GGCAGATCTG TCCTCTTCCC GATGGGCTTC CACTACACAG GGACGCCTAT ACTCACAATC GCGGAGGTGA TCGCGGCGGG AGACAAGGCC GTCATGGAGG AGTACATGGA GCTGTACGGC GTCCCCGAGG AGGAGATCAA GAAGATGGGG GACCCCCTCT ACCTCGCCCG CTACTTCCAC GGCCAGTCCA AGAGGGCGAT GGAGAGGTTC GGCCTAAGCA TAGACTGGAC TAGGGAGTTC ACCACAATAG ACCCGGAGTA CCAGCGCTTC ATCCAGTGGC AGTTCGAGAA GCTGAGGAAG AAGGGGCTGA TCGTGAGGGG GAGACACCCC GTGGGCTGGT GCCCAAGGCA CTCGATGCCG GTAGGAGCTC ACGACACCAA GGACGATAAG GAGCCCGACA TTGGCCAGTG GACGCTGGTG TATTTCACGG ACTCGGAGGG GCTGACCTTC CCCACGGCCA CGCTTAGGCC GGAGACGGTG CTGGGCGTCA CCAACCTCTG GATTAACCCA GACGCCGAGT ACGTGGTGGC CGAGTTCGAC GGGAGGCGTG CCGTAGTCAG CAGAGACGCG GCGTACCGCC TCTCCTTCCA GGTGGGGGTG AAGATCTTGA GGGAGGCCAG GGGCAGGGAG TTCGTGGGCC GCATGGTTCA GAACCCGGTG ACCGGGGAGT GGGTGCCCGT ATACGAGGCC CGGTTTGTGG ACCCCAAGGT GGGGACCGGC GTTGTGATGT CTGTGCCCGC GCATGCGCCT TATGACTACG CCGCGCTCCG CGACCTAGGG ACCGTGAAGC TGATCCCGCT GATAAGGGTG GAGGGGTACG GCGATTACCC AGCTAAGGAG GTCGTGGAGA GGATGGGGAT AAAGAGCCAG GCGGACCCTG CCTTGGAGGA CGCCACCAAG GAGGTGTATT CCGCGGAGTA CGCGAGGGGC GTCATGAGGG AGGACGTCGC GGAGAGGGTG GGCGCCCACC TGGAGGAGCC AGCCAGATCG ATGTTGCGCG CCGTGTTTAA GATGTACTTC GCGGGCAGGC CCGTGAGGGA GGCTCGGGAG TTCATAGCCA GATGGCTTAC GGAGGCCCGC CTCGGCGGCG TCATGTACGA CATAATGAAC AAGCCTGTCT ACTGCCGCTG CGGGACGGAG ATCGTGGTTA AGGTGTTGGA GGACCAGTGG TTTATAAATT ACGGCGAGTC CAGATGGAAG GAGGCAGCTA GAGAGCTTGT GAAGGAGATG TCCATCGTGC CGGGGGAGGC CCGGGCGCAG TTCCTCGCCA CGATAGACTG GTTGGACAAG AGGGCGTGTG CCAGAACTCG CGGCCTCGGC ACGCCGCTTC CCTGGAGCTC GGGTTGGGTG ATAGAGAGCT TGAGCGACTC GACGATATAT ATGGCGTTTT ACACGGTGGT GAAGAGGATC AGGCAGTTCG GCATAAGGCC GGAGCAACTG ACGGAAGAGT TCTGGGACTT CGTCTTCTTG GGCCAGGGCT CGGCAGATGA AGTATCTAAG AAGACGGGGG TGCCGGTTGA GGCCCTCAAG GCCATCAGAG AGGAGTTCGA GTACTGGTAC CCCCTGGACT CTAGGAACTC CGGCAAGGAT 
CTCATCCCCA ATCACCTGAC CTTCTTCATC TTCAACCACG TGGCCATATT CCCCAGGGAG AAGTGGCCGC GGCAGATCGT GGCCAACGGC TGGGTGCTTA GAGAGGGCGA GAAGATGTCG AAGTCCAAGC GCAACGTCCT ACCTCTTGAT AGAGCGGTGG AGATGTACGG CCCGGACCCG CTTAGGGCCA CCCTGGCTCT CGCCGCCGAG GTGGAGCAGG ATCTGGACTT CAGAGACGCC GAGGCTAGGA GAAACGCCCA GCAGCTGATG TCCATATATA CGCTGGCGCA GAGGCTTGTA CAAGGCGCCG AGGAGCGGCC GCCGACGTGG GTAGACCAGT GGCTTGTGGC TGAGATCTCC AGGGTGTTGG AGAGGGCTAG AGAGGCCTAC GAGAAGGTGA GAGTTAGGCA AGCGGCGGTG GAGGTGCTCT ACAACGCCAA GGCGGTCTTC GACCAGTACC TCGCCATGGT GGAGAAACCA TCTAGGCAGG CTGTGGAGGC CGCCAAGGCG TGGGCGGTGG CGATGGAGCC CCTCGTGCCG CATCTGGCCG AGGAGCTCTG GGCTACCCTT GGCGGGGCTG GATTCGCGGC GCTGGCTCCC TGGCCTAAGC TGAGGGCTGA GCCGGCGGCG CTTCTCGCGA AGAGGTACGT CGACATGTTG ATTGAGGACG TGAAAAACAT ACCGGCCTTT GGCCAAGGGG CTAAGCGCGT CGTGATCTAC GTCAACAGGT CCTTTGCCTG GGTTAAGGCG GCTTTGGCGG GAGATGTGAA AACGGTCATA GGCGCGGGCG TGCCGCCTCA GCAGGCCAAG AAGGTGGTTG ACTTGGTAAA AACGCTGGGG GATGAGATGA GGGGGCTCAT AGCCGCCGTG GATCACTTCG ACGAGCTAGA GGCGCTTAGA TCCTACAGGA ACTACGTCGA GAAGGCGCTC GGGGCGCCGG TGGAGATCTA CGGCGCAGAT GACCCAGCGG CGCCGGATCT CGGCGGTAAG AAGAGGGTCG CCCTGCCTTT GAAGCCGGGC ATCTACGTGG AGAAGTAG
|
Protein sequence | MSELSRFFIE LGERWQRRWR EARVFEPEPA PGVPKYFITA AYPYPNGAIH IGHGRTYLVA DVMARFQRHL GRSVLFPMGF HYTGTPILTI AEVIAAGDKA VMEEYMELYG VPEEEIKKMG DPLYLARYFH GQSKRAMERF GLSIDWTREF TTIDPEYQRF IQWQFEKLRK KGLIVRGRHP VGWCPRHSMP VGAHDTKDDK EPDIGQWTLV YFTDSEGLTF PTATLRPETV LGVTNLWINP DAEYVVAEFD GRRAVVSRDA AYRLSFQVGV KILREARGRE FVGRMVQNPV TGEWVPVYEA RFVDPKVGTG VVMSVPAHAP YDYAALRDLG TVKLIPLIRV EGYGDYPAKE VVERMGIKSQ ADPALEDATK EVYSAEYARG VMREDVAERV GAHLEEPARS MLRAVFKMYF AGRPVREARE FIARWLTEAR LGGVMYDIMN KPVYCRCGTE IVVKVLEDQW FINYGESRWK EAARELVKEM SIVPGEARAQ FLATIDWLDK RACARTRGLG TPLPWSSGWV IESLSDSTIY MAFYTVVKRI RQFGIRPEQL TEEFWDFVFL GQGSADEVSK KTGVPVEALK AIREEFEYWY PLDSRNSGKD LIPNHLTFFI FNHVAIFPRE KWPRQIVANG WVLREGEKMS KSKRNVLPLD RAVEMYGPDP LRATLALAAE VEQDLDFRDA EARRNAQQLM SIYTLAQRLV QGAEERPPTW VDQWLVAEIS RVLERAREAY EKVRVRQAAV EVLYNAKAVF DQYLAMVEKP SRQAVEAAKA WAVAMEPLVP HLAEELWATL GGAGFAALAP WPKLRAEPAA LLAKRYVDML IEDVKNIPAF GQGAKRVVIY VNRSFAWVKA ALAGDVKTVI GAGVPPQQAK KVVDLVKTLG DEMRGLIAAV DHFDELEALR SYRNYVEKAL GAPVEIYGAD DPAAPDLGGK KRVALPLKPG IYVEK
|
| |
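The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before they can be used in downstream tools. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record, so its GC value is not expected to equal the 62% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
example = clean_sequence("ATGAGCGAGC TCTCGCGGTT")
print(len(example), gc_fraction(example))
```

Applying the same two functions to the complete Gene sequence field would allow the GC content row to be rechecked.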