Gene Tneu_0203 details

Gene Information

Locus tag: Tneu_0203
Symbol: leuS
ID: 6165044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 178322
End bp: 181159
Gene Length: 2838 bp
Protein Length: 945 aa
Translation table: 11
GC content: 62%
IMG OID: 641667367
Product: leucyl-tRNA synthetase
Protein accession: YP_001793604
Protein GI: 171184685
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0495] Leucyl-tRNA synthetase
TIGRFAM ID: [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family
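
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for the example, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information record.
start_bp = 178322
end_bp = 181159

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2838
print(protein_length)  # 945
```

Under those assumptions the computed values agree with the listed Gene Length (2838 bp) and Protein Length (945 aa).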


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.313732
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCGAGC TCTCGCGGTT CTTCATAGAG CTTGGGGAGA GGTGGCAGAG GAGGTGGAGG 
GAGGCCCGGG TTTTCGAGCC TGAGCCGGCC CCCGGCGTCC CGAAGTATTT CATCACGGCG
GCCTACCCCT ACCCCAACGG GGCTATACAC ATCGGCCACG GGCGCACCTA CCTGGTGGCC
GACGTCATGG CCAGGTTCCA GAGACACCTC GGCAGATCTG TCCTCTTCCC GATGGGCTTC
CACTACACAG GGACGCCTAT ACTCACAATC GCGGAGGTGA TCGCGGCGGG AGACAAGGCC
GTCATGGAGG AGTACATGGA GCTGTACGGC GTCCCCGAGG AGGAGATCAA GAAGATGGGG
GACCCCCTCT ACCTCGCCCG CTACTTCCAC GGCCAGTCCA AGAGGGCGAT GGAGAGGTTC
GGCCTAAGCA TAGACTGGAC TAGGGAGTTC ACCACAATAG ACCCGGAGTA CCAGCGCTTC
ATCCAGTGGC AGTTCGAGAA GCTGAGGAAG AAGGGGCTGA TCGTGAGGGG GAGACACCCC
GTGGGCTGGT GCCCAAGGCA CTCGATGCCG GTAGGAGCTC ACGACACCAA GGACGATAAG
GAGCCCGACA TTGGCCAGTG GACGCTGGTG TATTTCACGG ACTCGGAGGG GCTGACCTTC
CCCACGGCCA CGCTTAGGCC GGAGACGGTG CTGGGCGTCA CCAACCTCTG GATTAACCCA
GACGCCGAGT ACGTGGTGGC CGAGTTCGAC GGGAGGCGTG CCGTAGTCAG CAGAGACGCG
GCGTACCGCC TCTCCTTCCA GGTGGGGGTG AAGATCTTGA GGGAGGCCAG GGGCAGGGAG
TTCGTGGGCC GCATGGTTCA GAACCCGGTG ACCGGGGAGT GGGTGCCCGT ATACGAGGCC
CGGTTTGTGG ACCCCAAGGT GGGGACCGGC GTTGTGATGT CTGTGCCCGC GCATGCGCCT
TATGACTACG CCGCGCTCCG CGACCTAGGG ACCGTGAAGC TGATCCCGCT GATAAGGGTG
GAGGGGTACG GCGATTACCC AGCTAAGGAG GTCGTGGAGA GGATGGGGAT AAAGAGCCAG
GCGGACCCTG CCTTGGAGGA CGCCACCAAG GAGGTGTATT CCGCGGAGTA CGCGAGGGGC
GTCATGAGGG AGGACGTCGC GGAGAGGGTG GGCGCCCACC TGGAGGAGCC AGCCAGATCG
ATGTTGCGCG CCGTGTTTAA GATGTACTTC GCGGGCAGGC CCGTGAGGGA GGCTCGGGAG
TTCATAGCCA GATGGCTTAC GGAGGCCCGC CTCGGCGGCG TCATGTACGA CATAATGAAC
AAGCCTGTCT ACTGCCGCTG CGGGACGGAG ATCGTGGTTA AGGTGTTGGA GGACCAGTGG
TTTATAAATT ACGGCGAGTC CAGATGGAAG GAGGCAGCTA GAGAGCTTGT GAAGGAGATG
TCCATCGTGC CGGGGGAGGC CCGGGCGCAG TTCCTCGCCA CGATAGACTG GTTGGACAAG
AGGGCGTGTG CCAGAACTCG CGGCCTCGGC ACGCCGCTTC CCTGGAGCTC GGGTTGGGTG
ATAGAGAGCT TGAGCGACTC GACGATATAT ATGGCGTTTT ACACGGTGGT GAAGAGGATC
AGGCAGTTCG GCATAAGGCC GGAGCAACTG ACGGAAGAGT TCTGGGACTT CGTCTTCTTG
GGCCAGGGCT CGGCAGATGA AGTATCTAAG AAGACGGGGG TGCCGGTTGA GGCCCTCAAG
GCCATCAGAG AGGAGTTCGA GTACTGGTAC CCCCTGGACT CTAGGAACTC CGGCAAGGAT
CTCATCCCCA ATCACCTGAC CTTCTTCATC TTCAACCACG TGGCCATATT CCCCAGGGAG
AAGTGGCCGC GGCAGATCGT GGCCAACGGC TGGGTGCTTA GAGAGGGCGA GAAGATGTCG
AAGTCCAAGC GCAACGTCCT ACCTCTTGAT AGAGCGGTGG AGATGTACGG CCCGGACCCG
CTTAGGGCCA CCCTGGCTCT CGCCGCCGAG GTGGAGCAGG ATCTGGACTT CAGAGACGCC
GAGGCTAGGA GAAACGCCCA GCAGCTGATG TCCATATATA CGCTGGCGCA GAGGCTTGTA
CAAGGCGCCG AGGAGCGGCC GCCGACGTGG GTAGACCAGT GGCTTGTGGC TGAGATCTCC
AGGGTGTTGG AGAGGGCTAG AGAGGCCTAC GAGAAGGTGA GAGTTAGGCA AGCGGCGGTG
GAGGTGCTCT ACAACGCCAA GGCGGTCTTC GACCAGTACC TCGCCATGGT GGAGAAACCA
TCTAGGCAGG CTGTGGAGGC CGCCAAGGCG TGGGCGGTGG CGATGGAGCC CCTCGTGCCG
CATCTGGCCG AGGAGCTCTG GGCTACCCTT GGCGGGGCTG GATTCGCGGC GCTGGCTCCC
TGGCCTAAGC TGAGGGCTGA GCCGGCGGCG CTTCTCGCGA AGAGGTACGT CGACATGTTG
ATTGAGGACG TGAAAAACAT ACCGGCCTTT GGCCAAGGGG CTAAGCGCGT CGTGATCTAC
GTCAACAGGT CCTTTGCCTG GGTTAAGGCG GCTTTGGCGG GAGATGTGAA AACGGTCATA
GGCGCGGGCG TGCCGCCTCA GCAGGCCAAG AAGGTGGTTG ACTTGGTAAA AACGCTGGGG
GATGAGATGA GGGGGCTCAT AGCCGCCGTG GATCACTTCG ACGAGCTAGA GGCGCTTAGA
TCCTACAGGA ACTACGTCGA GAAGGCGCTC GGGGCGCCGG TGGAGATCTA CGGCGCAGAT
GACCCAGCGG CGCCGGATCT CGGCGGTAAG AAGAGGGTCG CCCTGCCTTT GAAGCCGGGC
ATCTACGTGG AGAAGTAG
 
Protein sequence
MSELSRFFIE LGERWQRRWR EARVFEPEPA PGVPKYFITA AYPYPNGAIH IGHGRTYLVA 
DVMARFQRHL GRSVLFPMGF HYTGTPILTI AEVIAAGDKA VMEEYMELYG VPEEEIKKMG
DPLYLARYFH GQSKRAMERF GLSIDWTREF TTIDPEYQRF IQWQFEKLRK KGLIVRGRHP
VGWCPRHSMP VGAHDTKDDK EPDIGQWTLV YFTDSEGLTF PTATLRPETV LGVTNLWINP
DAEYVVAEFD GRRAVVSRDA AYRLSFQVGV KILREARGRE FVGRMVQNPV TGEWVPVYEA
RFVDPKVGTG VVMSVPAHAP YDYAALRDLG TVKLIPLIRV EGYGDYPAKE VVERMGIKSQ
ADPALEDATK EVYSAEYARG VMREDVAERV GAHLEEPARS MLRAVFKMYF AGRPVREARE
FIARWLTEAR LGGVMYDIMN KPVYCRCGTE IVVKVLEDQW FINYGESRWK EAARELVKEM
SIVPGEARAQ FLATIDWLDK RACARTRGLG TPLPWSSGWV IESLSDSTIY MAFYTVVKRI
RQFGIRPEQL TEEFWDFVFL GQGSADEVSK KTGVPVEALK AIREEFEYWY PLDSRNSGKD
LIPNHLTFFI FNHVAIFPRE KWPRQIVANG WVLREGEKMS KSKRNVLPLD RAVEMYGPDP
LRATLALAAE VEQDLDFRDA EARRNAQQLM SIYTLAQRLV QGAEERPPTW VDQWLVAEIS
RVLERAREAY EKVRVRQAAV EVLYNAKAVF DQYLAMVEKP SRQAVEAAKA WAVAMEPLVP
HLAEELWATL GGAGFAALAP WPKLRAEPAA LLAKRYVDML IEDVKNIPAF GQGAKRVVIY
VNRSFAWVKA ALAGDVKTVI GAGVPPQQAK KVVDLVKTLG DEMRGLIAAV DHFDELEALR
SYRNYVEKAL GAPVEIYGAD DPAAPDLGGK KRVALPLKPG IYVEK