Gene TBFG_10040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10040 
SymbolleuS 
ID5220703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp43663 
End bp46572 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content63% 
IMG OID640604780 
Productleucyl-tRNA synthetase 
Protein accessionYP_001285985 
Protein GI148821231 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0495] Leucyl-tRNA synthetase 
TIGRFAM ID[TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family 


Plasmid Coverage information

Num covering plasmid clones250 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones198 
Fosmid unclonability p-value0.928861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAAT CGCCAACCGC TGGGCCTGGC GGCGTGCCCC GTGCCGACGA CGCGGACTCC 
GACGTGCCAC GGTACCGCTA TACCGCCGAG CTCGCGGCTA GGCTGGAACG GACCTGGCAG
GAAAACTGGG CCCGGCTAGG GACGTTCAAC GTGCCCAACC CGGTCGGCTC GCTGGCCCCA
CCGGATGGTG CCGCGGTGCC TGACGACAAG CTCTTCGTGC AGGACATGTT CCCCTACCCC
TCGGGTGAGG GACTCCACGT TGGTCATCCC CTCGGCTACA TCGCGACCGA CGTCTATGCC
CGCTATTTCC GGATGGTGGG CCGTAATGTG CTGCATGCGC TAGGGTTCGA CGCGTTCGGG
CTGCCCGCCG AGCAATACGC GGTACAAACC GGCACCCATC CGCGTACCCG GACCGAAGCC
AACGTCGTCA ACTTTCGCCG CCAGTTGGGC CGGCTGGGCT TCGGCCACGA CAGCCGACGA
AGCTTCTCGA CCACCGATGT CGACTTCTAC AGGTGGACTC AGTGGATCTT CCTACAGATA
TACAACGCGT GGTTCGACAC CACAGCCAAC AAGGCGCGCC CGATATCAGA GCTGGTCGCC
GAATTCGAGT CCGGTGCAAG GTGTCTCGAT GGCGGCCGGG ATTGGGCCAA GTTGACCGCG
GGGGAGCGAG CCGATGTGAT CGACGAGTAC CGGCTGGTCT ATCGGGCGGA TTCGCTGGTG
AACTGGTGCC CGGGGCTAGG TACGGTGCTT GCCAACGAAG AGGTGACCGC CGACGGCCGC
AGCGACCGGG GCAATTTTCC GGTGTTCCGG AAGCGGTTGC GGCAATGGAT GATGCGGATC
ACCGCCTATG CCGACCGGCT GCTCGACGAC CTGGATGTGC TGGATTGGCC TGAGCAGGTC
AAGACCATGC AGCGCAACTG GATCGGGCGT TCGACGGGTG CGGTGGCGCT GTTCTCGGCG
AGAGCGGCCA GCGATGACGG GTTCGAAGTC GACATCGAGG TGTTCACCAC GCGGCCCGAC
ACCTTGTTCG GCGCCACGTA TCTGGTGCTG GCTCCCGAGC ACGACTTGGT CGACGAGTTG
GTCGCCGCGT CCTGGCCGGC TGGGGTCAAC CCCTTGTGGA CATACGGCGG CGGCACACCT
GGTGAGGCCA TCGCCGCCTA CCGGCGTGCG ATCGCCGCCA AATCAGACCT CGAGCGCCAG
GAGAGCAGGG AAAAGACCGG CGTCTTCTTG GGCAGCTACG CCATCAACCC GGCCAACGGT
GAGCCGGTGC CGATCTTCAT CGCCGACTAC GTGCTGGCCG GGTACGGTAC CGGGGCAATC
ATGGCGGTGC CGGGTCATGA CCAGCGGGAC TGGGACTTCG CTCGGGCATT TGGTCTACCG
ATCGTGGAAG TAATTGCCGG CGGCAATATT TCGGAATCCG CGTATACAGG CGATGGCATC
CTGGTCAACT CGGATTACCT CAATGGAATG AGCGTGCCAG CAGCAAAGCG GGCCATCGTC
GACCGGTTGG AGTCCGCGGG CCGCGGCCGG GCTCGAATCG AATTCAAATT GCGCGACTGG
CTTTTTGCGC GGCAGCGGTA TTGGGGTGAA CCATTCCCGA TCGTCTATGA CAGCGACGGG
CGTCCGCATG CGCTCGACGA AGCTGCACTG CCCGTCGAGC TGCCTGATGT CCCGGACTAC
TCGCCGGTTT TGTTCGACCC CGACGATGCG GACAGCGAGC CTTCGCCCCC ACTGGCCAAG
GCGACTGAGT GGGTACACGT CGACCTGGAC CTCGGTGATG GCCTGAAGCC CTACAGCCGC
GACACCAACG TGATGCCGCA GTGGGCGGGC AGCTCCTGGT ATGAACTGCG CTACACCGAT
CCGCACAACT CAGAACGGTT CTGCGCCAAG GAAAACGAGG CCTATTGGAT GGGACCGCGG
CCGGCTGAGC ACGGCCCGGA CGACCCCGGT GGCGTCGACT TGTACGTCGG CGGTGCTGAA
CACGCGGTTT TGCACCTGCT GTATTCCAGG TTCTGGCACA AGGTCTTGTA CGACCTGGGT
CACGTCAGCT CTCGCGAGCC TTACCGCAGG CTGGTCAATC AGGGCTATAT TCAAGCTTAC
GCTTACACCG ATGCGCGCGG ATCCTATGTC CCTGCCGAGC AGGTGATCGA ACGCGGTGAC
AGATTTGTCT ATCCTGGACC TGACGGTGAG GTCGAAGTTT TCCAGGAATT CGGCAAAATC
GGTAAGAGCC TGAAGAATTC GGTATCGCCG GACGAAATCT GCGACGCATA CGGGGCAGAT
ACGCTTCGGG TTTACGAGAT GTCGATGGGG CCGCTGGAGG CTTCACGTCC ATGGGCCACA
AAGGATGTTG TCGGCGCGTA CCGTTTTCTG CAGCGGGTGT GGCGCTTGGT CGTCGACGAG
CACACCGGCG AAACTCGGGT GGCTGACGGC GTGGAACTCG ACATCGATAC GCTACGGGCG
TTGCACCGCA CCATCGTCGG CGTGTCAGAA GACTTTGCGG CACTTCGCAA TAACACCGCA
ACGGCTAAGT TGATCGAATA CACGAACCAC CTCACCAAGA AGCATCGTGA TGCGGTGCCT
CGGGCCGCCG TGGAGCCGCT TGTACAAATG CTGGCTCCGC TGGCCCCACA TATTGCCGAG
GAGCTGTGGC TGCGACTGGG CAACACCACC TCGTTGGCAC ACGGCCCGTT CCCGAAGGCC
GATGCCGCCT ACCTCGTCGA CGAGACGGTC GAGTATCCGG TGCAGGTGAA CGGCAAGGTA
CGTGGCCGGG TGGTGGTGGC CGCCGACACC GACGAGGAAA CGCTGAAAGC CGCCGTTCTG
ACCGACGAAA AGGTCCAGGC ATTCTTGGCT GGTGCCACCC CGCGCAAGGT TATCGTGGTC
GCCGGCCGGC TGGTCAATCT CGTCATCTAG
 
Protein sequence
MTESPTAGPG GVPRADDADS DVPRYRYTAE LAARLERTWQ ENWARLGTFN VPNPVGSLAP 
PDGAAVPDDK LFVQDMFPYP SGEGLHVGHP LGYIATDVYA RYFRMVGRNV LHALGFDAFG
LPAEQYAVQT GTHPRTRTEA NVVNFRRQLG RLGFGHDSRR SFSTTDVDFY RWTQWIFLQI
YNAWFDTTAN KARPISELVA EFESGARCLD GGRDWAKLTA GERADVIDEY RLVYRADSLV
NWCPGLGTVL ANEEVTADGR SDRGNFPVFR KRLRQWMMRI TAYADRLLDD LDVLDWPEQV
KTMQRNWIGR STGAVALFSA RAASDDGFEV DIEVFTTRPD TLFGATYLVL APEHDLVDEL
VAASWPAGVN PLWTYGGGTP GEAIAAYRRA IAAKSDLERQ ESREKTGVFL GSYAINPANG
EPVPIFIADY VLAGYGTGAI MAVPGHDQRD WDFARAFGLP IVEVIAGGNI SESAYTGDGI
LVNSDYLNGM SVPAAKRAIV DRLESAGRGR ARIEFKLRDW LFARQRYWGE PFPIVYDSDG
RPHALDEAAL PVELPDVPDY SPVLFDPDDA DSEPSPPLAK ATEWVHVDLD LGDGLKPYSR
DTNVMPQWAG SSWYELRYTD PHNSERFCAK ENEAYWMGPR PAEHGPDDPG GVDLYVGGAE
HAVLHLLYSR FWHKVLYDLG HVSSREPYRR LVNQGYIQAY AYTDARGSYV PAEQVIERGD
RFVYPGPDGE VEVFQEFGKI GKSLKNSVSP DEICDAYGAD TLRVYEMSMG PLEASRPWAT
KDVVGAYRFL QRVWRLVVDE HTGETRVADG VELDIDTLRA LHRTIVGVSE DFAALRNNTA
TAKLIEYTNH LTKKHRDAVP RAAVEPLVQM LAPLAPHIAE ELWLRLGNTT SLAHGPFPKA
DAAYLVDETV EYPVQVNGKV RGRVVVAADT DEETLKAAVL TDEKVQAFLA GATPRKVIVV
AGRLVNLVI