Gene TBFG_11947 details

Gene Information

Locus tag: TBFG_11947
Symbol:
ID: 5222624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 2175904
End bp: 2178945
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 61%
IMG OID: 640606704
Product: PPE family protein
Protein accession: YP_001287884
Protein GI: 148823130
COG category: [N] Cell motility
COG ID: [COG5651] PPE-repeat proteins
TIGRFAM ID:
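
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2175904, 2178945)
print(length)                     # 3042
print(protein_length_aa(length))  # 1013
```

Both results agree with the Gene Length (3042 bp) and Protein Length (1013 aa) fields.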


Plasmid Coverage information

Num covering plasmid clones: 241
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 201
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTACG AACGAGTCAG CGACGACGGG CGTGTGACCT TTCATCAGCA CCAGGCTCAC 
CCCGGAAGGT GGGGAGCGAT GCATTACTCA GTGTTGCCGC CGGAGATCAA CTCGGCCTTG
ATCTTCGCCG GGGCGGGCTC CGGACCGATG CTGGCGGCGG CGTCGGCCTG GGACGGGCTG
GCAACCGAAT TAGCCTCGGC TGCTGTCTCT TTCGGCTCGG TGACAGCCGG GCTGGTCGGC
GGGTCGTGGC AGGGTCGGTC ATCGGTGGCG ATGGCAGCGG CGGCAGCCCC GTATGCGGGG
TGGCTGGCCG CGGCGGCGAC CCAGGCCGAG CAGGCGGCCA CCCAGGCCCA GGTGATGGTG
GCCGAGTTCG AGGCTGTGCG GCTGGCGATG GTACAACCGG CGCTGGTGGC CGCCAACCGT
TCCGGCCTCA TATCGCTGGT GATATCGAAC CTTTTTGGTC AAAACGCTCC CGCGATCGCG
GCCGCCGAAG CCGCATACGA GGAGATGTGG GCTCTGGATG TATCGGCGAT GGCGGCCTAC
CATTCCGGGG CGTCGGCGGT CGCTGTGGCG CTACCGGCAT TCGCCCTCCC GCTGCGGCTT
CCGGCGGGTC TGGCGGCCGG GCCCGCGGCC GTGGTGACCG CGCTCACCAC GGCCGTGGGC
ATGCCGACTT TTGCCGGCCG GGCGATCGCC GCTAGCCTCG GCTTGGCCAA CGTCGGTGGT
GGCAACCTCG GCAATGCCAA CAATGGGCTC GGCAACATCG GCAACGCCAA CCTTGGCAAC
AACAATCTGG GGTCCGGCAA CTTCGGTAGC TTCAATATCG GCTCGGCCAA CCTAGGTGGC
AACAACATCG GCATAGGAAA CGCGGGCGCC AACAACTTCG GACTTGCAAA CCTGGGCAAT
TTGAACACGG GATTCGCCAA TGCAGGCATC GGCAACTTCG GAATTGCGAA CACCGGCAAC
AACAATATCG GCAACGGCCT GACTGGAAAC AACCAAATCG GCATTGGCGG ACTCAATTCC
GGCAACGGTA ACGTCGGATT ATTCAACGCG GGTAGCGCCA ATATCGGTTT CTTCAACTCC
GGCAATGGCA ACTTTGGCAT CGGGAACTCC GGTAACTTCA GCACTGGCCT GTTCAACCCC
GGACACGGCA ACACCGGATT CCTGAATGCG GGCTCTTTCA ATACGGGCAT GTTCGACGTT
GGGAACGCGA ACACCGGCAG CTTCAACGTC GGCCACTACA ACTTCGGTGC CTTCAACCCG
GGCCCGTCGA ACACGGGTAC CTTCAACACG GGCGGCGCCA ACACCGGCTG GTTCAACACA
GGAAGCATCA ACACCGGCGC CTTCAACATA GGCGACATGA ATAACGGCTT GTTCAACACG
GGCGACATGA ACAATGGTGT CTTTTACCGT GGTGTGGGCC AAGGCAGCCT GCAGTTCGCC
ATCACCAGCC CTGATTTGAC GCTTCCGTCT CTGGAAATAC CCGGAATCTC GGTTCCCGCG
TTCAGCCTGC CCGCGATAAC CTTGCCGTCG TTGACGATTC CGGCGGTGAC GACGCCGGCC
AACGTTACCG TGGGTGCGTT TGATTTGCCG GGGTTGACGG TGCCGTCGTT GACGATTCCA
GCGGCGATGA CGCCAGCTAA CATCACGGTG GGTGCGTTTG ATTTGCCGGG GTTGACGGTG
CCGTCGTTGA CGATTCCAGC TACAACGACA CCAGCCAACA TCACGGTAGG TGCGTTTAAC
TTGCCTCAGT TGAGTATTCC GTCGGTGACG GTTCCGCCGA TCACGATTCC GGCTGGCACA
GCGCTAGGTG CGTTCAATCT GCCGACGCTG AGTATTCCGT CGGTGACGGT TCCGCCGATC
ACGATTCCGG CTGGCACCAC TGTCGGCGGA TTTACGCTAC CCACGATACA CACCCCGTTA
ATAAGTACAC CCCAAATAAG TATAGGCGGC TTTAGCACTC CCGGCATAGC CACGCAAGCA
AATTCTGGTG TCATCAATCT TCCCACCTTT AGCCTTAACG GCATTACGAT AACTAATTTG
GTGGTGTTCA TTCCGAACAA CATCACTGCC TTGCAAACCA ATATGCCCGG GGTATTCCCG
CAGATTGGCG GCTTCGCTAA TACACCTCCT GCCTTTATTA ATACTGGGAC CATTACCGTG
GGTGGAGGTC AAATCAACGG CGTCGGCTTC TCGATCGGCG CAATCAACGT CACCCCCTTC
ACCCTCCCCA ACGTCGTCAT CCAACCGTGG TCCCTCGGGG GGATCTCGGT CGACGGGTTC
ACCCTGCCAG AGATCAGCAC CCAAGAATTC ACCACTCCGG CGTTGACGAT CAGTCCGATT
GGTGTCGGTG CATTGAGCCT GCCGGATATC ACTACTCAAC AGTTCACGAC CCCGGAGTTG
ACCATCGACC CGATCACGCT GGGTGGGTTT ACGCTGCCGC AGCTCAGCAT CCCGGCGATT
ACCACCCCGG CGTTCACGAT CGATCCGATA GCGCTGGGTG GTTTCACGCT TCCTCAGATC
ATGACGCCCG AGATAACGAC TCCACCGTTC GCCATCGACC CGATCGGACT TAGCGGTTTC
ACCCTCCCCC AGGTCAATAT CCCGGAGATC ACCACGCCAG AGTTCACCAT CCAGCCGGTG
GGCTTGGCGG CCTTCACCAC ACCCGCACTC ACCATCGCCA GCATCCACCT GCCGAGCACC
ACCATGGGCG GATTCGCAAT CCCAGCGGGG CCGGGATACT TCAACTCGAG CGCAACGCCC
TCGTTGGGCT TTTTCAACGC CGGAATCGGT GGGAACTCGG GCTTCGGCAA CAGCGGCTCG
GGACTGTCGG GTTGGTTCAA CACAAGTCCT GTTGGGCTGC TAGCCGGCTC GGGCTACCAG
AACTACGGTG GTCTTATCTC CGGCTTCTCC AACCTTGGCA GCGGCATATC GGGCTTCGCC
AACACCGGCA CCCTGCCGTT TGCCGTGACC AGCTTGGTCT CCGGTTTGGC CAACATCGGC
AACAACCTGT CGGGCCTGTT CTTCCAGAGC ACCACGCCAT AA
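
The gene sequence is displayed in blocks of ten bases separated by spaces and line breaks, so it needs cleaning before any computation. As a minimal sketch, the helpers below strip that whitespace and compute the GC fraction of the cleaned sequence. The function names are hypothetical, and the record does not say how its GC content figure was calculated or rounded, so this may not reproduce the listed value exactly.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring display whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same ten-base block layout as the record.
example = "GGCCGGCCGG AATTAATTAA"
print(clean_sequence(example))  # GGCCGGCCGGAATTAATTAA
print(gc_fraction(example))     # 0.5
```

To apply it to this gene, paste the full gene sequence above into a string and pass it to `gc_fraction`.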
 
Protein sequence
MRYERVSDDG RVTFHQHQAH PGRWGAMHYS VLPPEINSAL IFAGAGSGPM LAAASAWDGL 
ATELASAAVS FGSVTAGLVG GSWQGRSSVA MAAAAAPYAG WLAAAATQAE QAATQAQVMV
AEFEAVRLAM VQPALVAANR SGLISLVISN LFGQNAPAIA AAEAAYEEMW ALDVSAMAAY
HSGASAVAVA LPAFALPLRL PAGLAAGPAA VVTALTTAVG MPTFAGRAIA ASLGLANVGG
GNLGNANNGL GNIGNANLGN NNLGSGNFGS FNIGSANLGG NNIGIGNAGA NNFGLANLGN
LNTGFANAGI GNFGIANTGN NNIGNGLTGN NQIGIGGLNS GNGNVGLFNA GSANIGFFNS
GNGNFGIGNS GNFSTGLFNP GHGNTGFLNA GSFNTGMFDV GNANTGSFNV GHYNFGAFNP
GPSNTGTFNT GGANTGWFNT GSINTGAFNI GDMNNGLFNT GDMNNGVFYR GVGQGSLQFA
ITSPDLTLPS LEIPGISVPA FSLPAITLPS LTIPAVTTPA NVTVGAFDLP GLTVPSLTIP
AAMTPANITV GAFDLPGLTV PSLTIPATTT PANITVGAFN LPQLSIPSVT VPPITIPAGT
ALGAFNLPTL SIPSVTVPPI TIPAGTTVGG FTLPTIHTPL ISTPQISIGG FSTPGIATQA
NSGVINLPTF SLNGITITNL VVFIPNNITA LQTNMPGVFP QIGGFANTPP AFINTGTITV
GGGQINGVGF SIGAINVTPF TLPNVVIQPW SLGGISVDGF TLPEISTQEF TTPALTISPI
GVGALSLPDI TTQQFTTPEL TIDPITLGGF TLPQLSIPAI TTPAFTIDPI ALGGFTLPQI
MTPEITTPPF AIDPIGLSGF TLPQVNIPEI TTPEFTIQPV GLAAFTTPAL TIASIHLPST
TMGGFAIPAG PGYFNSSATP SLGFFNAGIG GNSGFGNSGS GLSGWFNTSP VGLLAGSGYQ
NYGGLISGFS NLGSGISGFA NTGTLPFAVT SLVSGLANIG NNLSGLFFQS TTP