Gene Information |
Locus tag | TBFG_11947 |
Symbol | |
ID | 5222624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 2175904 |
End bp | 2178945 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640606704 |
Product | PPE family protein |
Protein accession | YP_001287884 |
Protein GI | 148823130 |
COG category | [N] Cell motility |
COG ID | [COG5651] PPE-repeat proteins |
TIGRFAM ID | |
|
|
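The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2175904
END_BP = 2178945
GENE_LENGTH_BP = 3042
PROTEIN_LENGTH_AA = 1013


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed to be included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the interval spans 3042 bp, and 3042 / 3 - 1 gives 1013 residues.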
Plasmid Coverage information |
Num covering plasmid clones | 241 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 201 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACG AACGAGTCAG CGACGACGGG CGTGTGACCT TTCATCAGCA CCAGGCTCAC CCCGGAAGGT GGGGAGCGAT GCATTACTCA GTGTTGCCGC CGGAGATCAA CTCGGCCTTG ATCTTCGCCG GGGCGGGCTC CGGACCGATG CTGGCGGCGG CGTCGGCCTG GGACGGGCTG GCAACCGAAT TAGCCTCGGC TGCTGTCTCT TTCGGCTCGG TGACAGCCGG GCTGGTCGGC GGGTCGTGGC AGGGTCGGTC ATCGGTGGCG ATGGCAGCGG CGGCAGCCCC GTATGCGGGG TGGCTGGCCG CGGCGGCGAC CCAGGCCGAG CAGGCGGCCA CCCAGGCCCA GGTGATGGTG GCCGAGTTCG AGGCTGTGCG GCTGGCGATG GTACAACCGG CGCTGGTGGC CGCCAACCGT TCCGGCCTCA TATCGCTGGT GATATCGAAC CTTTTTGGTC AAAACGCTCC CGCGATCGCG GCCGCCGAAG CCGCATACGA GGAGATGTGG GCTCTGGATG TATCGGCGAT GGCGGCCTAC CATTCCGGGG CGTCGGCGGT CGCTGTGGCG CTACCGGCAT TCGCCCTCCC GCTGCGGCTT CCGGCGGGTC TGGCGGCCGG GCCCGCGGCC GTGGTGACCG CGCTCACCAC GGCCGTGGGC ATGCCGACTT TTGCCGGCCG GGCGATCGCC GCTAGCCTCG GCTTGGCCAA CGTCGGTGGT GGCAACCTCG GCAATGCCAA CAATGGGCTC GGCAACATCG GCAACGCCAA CCTTGGCAAC AACAATCTGG GGTCCGGCAA CTTCGGTAGC TTCAATATCG GCTCGGCCAA CCTAGGTGGC AACAACATCG GCATAGGAAA CGCGGGCGCC AACAACTTCG GACTTGCAAA CCTGGGCAAT TTGAACACGG GATTCGCCAA TGCAGGCATC GGCAACTTCG GAATTGCGAA CACCGGCAAC AACAATATCG GCAACGGCCT GACTGGAAAC AACCAAATCG GCATTGGCGG ACTCAATTCC GGCAACGGTA ACGTCGGATT ATTCAACGCG GGTAGCGCCA ATATCGGTTT CTTCAACTCC GGCAATGGCA ACTTTGGCAT CGGGAACTCC GGTAACTTCA GCACTGGCCT GTTCAACCCC GGACACGGCA ACACCGGATT CCTGAATGCG GGCTCTTTCA ATACGGGCAT GTTCGACGTT GGGAACGCGA ACACCGGCAG CTTCAACGTC GGCCACTACA ACTTCGGTGC CTTCAACCCG GGCCCGTCGA ACACGGGTAC CTTCAACACG GGCGGCGCCA ACACCGGCTG GTTCAACACA GGAAGCATCA ACACCGGCGC CTTCAACATA GGCGACATGA ATAACGGCTT GTTCAACACG GGCGACATGA ACAATGGTGT CTTTTACCGT GGTGTGGGCC AAGGCAGCCT GCAGTTCGCC ATCACCAGCC CTGATTTGAC GCTTCCGTCT CTGGAAATAC CCGGAATCTC GGTTCCCGCG TTCAGCCTGC CCGCGATAAC CTTGCCGTCG TTGACGATTC CGGCGGTGAC GACGCCGGCC AACGTTACCG TGGGTGCGTT TGATTTGCCG GGGTTGACGG TGCCGTCGTT GACGATTCCA GCGGCGATGA CGCCAGCTAA CATCACGGTG GGTGCGTTTG ATTTGCCGGG GTTGACGGTG CCGTCGTTGA CGATTCCAGC TACAACGACA CCAGCCAACA TCACGGTAGG TGCGTTTAAC TTGCCTCAGT TGAGTATTCC GTCGGTGACG GTTCCGCCGA TCACGATTCC GGCTGGCACA 
GCGCTAGGTG CGTTCAATCT GCCGACGCTG AGTATTCCGT CGGTGACGGT TCCGCCGATC ACGATTCCGG CTGGCACCAC TGTCGGCGGA TTTACGCTAC CCACGATACA CACCCCGTTA ATAAGTACAC CCCAAATAAG TATAGGCGGC TTTAGCACTC CCGGCATAGC CACGCAAGCA AATTCTGGTG TCATCAATCT TCCCACCTTT AGCCTTAACG GCATTACGAT AACTAATTTG GTGGTGTTCA TTCCGAACAA CATCACTGCC TTGCAAACCA ATATGCCCGG GGTATTCCCG CAGATTGGCG GCTTCGCTAA TACACCTCCT GCCTTTATTA ATACTGGGAC CATTACCGTG GGTGGAGGTC AAATCAACGG CGTCGGCTTC TCGATCGGCG CAATCAACGT CACCCCCTTC ACCCTCCCCA ACGTCGTCAT CCAACCGTGG TCCCTCGGGG GGATCTCGGT CGACGGGTTC ACCCTGCCAG AGATCAGCAC CCAAGAATTC ACCACTCCGG CGTTGACGAT CAGTCCGATT GGTGTCGGTG CATTGAGCCT GCCGGATATC ACTACTCAAC AGTTCACGAC CCCGGAGTTG ACCATCGACC CGATCACGCT GGGTGGGTTT ACGCTGCCGC AGCTCAGCAT CCCGGCGATT ACCACCCCGG CGTTCACGAT CGATCCGATA GCGCTGGGTG GTTTCACGCT TCCTCAGATC ATGACGCCCG AGATAACGAC TCCACCGTTC GCCATCGACC CGATCGGACT TAGCGGTTTC ACCCTCCCCC AGGTCAATAT CCCGGAGATC ACCACGCCAG AGTTCACCAT CCAGCCGGTG GGCTTGGCGG CCTTCACCAC ACCCGCACTC ACCATCGCCA GCATCCACCT GCCGAGCACC ACCATGGGCG GATTCGCAAT CCCAGCGGGG CCGGGATACT TCAACTCGAG CGCAACGCCC TCGTTGGGCT TTTTCAACGC CGGAATCGGT GGGAACTCGG GCTTCGGCAA CAGCGGCTCG GGACTGTCGG GTTGGTTCAA CACAAGTCCT GTTGGGCTGC TAGCCGGCTC GGGCTACCAG AACTACGGTG GTCTTATCTC CGGCTTCTCC AACCTTGGCA GCGGCATATC GGGCTTCGCC AACACCGGCA CCCTGCCGTT TGCCGTGACC AGCTTGGTCT CCGGTTTGGC CAACATCGGC AACAACCTGT CGGGCCTGTT CTTCCAGAGC ACCACGCCAT AA
|
Protein sequence | MRYERVSDDG RVTFHQHQAH PGRWGAMHYS VLPPEINSAL IFAGAGSGPM LAAASAWDGL ATELASAAVS FGSVTAGLVG GSWQGRSSVA MAAAAAPYAG WLAAAATQAE QAATQAQVMV AEFEAVRLAM VQPALVAANR SGLISLVISN LFGQNAPAIA AAEAAYEEMW ALDVSAMAAY HSGASAVAVA LPAFALPLRL PAGLAAGPAA VVTALTTAVG MPTFAGRAIA ASLGLANVGG GNLGNANNGL GNIGNANLGN NNLGSGNFGS FNIGSANLGG NNIGIGNAGA NNFGLANLGN LNTGFANAGI GNFGIANTGN NNIGNGLTGN NQIGIGGLNS GNGNVGLFNA GSANIGFFNS GNGNFGIGNS GNFSTGLFNP GHGNTGFLNA GSFNTGMFDV GNANTGSFNV GHYNFGAFNP GPSNTGTFNT GGANTGWFNT GSINTGAFNI GDMNNGLFNT GDMNNGVFYR GVGQGSLQFA ITSPDLTLPS LEIPGISVPA FSLPAITLPS LTIPAVTTPA NVTVGAFDLP GLTVPSLTIP AAMTPANITV GAFDLPGLTV PSLTIPATTT PANITVGAFN LPQLSIPSVT VPPITIPAGT ALGAFNLPTL SIPSVTVPPI TIPAGTTVGG FTLPTIHTPL ISTPQISIGG FSTPGIATQA NSGVINLPTF SLNGITITNL VVFIPNNITA LQTNMPGVFP QIGGFANTPP AFINTGTITV GGGQINGVGF SIGAINVTPF TLPNVVIQPW SLGGISVDGF TLPEISTQEF TTPALTISPI GVGALSLPDI TTQQFTTPEL TIDPITLGGF TLPQLSIPAI TTPAFTIDPI ALGGFTLPQI MTPEITTPPF AIDPIGLSGF TLPQVNIPEI TTPEFTIQPV GLAAFTTPAL TIASIHLPST TMGGFAIPAG PGYFNSSATP SLGFFNAGIG GNSGFGNSGS GLSGWFNTSP VGLLAGSGYQ NYGGLISGFS NLGSGISGFA NTGTLPFAVT SLVSGLANIG NNLSGLFFQS TTP
|
| |
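The relationship between the gene sequence and the protein sequence listed above can be illustrated with a short translation sketch. This is a hedged example, not the pipeline that produced the record. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation codons. The names `CODON_TABLE`, `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    print(translate("ATGCGCTACG AACGAGTCAG CGACGACGGG"))
```

Applied to the first three blocks of the gene sequence, the sketch yields MRYERVSDDG, which matches the first block of the listed protein sequence. Pasting the full gene sequence into `translate` is one way to compare the result against the whole 1013 aa entry.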