Gene Information |
Locus tag | TBFG_11771 |
Symbol | |
ID | 5222451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 1978314 |
End bp | 1981241 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640606531 |
Product | PPE family protein |
Protein accession | YP_001287714 |
Protein GI | 148822960 |
COG category | [N] Cell motility |
COG ID | [COG5651] PPE-repeat proteins |
TIGRFAM ID | |
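The length fields in the table above follow from the coordinates. Here is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1978314
end_bp = 1981241

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 2928
print(protein_length_aa)  # 975
```

Both results agree with the Gene Length and Protein Length rows.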
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1.49393e-28 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 125 |
Fosmid unclonability p-value | 1.53091e-08 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTTTT CTGTACTGCC GCCGGAGATC AATTCAGCGC TGATATTCGC CGGGGCAGGG CCGGAACCGA TGGCGGCGGC CGCGACGGCC TGGGACGGGT TGGCCATGGA ATTGGCCTCG GCCGCAGCCT CTTTCGGCTC AGTGACATCC GGACTCGTGG GCGGGGCGTG GCAGGGCGCG TCGTCGTCGG CGATGGCGGC AGCGGCAGCC CCCTATGCGG CGTGGCTTGC CGCGGCGGCG GTCCAGGCCG AGCAGACGGC CGCTCAGGCT GCGGCGATGA TAGCCGAGTT TGAAGCGGTC AAGACGGCGG TGGTGCAGCC GATGCTGGTG GCGGCCAACC GTGCCGACCT GGTGTCGCTG GTGATGTCGA ACCTGTTTGG ACAGAACGCT CCGGCGATCG CTGCCATTGA AGCCACGTAC GAGCAAATGT GGGCTGCCGA TGTGTCGGCG ATGTCTGCCT ACCATGCCGG GGCATCGGCG ATCGCCTCGG CGCTGTCCCC GTTCAGTAAA CCGCTGCAGA ACCTGGCTGG CTTGCCGGCT TGGTTGGCCA GCGGCGCGCC TGCGGCCGCC ATGACCGCAG CCGCAGGCAT ACCGGCGCTT GCGGGCGGAC CCACCGCCAT CAACCTGGGC ATAGCCAACG TCGGCGGTGG CAACGTCGGC AACGCCAACA ACGGCCTTGC CAACATCGGC AACGCCAACC TTGGCAACTA CAATTTCGGG TCCGGAAATT TCGGTAACTC CAATATCGGC TCAGCAAGCC TGGGTAATAA CAACATCGGC TTCGGGAACC TCGGCAGCAA CAATGTCGGC GTGGGAAACC TTGGCAATCT CAACACCGGG TTTGCCAACA CCGGCTTGGG CAACTTCGGC TTTGGCAACA CTGGCAACAA CAACATCGGC ATCGGTCTTA CCGGCAACAA CCAGATCGGA ATCGGCGGGC TCAACTCGGG CACCGGGAAT TTCGGATTGT TCAACTCGGG CAGCGGAAAC GTCGGCTTCT TCAACTCCGG CAATGGAAAC TTTGGCATCG GAAACTCGGG TAATTTCAAC ACCGGTGGCT GGAATTCTGG ACACGGGAAC ACGGGCTTCT TCAATGCGGG CTCGTTTAAC ACCGGTATGT TGGACGTCGG CAACGCGAAC ACAGGCAGCC TGAACACCGG CAGTTATAAC ATGGGCGACT TCAATCCGGG GTCGTCCAAC ACCGGCACGT TCAACACGGG AAATGCTAAC ACGGGTTTCC TCAACGCCGG AAATATCAAC ACTGGTGTCT TCAATATTGG CCACATGAAT AATGGGCTGT TCAACACGGG TGACATGAAC AATGGCGTCT TCTACCGGGG CGTGGGGCAG GGCAGCCTGC AGTTCAGTAT TACGACACCT GATCTGACTC TGCCGCCGCT GCAAATACCG GGGATATCGG TTCCCGCCTT CAGTCTGCCG GCAATAACGC TGCCGTCGCT GACCATCCCG GCCGCCACCA CACCGGCCAA CATCACCGTC GGCGCCTTCA GCCTGCCCGG GTTGACGTTG CCGTCGTTGA ACATCCCGGC CGCCACCACA CCAGCCAACA TCACCGTCGG CGCCTTCAGC CTGCCCGGGT TGACGTTGCC GTCGTTGAAC ATCCCGGCCG CCACCACACC CGCCAACATC ACCGTAAGCG GCTTTCAGTT GCCTCCGCTG AGTATTCCTT CCGTAGCCAT TCCGCCGGTG ACGGTCCCGC CCATTACGGT GGGTGCTTTT AATTTGCCGC CATTGCAGAT TCCGGAAGTA ACTATTCCGC AGCTGACGAT ACCCGCGGGT 
ATCACAATCG GTGGCTTTAG TCTACCTGCG ATACATACTC AACCGATAAC GGTCGGCCAG ATTGGCGTGG GCCAATTTGG CCTGCCCTCC ATAGGCTGGG ATGTTTTCCT AAGCACACCT AGGATAACAG TACCGGCTTT TGGAATACCC TTTACCCTAC AATTCCAGAC CAATGTGCCT GCGCTTCAGC CGCCCGGCGG CGGGCTTAGT ACTTTCACCA ATGGCGCCCT CATCTTCGGT GAGTTTGACT TACCACAATT GGTGGTTCAC CCATACACAT TGACCGGCCC TATTGTCATC GGTTCATTCT TTCTGCCCGC CTTCAACATA CCCGGGATCG ATGTCCCCGC TATCAACGTC GATGGCTTCA CCCTGCCGCA GATCACCACC CCAGCTATCA CCACCCCGGA GTTCGCGATC CCTCCGATCG GCGTGGGCGG CTTCACTCTG CCGCAGATCA CCACCCAGGA AATCATCACC CCGGAGCTAA CCATCAACTC GATCGGCGTC GGCGGGTTCA CCCTGCCGCA AATCACCACC CCACCCATCA CCACCCCACC GCTGACCATC GACCCCATCA ACCTCACCGG CTTCACCCTC CCCCAAATCA CCACCCCACC CATCACCACC CCACCGCTGA CCATCGACCC CATCAACCTC ACCGGCTTCA CCCTCCCCCA AATCACCACC CCACCCATCA CCACCCCACC GCTCACCATC GAGCCGATCG GCGTGGGGGG CTTCACCACG CCCCCGCTCA CCGTTCCCGG CATCCACCTG CCCAGCACCA CGATCGGGGC CTTCGCGATC CCCGGGGGGC CGGGCTACTT CAACTCGAGC ACCGCGCCTT CGTCGGGCTT CTTCAATTCC GGTGCGGGCG GCAACTCGGG CTTCGGCAAC AACGGCTCGG GCCTCTCGGG TTGGTTCAAC ACCAACCCGG CCGGGCTGTT GGGCGGCTCG GGCTATCAGA ACTTCGGCGG GCTATCCTCG GGCTTTTCCA ACCTTGGCAG CGGCGTCTCA GGCTTCGCCA ACAGGGGCAT CCTGCCGTTC TCGGTAGCCA GCGTCGTTTC CGGCTTTGCC AATATCGGCA CCAACCTGGC GGGTTTCTTC CAAGGCACCA CGTCCTAA
|
Protein sequence | MNFSVLPPEI NSALIFAGAG PEPMAAAATA WDGLAMELAS AAASFGSVTS GLVGGAWQGA SSSAMAAAAA PYAAWLAAAA VQAEQTAAQA AAMIAEFEAV KTAVVQPMLV AANRADLVSL VMSNLFGQNA PAIAAIEATY EQMWAADVSA MSAYHAGASA IASALSPFSK PLQNLAGLPA WLASGAPAAA MTAAAGIPAL AGGPTAINLG IANVGGGNVG NANNGLANIG NANLGNYNFG SGNFGNSNIG SASLGNNNIG FGNLGSNNVG VGNLGNLNTG FANTGLGNFG FGNTGNNNIG IGLTGNNQIG IGGLNSGTGN FGLFNSGSGN VGFFNSGNGN FGIGNSGNFN TGGWNSGHGN TGFFNAGSFN TGMLDVGNAN TGSLNTGSYN MGDFNPGSSN TGTFNTGNAN TGFLNAGNIN TGVFNIGHMN NGLFNTGDMN NGVFYRGVGQ GSLQFSITTP DLTLPPLQIP GISVPAFSLP AITLPSLTIP AATTPANITV GAFSLPGLTL PSLNIPAATT PANITVGAFS LPGLTLPSLN IPAATTPANI TVSGFQLPPL SIPSVAIPPV TVPPITVGAF NLPPLQIPEV TIPQLTIPAG ITIGGFSLPA IHTQPITVGQ IGVGQFGLPS IGWDVFLSTP RITVPAFGIP FTLQFQTNVP ALQPPGGGLS TFTNGALIFG EFDLPQLVVH PYTLTGPIVI GSFFLPAFNI PGIDVPAINV DGFTLPQITT PAITTPEFAI PPIGVGGFTL PQITTQEIIT PELTINSIGV GGFTLPQITT PPITTPPLTI DPINLTGFTL PQITTPPITT PPLTIDPINL TGFTLPQITT PPITTPPLTI EPIGVGGFTT PPLTVPGIHL PSTTIGAFAI PGGPGYFNSS TAPSSGFFNS GAGGNSGFGN NGSGLSGWFN TNPAGLLGGS GYQNFGGLSS GFSNLGSGVS GFANRGILPF SVASVVSGFA NIGTNLAGFF QGTTS
|
| |
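The gene and protein sequences above can be checked against each other by translating the first and last few codons. The following is a rough sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The helper name `translate` and the table construction are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
first_bases = "ATGAATTTTT CTGTACTGCC GCCGGAGATC"
last_bases = "CAAGGCACCA CGTCCTAA"

print(translate(first_bases))  # MNFSVLPPEI
print(translate(last_bases))   # QGTTS
```

The two outputs match the first ten and last five residues of the protein sequence listed in the record.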