Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_13591 |
Symbol | |
ID | 5224280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 4010764 |
End bp | 4012419 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640608360 |
Product | PPE family protein |
Protein accession | YP_001289518 |
Protein GI | 148824764 |
COG category | [N] Cell motility |
COG ID | [COG5651] PPE-repeat proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 219 |
Plasmid unclonability p-value | 0.0000128496 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 188 |
Fosmid unclonability p-value | 0.122271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCATT TTTCGGTGTT GCCGCCGGAG ATCAACTCGT TGCGGATGTA CCTGGGTGCC GGTTCGGCGC CGATGCTTCA GGCGGCGGCC TGGGACGGGC TGGCCGCGGA GTTGGGAACC GCCGCGTCGT CGTTCTCCTC GGTGACCACG GGGTTAACCG GGCAGGCGTG GCAGGGCCCG GCGTCGGCGG CGATGGCCGC CGCGGCGGCG CCGTATGCGG GCTTTTTGAC CACAGCCTCG GCTCAAGCCC AGCTGGCTGC CGGGCAGGCT AAGGCGGTGG CCAGCGTGTT CGAGGCCGCC AAGGCCGCGA TCGTGCCTCC GGCCGCGGTG GCGGCCAACC GTGAGGCGTT CTTGGCGTTG ATTCGGTCGA ATTGGCTGGG GCTCAACGCG CCGTGGATCG CCGCCGTTGA AAGCCTTTAC GAGGAATACT GGGCCGCTGA TGTGGCGGCG ATGACCGGCT ATCACGCCGG GGCCTCGCAG GCCGCCGCGC AGTTGCCGTT GCCGGCCGGC CTGCAACAGT TCCTCAACAC CCTGCCCAAT CTGGGCATCG GCAACCAGGG CAACGCCAAC CTCGGCGGCG GCAACACCGG CAGCGGCAAC ATCGGCAACG GAAACAAAGG CAGCTCCAAC CTCGGCGGCG GCAACATCGG CAATAACAAC ATCGGCAGCG GCAACCGAGG CAGCGACAAC TTCGGCGCCG GCAACGTCGG CACCGGAAAC ATCGGCTTCG GCAACCAGGG CCCCATAGAC GTTAACCTCT TGGCGACGCC GGGCCAGAAC AACGTGGGCC TGGGCAACAT CGGCAACAAC AACATGGGCT TCGGCAACAC CGGCGACGCC AACACCGGCG GCGGCAACAC CGGCAACGGC AACATCGGTG GCGGCAACAC CGGCAACAAC AACTTCGGCT TCGGCAACAC CGGCAACAAC AACATCGGAA TCGGGCTCAC CGGCAACAAT CAGATGGGCA TCAACCTGGC CGGGCTGCTG AACTCCGGCA GCGGCAATAT CGGCATCGGC AACTCCGGCA CCAACAACAT CGGCTTGTTC AACTCCGGCA GCGGCAACAT CGGCGTCTTC AACACCGGAG CCAATACCCT GGTGCCTGGC GACCTCAACA ACCTGGGCGT CGGGAATTCC GGCAACGCCA ACATCGGCTT CGGGAACGCG GGCGTTCTCA ACACCGGCTT CGGGAACGCG AGCATCCTCA ACACCGGCTT GGGGAACGCG GGTGAATTAA ACACCGGCTT CGGAAACGCG GGCTTCGTCA ACACGGGGTT TGACAACTCC GGCAACGTCA ACACCGGCAA TGGGAACTCG GGCAACATCA ACACCGGCTC GTGGAATGCG GGCAATGTGA ACACCGGTTT CGGGATCATT ACCGACAGCG GCCTGACCAA CTCGGGCTTC GGCAACACCG GCACCGACGT CTCGGGCTTC TTCAACACCC CCACCGGCCC CTTAGCCGTC GACGTCTCCG GGTTCTTCAA CACGGCCAGC GGGGGCACTG TCATCAACGG CCAGACCTCG GGCATTGGCA ACATCGGCGT CCCGGGCACC CTCTTTGGCT CCGTCCGGAG CGGCTTGAAC ACGGGCCTGT TTAACATGGG CACCGCCATA TCGGGGTTGT TCAACCTGCG CCAGCTGTTG GGGTAG
|
Protein sequence | MAHFSVLPPE INSLRMYLGA GSAPMLQAAA WDGLAAELGT AASSFSSVTT GLTGQAWQGP ASAAMAAAAA PYAGFLTTAS AQAQLAAGQA KAVASVFEAA KAAIVPPAAV AANREAFLAL IRSNWLGLNA PWIAAVESLY EEYWAADVAA MTGYHAGASQ AAAQLPLPAG LQQFLNTLPN LGIGNQGNAN LGGGNTGSGN IGNGNKGSSN LGGGNIGNNN IGSGNRGSDN FGAGNVGTGN IGFGNQGPID VNLLATPGQN NVGLGNIGNN NMGFGNTGDA NTGGGNTGNG NIGGGNTGNN NFGFGNTGNN NIGIGLTGNN QMGINLAGLL NSGSGNIGIG NSGTNNIGLF NSGSGNIGVF NTGANTLVPG DLNNLGVGNS GNANIGFGNA GVLNTGFGNA SILNTGLGNA GELNTGFGNA GFVNTGFDNS GNVNTGNGNS GNINTGSWNA GNVNTGFGII TDSGLTNSGF GNTGTDVSGF FNTPTGPLAV DVSGFFNTAS GGTVINGQTS GIGNIGVPGT LFGSVRSGLN TGLFNMGTAI SGLFNLRQLL G
|
| |