Gene Information |
Locus tag | TBFG_13541 |
Symbol | |
ID | 5224230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 3938922 |
End bp | 3943133 |
Gene Length | 4212 bp |
Protein Length | 1403 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 640608310 |
Product | PE-PGRS family protein |
Protein accession | YP_001289468 |
Protein GI | 148824714 |
COG category | |
COG ID | |
TIGRFAM ID | |
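The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3938922
END_BP = 3943133


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4212, matches "Gene Length"
print(protein_length(length_bp))  # 1403, matches "Protein Length"
```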
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1.33825e-18 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 142 |
Fosmid unclonability p-value | 0.000118269 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGTTTG TGTTGGTTTC GCCGGAGACC GTGGCGGCGG TGGCCACGGA TCTCAAGCGC ATCGGCGCCT CGCTGGCCCA CGAAAACGCG TCGGCGGCCG CTTCGACGAC GGCGGTGGTC TCCGCGGCCG CCGACGAGGT ATCGACGGCG GTCGCCGCTC TGTTCTCCCA ACACGCCCAG GGCTACCAAG CGGCGGCCGC TCAGGTAGCA GCGTTTCATA GCCGGTTTGT GCAAGCCCTG ACGGCCGGTG CCGGGGCGTA CGCATTTGCC GAGGCGGCCA ACGCGTCGCC GCTACAGTCA GCCATGGGTG CGGTAAGCGC GTCTGCGCAG ACGCTGTTGT CGCGCCCGTT GATCGGCAAT GGCGCCAATG CGACGACGCC GGGCGGTAAC GGCGGCGACG GCGGATGGCT ATTCGGCAGC GGCGGCAACG GCGCGCCCGG CGCGGCGGGC CAGTCCGGCG GTAACGGCGG GTCAGCCGGA CTGTGGGGTA ACGGCGGCGC GGGTGGCGCC GGCGGCAGCG GCGGCGCCGC CGGCGGCAAC GGCGGTAACG GCGGGTGGCT GTTCGGCGCC GGCGGCACCG GCGGTATCGG CGGCACCGGT GCTCCCGGCG CCATGGGCGG CACCGGCGGC AACGGCGGCA ACGGCGCGCT GCTGATCGGC GGCGGCGGCC TCGGCGGCGC CGGCGGCATG GGTGGCACCG GCGGCGGCAC CGGCGGCACC GGCGGCAACG GCGGCAACGG CGCGCTGCTG ATCGGCGCTG GTGGTGTCGG AGGTGCTGGC GGGATCGGTG GCCAGGGTAC CGGCGCCGGC GGTGCCGCCG GCGCCGGCGG CACCGGGGGC AACGGCGGCG CCGGGGGGTT GTTCATGAAC GGCGGCGACG GCGGCGCCGG CGGTCAAGGC GGCGACGGTG CGGCCGGCGA CGCGGCTGCC AGCGCCGGCG GCACCGGCGG CAAAGGCGGC CAAGGCGGCG ACGGCGGCAC CGGAGGGGCC GGCGGCGCAG GCCCAGTGCT GTTCGGCCAC GGCGGCGCCG GCGGCATGGG CGGCCAAGGC GGCACCGGTG GAATGGGCGG CGCCGGCGGA GACGGCACCA CCGTCATCGC GGCCGGTACC GGGGGGGAGG GCGGCACCGG CGGCGCGGCC GGCGCCGGCG GAGCCGCAGG CGCTCGCGGG GCTCTCACCA GCGGCGGCCT AGCCGGCGGC GTCGGGGCCG GCGGCACCGG CGGCACCGGC GGTACCGGCG GCAACGGCGC TGACGCCGCT GCTGTGGTGG GCTTCGGCGC GAACGGCGAC CCTGGCTTCG CTGGCGGCAA AGGCGGTAAC GGCGGAATAG GTGGGGCCGC GGTGACAGGC GGGGTCGCCG GCGACGGCGG CACCGGCGGC AAAGGTGGCA CCGGCGGTGC CGGCGGCGCC GGCAACGACG CCGGCAGCAC CGGCAATCCC GGCGGTAAGG GCGGCGACGG CGGGATCGGC GGTGCCGGCG GGGCCGGCGG CGCGGCCGGC ACCGGCAACG GCGGCCATGC CGGCAACACA GGTGACGGCG GCGACGGCGG GACCGGCGGT AACGGCGGCA ACGGCACCGG AGGCGTGAAC GGCGCCGACA ACACCCTCAA CCCCGACACC CCCGGCGGCG CCGGGGAGCC CGGCGGGGCC GGCGGGGCCG GCGGGGCCGG CGGGGCCGCC GGCGGCCCGG GCGGTACCGG CGGTACCGGC GGTAACGGCG GCAACGGCGG CAACGGCGGC AACGGCGGCA ACGGCGGCAA CGGCGGCAAC GGCGGCAATG CCGGCAACAA CAGCACCAAT 
GCCCCAGTCG GTGGCGAAGG CGGCGCCGGC GGCGACGGCG GCGCCGGCGG CGCAGGCGGG GCCGCCAACG GCGGCACCGC GGGCAGCCAG GGCACTGGGG GCGTCGGCGG CGACGGCGGC GCGGGCGGCA ACGGCGGCGG CGGCAAGGCT GGCACCGGCA ACAGCGGCAA CTTTGGGGTG GACGGCGAAG CCGGCTTCAG CGGCGGCGCC GGTGGCAACG GCGGCGTAGG CGGGGCCGCC GGCGCCAATG GCGGAACCGG CGGCAGCGGT GGTAATGGCG GTGACGGCGG TGCGGGAGGC ATTGGCGGGG CCGGCGGCAA CGGCATACCG GGCACTGGCA CAGAGCCTGC CGGGGGCACC GGCGCCAAAG GTGGAGACGG CGGCGACGGT GGCGCCGGCG GCGCAGGCGG CAATGCCGGC GGGGCCGGCG GTAACGGCGG GGCCGGCGGC CAGGGCGGCA ATGCCGGCCA GGGTGGCGCC GGCGGTGCGG GCGGCAACGC CGTGATTCCC GGCGACGGCG TCGGGAAGGC GCCGCACGGC GACGCGGGCG GCAGCGGCGG AGACGGCGGC AAAGGCGGCC AGGGCGGTAG TGGCGGCACC GGCGGATCCG GTGCCCCGAT CGGTGGCGGC GCCGGAGGCA CCGGAGGGTC CGGCGGACAC GCCGGCAAGG GTGGCGCCGG CGGCATCGGC GCACAGGGCA CCACCATCAC CGTGCCCGGG AACGGCGGCA ACGCCGGCGA CGGCGGCAAC GGCGGCAACG CCGGCGCCGG TGGAAACGGC GGCTCCGGCG ACTTCGGTGG CAATACCACC AGCGGCGCCT CCGGCAGCGG CGGCAACGGC GGCAACGCCG GCACCGCGGG TAGCGGCGGT GCGGGCGGAA CCGGCGGCAC CGGCCTTAGC GGCGGCAACG GTGGCAACGG CGGCAACGGC GGCAACGGCG GTGACGGCGG TAACGGCGCC CACGGCACCG TCGGCGCCCA GTTCGTCCCG GCCACCAGCT TGCCCACACC CAACGGCGGG GCCGGTGGCA ACGGTGGCAC CGGAAGCAAC GGCGGCGCGC CCGGCCCCGC CGGGGCGCCC GGCCCCACTA CCGGCGGTAA CGCTGGCAGC CAGGGCATCG GCGGCGACGG GGGCAACGGC GGCGACGGCG GTAAAGGCGG TGACGGCGCC GACGCTGTCA ACGTCGTATT CATGCCGACT GAGCCACAGG CCGCGACCGG CACTGCCGGC AGCGCCGGTG ACCCCACCGG CGGTAACGGA GGGCCCGGCA CTCCCGGCAG CCCCATGGTT GCCCCGCCCC CGCCAACGCC AATCACTCAA GTCCAACAGG GCGGTGACGG TGGCGCCGGG GGCACCGGAT CCACCAACGC CAACGACGGC ACAGCCACCG GCGGAAAGGG CGGAGAAGGC GGAGTCGGCA GCATTCTCGG CGGGCCCGGC GGCAACGGCG GAACTGGCGG CAACGCCTCG GCAACCGGCA CCAACGGGGT GGCCAACGCC GGGAATGGCG GCAAGGGTGG CGACGGCGGC CAGTTTGGGG CCGGCGGCAA CGGTGGTGCC GGCGGCAGCG TAACCGACGG ATCCGCCGGC AGCACCGCAG GCAACGGCGG CAACGGCGGC AACGCAACCA ACGGCACCAT CGCAGGCCAA CCCGCCGGCG GCAACGGCTC GGCCGGCGGG AAAGGCGGCG ACGGCGGCAA CATCGCCGCC GGTGCCACCG GCACCGCCGG CAACGGCGGG AACGGCGGCA ACGGCAACGA CGGCGCCGTC AACGCCGGCA CCGGCGGCTC CGGCGGGAAC GGCGGTAACG 
CCGGTGGCGG CGGCGCCAAT GGCGGCGACG GCGGCGCCGG CGGCGCCGGC GGGGCCAATG GCGGCGACGG CGGCGCCGGC GGCGCCGGCG GGGCCGGCGG GCGTGGCGGC AAGGGCATCG ACGGCGGGTT CGGCGGTGAC GGCGGCAACG GCGGCAGCAA CAACGGCACC GGCGCCGGTG GCAACGGCGG CAACGGCGGC ACCGGCGGGG TCGGCTCGGT TGGCGCGGCT GGTGGCGATG GCGGCAACGG CGGCACCGGC GGCTTCGCCG GTTTCGGCGG CACCGCAGGC AATGGCGGTT CCGGCGGCAC GGGCGGGGCC GGCGGCGACG GCGGCACCGG CGGGGACGGC GGCAACGGCG GCACCGGCGT TATCGCCGGC GGCGGGGGGA CCGGCGGCAA CGGCGGCGCC AGCGGGGCCG GCGGCGCCGG CGGCACGGGC GGGTTCGCCG GCAACGGCAA TGCCGGCGGC AATGGCGGCA CCGGCGGCGC GAGCGAGGAC GGCGACAACG GCAACGCTGG CAGCGGCGCC ACCGGCGGTA CCGGCGGCAA CGGCGGCACC GGCGGCGACG GCGGCGCTGC CGGGCTGGGC GGCGTCGCGT GA
|
Protein sequence | MSFVLVSPET VAAVATDLKR IGASLAHENA SAAASTTAVV SAAADEVSTA VAALFSQHAQ GYQAAAAQVA AFHSRFVQAL TAGAGAYAFA EAANASPLQS AMGAVSASAQ TLLSRPLIGN GANATTPGGN GGDGGWLFGS GGNGAPGAAG QSGGNGGSAG LWGNGGAGGA GGSGGAAGGN GGNGGWLFGA GGTGGIGGTG APGAMGGTGG NGGNGALLIG GGGLGGAGGM GGTGGGTGGT GGNGGNGALL IGAGGVGGAG GIGGQGTGAG GAAGAGGTGG NGGAGGLFMN GGDGGAGGQG GDGAAGDAAA SAGGTGGKGG QGGDGGTGGA GGAGPVLFGH GGAGGMGGQG GTGGMGGAGG DGTTVIAAGT GGEGGTGGAA GAGGAAGARG ALTSGGLAGG VGAGGTGGTG GTGGNGADAA AVVGFGANGD PGFAGGKGGN GGIGGAAVTG GVAGDGGTGG KGGTGGAGGA GNDAGSTGNP GGKGGDGGIG GAGGAGGAAG TGNGGHAGNT GDGGDGGTGG NGGNGTGGVN GADNTLNPDT PGGAGEPGGA GGAGGAGGAA GGPGGTGGTG GNGGNGGNGG NGGNGGNGGN GGNAGNNSTN APVGGEGGAG GDGGAGGAGG AANGGTAGSQ GTGGVGGDGG AGGNGGGGKA GTGNSGNFGV DGEAGFSGGA GGNGGVGGAA GANGGTGGSG GNGGDGGAGG IGGAGGNGIP GTGTEPAGGT GAKGGDGGDG GAGGAGGNAG GAGGNGGAGG QGGNAGQGGA GGAGGNAVIP GDGVGKAPHG DAGGSGGDGG KGGQGGSGGT GGSGAPIGGG AGGTGGSGGH AGKGGAGGIG AQGTTITVPG NGGNAGDGGN GGNAGAGGNG GSGDFGGNTT SGASGSGGNG GNAGTAGSGG AGGTGGTGLS GGNGGNGGNG GNGGDGGNGA HGTVGAQFVP ATSLPTPNGG AGGNGGTGSN GGAPGPAGAP GPTTGGNAGS QGIGGDGGNG GDGGKGGDGA DAVNVVFMPT EPQAATGTAG SAGDPTGGNG GPGTPGSPMV APPPPTPITQ VQQGGDGGAG GTGSTNANDG TATGGKGGEG GVGSILGGPG GNGGTGGNAS ATGTNGVANA GNGGKGGDGG QFGAGGNGGA GGSVTDGSAG STAGNGGNGG NATNGTIAGQ PAGGNGSAGG KGGDGGNIAA GATGTAGNGG NGGNGNDGAV NAGTGGSGGN GGNAGGGGAN GGDGGAGGAG GANGGDGGAG GAGGAGGRGG KGIDGGFGGD GGNGGSNNGT GAGGNGGNGG TGGVGSVGAA GGDGGNGGTG GFAGFGGTAG NGGSGGTGGA GGDGGTGGDG GNGGTGVIAG GGGTGGNGGA SGAGGAGGTG GFAGNGNAGG NGGTGGASED GDNGNAGSGA TGGTGGNGGT GGDGGAAGLG GVA
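The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a rough sketch of how the whitespace could be stripped and a GC percentage computed, for example to compare against the reported "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) DNA string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGTCGTTTG TGTTGGTTTC GCCGGAGACC"
print(len(clean_sequence(first_blocks)))  # 30
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence string to `gc_percent` would give the value to compare with the table; the short sample here covers only the start of the gene and is not expected to match it.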
|
| |