Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_12653 |
Symbol | |
ID | 5223335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 2973422 |
End bp | 2975788 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640607415 |
Product | PE-PGRS family protein |
Protein accession | YP_001288582 |
Protein GI | 148823828 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 153 |
Plasmid unclonability p-value | 0.00000028408 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 212 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTTG TCCGGTCCAG GAGGTCGCAG ATGTCATTTG TGATTGCGGT GCCGGAAGCA TTGACGATGG CGGCTTCGGA TCTGGCCAAC ATTGGGTCGA CGATCAACGC GGCGAATGCG GCGGCGGCAT TGCCGACCAC GGGGGTGGTG GCGGCTGCCG CGGATGAGGT TTCGGCGGCA GTTGCGGCCT TGTTCGGGTC GTACGCGCAG AGCTATCAGG CTTTTGGTGC GCAGCTGTCG GCGTTTCACG CCCAGTTCGT GCAGTCCCTT ACGAACGGCG CGCGCTCATA CGTAGTTGCC GAGGCCACCA GTGCTGCGCC GTTGCAGGAT TTGTTGGGCG TGGTAAATGC CCCCGCCCAG GCGTTGTTGG GGCGCCCGTT GATCGGCAAT GGCGCTAACG GGGCCGACGG GACGGGGGCT CCCGGTGGGC CGGGCGGGCT GTTGCTTGGC AACGGCGGGA ATGGCGGATC GGGTGCGCCG GGTCAGCCAG GTGGTGCTGG CGGGGATGCG GGGTTGATCG GTAACGGCGG GACTGGCGGT AAAGGTGGGG ACGGGCTGGT CGGGTCCGGT GCTGCCGGGG GTGTCGGTGG TCGCGGTGGA TGGTTGCTGG GTAATGGCGG GACCGGTGGG GCTGGTGGGG CTGCAGGGGC CACTTTGGTC GGCGGTACTG GCGGTGTCGG TGGGGCGACG GGGTTGATCG GCAGCGGGGG CTTCGGCGGT GCTGGCGGGG CCGCGGCGGG GGTGGGCACC ACCGGCGGCG TGGGCGGGAG CGGTGGCGTT GGCGGCGTGT TCGGCAATGG TGGATTCGGC GGGGCCGGTG GCCTTGGCGC CGCCGGCGGC GTCGGAGGGG CGGCCAGCTA CTTCGGGACC GGGGGCGGTG GCGGCGTTGG TGGGGACGGT GCGCCCGGTG GTGACGGCGG TGCGGGTCCG CTATTGATCG GCAATGGCGG TGTTGGGGGT CTGGGTGGGG CCGGGGCGGC CGGTGGTAAT GGCGGTGCCG GCGGGATGTT GTTGGGCGAT GGCGGTGCCG GCGGACAGGG TGGGCCGGCC GTGGCGGGTG TCCTGGGCGG GATGCCCGGC GCGGGCGGCA ACGGCGGTAA TGCCAACTGG TTCGGGTCCG GTGGTGCCGG CGGGCAGGGT GGCACCGGTC TGGCCGGGAC AAACGGGGTC AACCCCGGCT CGATTGCGAA CCCCAACACC GGTGCGAACG GTACCGACAA CAGCGGCAAC GGGAATCAAA CTGGCGGGAA CGGGGGTCCC GGCCCCGCCG GTGGCGTCGG CGAGGCTGGC GGCGTCGGCG GGCAGGGCGG GCTGGGGGAG TCGCTCGACG GCAACGACGG CACCGGCGGT AAGGGTGGAG CCGGGGGTAC TGCCGGTACC GATGGCGGTG CCGGCGGCGC TGGCGGCGCT GGCGGCATAG GTGAGACCGA CGGCAGCGCC GGCGGCGTGG CTACCGGGGG TGAGGGGGGT GACGGTGCCA CCGGAGGGGT CGACGGTGGC GTGGGTGGTG CTGGCGGCAA GGGGGGGCAG GGGCACAACA CGGGTGTAGG TGACGCTTTC GGCGGTGACG GCGGAATCGG CGGTGACGGT AACGGGGCAC TAGGCGCGGC GGGCGGTAAC GGCGGCACCG GTGGTGCCGG TGGAAACGGT GGACGTGGCG GGATGTTAAT CGGCAACGGC GGCGCCGGTG GGGCCGGCGG GACGGGCGGC ACCGGTGGTG GTGGCGCCGC CGGCTTCGCG GGCGGTGTCG GCGGCGCGGG CGGAGAGGGT CTCACCGACG GTGCGGGTAC CGCGGAAGGC GGCACCGGCG GTCTGGGGGG CCTCGGCGGT GTCGGCGGTA CCGGCGGTAT GGGTGGCAGC GGCGGTGTCG GCGGCAACGG CGGGGCGGCT GGGTCGCTCA TCGGGCTTGG TGGTGGCGGG GGTGCCGGCG GTGTCGGCGG CACCGGTGGC ATCGGCGGCA TCGGTGGTGC CGGCGGCAAC GGTGGCGCCG GCGGCGCGGG TACCACCACC GGCGGGGGAG CGACAATTGG CGGTGGCGGC GGTACAGGCG GCGTGGGGGG CGCTGGTGGC ACTGGCGGTA CCGGCGGCGC CGGCGGGACC ACCGGCGGCA GCGGCGGAGC CGGCGGGCTG ATCGGGTGGG CAGGAGCTGC CGGCGGCACC GGCGCAGGCG GCACGGGTGG GCAAGGTGGC CTCGGCGGCC AGGGCGGCAA CGGCGGCAAC GGCGGGACCG GCGCGACCGG CGGTCAGGGC GGCGATTTCG CGCTGGGCGG CAACGGGGGC GCCGGCGGCG CGGGTGGGTC ACCGGGTGGC AGCTCCGGCA TCCAGGGCAA TATGGGCCCG CCCGGCACCC AGGGCGCCGA CGGATAG
|
Protein sequence | MRVVRSRRSQ MSFVIAVPEA LTMAASDLAN IGSTINAANA AAALPTTGVV AAAADEVSAA VAALFGSYAQ SYQAFGAQLS AFHAQFVQSL TNGARSYVVA EATSAAPLQD LLGVVNAPAQ ALLGRPLIGN GANGADGTGA PGGPGGLLLG NGGNGGSGAP GQPGGAGGDA GLIGNGGTGG KGGDGLVGSG AAGGVGGRGG WLLGNGGTGG AGGAAGATLV GGTGGVGGAT GLIGSGGFGG AGGAAAGVGT TGGVGGSGGV GGVFGNGGFG GAGGLGAAGG VGGAASYFGT GGGGGVGGDG APGGDGGAGP LLIGNGGVGG LGGAGAAGGN GGAGGMLLGD GGAGGQGGPA VAGVLGGMPG AGGNGGNANW FGSGGAGGQG GTGLAGTNGV NPGSIANPNT GANGTDNSGN GNQTGGNGGP GPAGGVGEAG GVGGQGGLGE SLDGNDGTGG KGGAGGTAGT DGGAGGAGGA GGIGETDGSA GGVATGGEGG DGATGGVDGG VGGAGGKGGQ GHNTGVGDAF GGDGGIGGDG NGALGAAGGN GGTGGAGGNG GRGGMLIGNG GAGGAGGTGG TGGGGAAGFA GGVGGAGGEG LTDGAGTAEG GTGGLGGLGG VGGTGGMGGS GGVGGNGGAA GSLIGLGGGG GAGGVGGTGG IGGIGGAGGN GGAGGAGTTT GGGATIGGGG GTGGVGGAGG TGGTGGAGGT TGGSGGAGGL IGWAGAAGGT GAGGTGGQGG LGGQGGNGGN GGTGATGGQG GDFALGGNGG AGGAGGSPGG SSGIQGNMGP PGTQGADG
|
| |