Gene Information |
Locus tag | TBFG_10995 |
Symbol | |
ID | 5221669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 1094196 |
End bp | 1096967 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640605746 |
Product | PE-PGRS family protein |
Protein accession | YP_001286940 |
Protein GI | 148822186 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 5.02084e-24 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 196 |
Fosmid unclonability p-value | 0.770391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTCG TGGTCACAGC ACCGCCGGTG CTCGCGTCGG CGGCGTCGGA TCTGGGCGGT ATCGCGTCCA TGATCAGCGA GGCCAACGCG ATGGCAGCGG TCCGAACGAC GGCGTTGGCG CCCGCCGCCG CCGACGAGGT TTCGGCGGCG ATCGCGGCGC TGTTTTCCAG CTACGCGCGG GACTATCAAA CGCTGAGCGT CCAGGTGACG GCCTTCCACG TGCAGTTCGC GCAGACATTG ACCAATGCGG GGCAGCTGTA TGCGGTCGTC GACGTCGGCA ATGGCGTGCT GTTGAAGACC GAGCAGCAGG TGCTGGGTGT GATCAATGCG CCCACCCAGA CGTTGGTGGG TCGTCCGCTG ATCGGCGATG GCACCCACGG GGCGCCGGGG ACCGGGCAGA ACGGTGGGGC GGGCGGAATC TTGTGGGGCA ACGGCGGTAA CGGCGGGTCC GGGGCTCCCG GACAGCCGGG CGGCCGGGGC GGTGATGCCG GCCTGTTCGG CCACGGCGGT CATGGCGGTG TCGGGGGGCC GGGCATCGCC GGTGCCGCTG GCACCGCGGG CCTGCCCGGG GGCAACGGCG CCAACGGCGG AAGCGGCGGC ATCGGCGGCG CCGGCGGCGC CGGCGGCAAC GGCGGGCTGC TATTCGGCAA CGGTGGTGCC GGCGGCCAGG GTGGCTCCGG CGGACTTGGG GGCTCCGGCG GGACGGGCGG CGCGGGCATG GCTGCCGGTC CCGCCGGCGG CACCGGCGGC ATCGGGGGCA TCGGCGGCAT CGGCGGCGCG GGCGGGGTCG GCGGCCACGG CTCGGCGTTG TTCGGCCACG GGGGAATCAA CGGCGATGGC GGTACCGGCG GCATGGGTGG CCAGGGCGGT GCTGGCGGCA ACGGCTGGGC CGCTGAGGGC ATCACGGTCG GCATTGGTGA GCAAGGCGGC CAGGGCGGCG ACGGGGGAGC CGGCGGCGCC GGCGGGATCG GTGGTTCGGC GGGTGGGATC GGCGGCAGCC AGGGTGCGGG TGGGCACGGC GGCGACGGCG GCCAGGGCGG CGCCGGCGGT AGTGGCGGCG TTGGCGGCGG CGGCGCAGGC GCCGGCGGCG ACGGCGGCGC GGGCGGCATC GGCGGCACTG GCGGTAACGG CAGCATCGGC GGGGCCGCCG GCAATGGCGG TAACGGCGGC CGCGGCGGCG CCGGTGGCAT GGCCACCGCG GGAAGTGATG GCGGCAATGG CGGCGGCGGC GGCAACGGCG GCGTCGGTGT TGGCAGCGCC GGAGGGGCCG GCGGCACCGG CGGTGACGGC GGGGCGGCCG GGGCGGGCGG CGCGCCGGGC CACGGCTACT TCCAACAGCC CGCGCCCCAA GGGCTGCCCA TCGGAACCGG CGGGACCGGC GGCGAAGGCG GTGCCGGCGG CGCCGGTGGA GACGGCGGGC AGGGCGACAT CGGCTTCGAT GGCGGCCGGG GTGGCGACGG CGGCCCGGGC GGTGGCGGCG GCGCCGGCGG TGACGGCAGC GGCACCTTCA ATGCCCAAGC CAACAACGGC GGCGACGGTG GTGCCGGCGG TGTTGGGGGA GCCGGCGGCA CCGGCGGCAC GGGTGGGGTC GGGGCCGACG GGGGTCGCGG GGGGGACTCG GGCCGCGGCG GCGACGGCGG CAACGCCGGC CACGGCGGCG CCGCCCAATT CTCCGGTCGC GGCGCCTACG GCGGTGAAGG TGGCAGCGGC GGCGCCGGCG GCAACGCCGG TGGCGCCGGC ACCGGTGGCA CCGCGGGCTC CGGCGGTGCC GGAGGTTTCG GCGGCAACGG TGCCGATGGC 
GGCAATGGCG GCAACGGTGG CAACGGCGGC TTCGGCGGAA TTAACGGCAC GTTCGGCACC AACGGTGCCG GCGGCACCGG CGGGCTCGGC ACCCTGCTCG GCGGCCACAA CGGCAACATC GGCCTCAACG GGGCCACCGG CGGCATCGGC AGCACCACGT TGACCAACGC GACCGTACCG CTGCAGCTGG TGAATACCAC CGAGCCGGTG GTATTCATCT CCTTAAACGG CGGCCAAATG GTGCCCGTGC TGCTCGACAC CGGATCCACC GGTCTGGTCA TGGACAGCCA ATTCCTGACG CAGAACTTCG GCCCCGTCAT CGGGACGGGC ACCGCCGGTT ACGCCGGCGG GCTGACCTAC AACTACAACA CCTACTCAAC GACGGTGGAT TTCGGCAATG GCCTTCTCAC CCTGCCGACC AGCGTTAACG TCGTCACCTC GTCATCACCG GGAACCCTGG GCAACTTCTT GTCGAGATCC GGTGCGGTGG GCGTCTTGGG AATCGGGCCC AACAACGGGT TCCCGGGCAC CAGCTCCATC GTTACCGCGA TGCCCGGCCT GCTCAACAAC GGTGTGCTCA TCGACGAATC GGCGGGCATC CTGCAGTTCG GTCCCAACAC ATTAACCGGC GGTATCACGA TTTCTGGAGC ACCGATTTCC ACCGTGGCTG TTCAGATCGA CAACGGGCCG CTGCAACAAG CTCCGGTGAT GTTCGACTCC GGCGGCATCA ACGGAACCAT CCCGTCAGCC CTCGCCAGCC TGCCGTCCGG GGGATTCGTG CCGGCGGGAA CGACCATTTC GGTCTACACC AGCGACGGCC AGACGCTGTT GTACTCCTAC ACCACCACCG CGACAAACAC CCCATTTGTC ACCTCCGGCG GCGTGATGAA CACCGGGCAC GTCCCCTTCG CGCAGCAACC GATATACGTC TCCTACAGCC CCACCGCCAT CGGGACGACC ACCTTTAACT GA
|
Protein sequence | MSFVVTAPPV LASAASDLGG IASMISEANA MAAVRTTALA PAAADEVSAA IAALFSSYAR DYQTLSVQVT AFHVQFAQTL TNAGQLYAVV DVGNGVLLKT EQQVLGVINA PTQTLVGRPL IGDGTHGAPG TGQNGGAGGI LWGNGGNGGS GAPGQPGGRG GDAGLFGHGG HGGVGGPGIA GAAGTAGLPG GNGANGGSGG IGGAGGAGGN GGLLFGNGGA GGQGGSGGLG GSGGTGGAGM AAGPAGGTGG IGGIGGIGGA GGVGGHGSAL FGHGGINGDG GTGGMGGQGG AGGNGWAAEG ITVGIGEQGG QGGDGGAGGA GGIGGSAGGI GGSQGAGGHG GDGGQGGAGG SGGVGGGGAG AGGDGGAGGI GGTGGNGSIG GAAGNGGNGG RGGAGGMATA GSDGGNGGGG GNGGVGVGSA GGAGGTGGDG GAAGAGGAPG HGYFQQPAPQ GLPIGTGGTG GEGGAGGAGG DGGQGDIGFD GGRGGDGGPG GGGGAGGDGS GTFNAQANNG GDGGAGGVGG AGGTGGTGGV GADGGRGGDS GRGGDGGNAG HGGAAQFSGR GAYGGEGGSG GAGGNAGGAG TGGTAGSGGA GGFGGNGADG GNGGNGGNGG FGGINGTFGT NGAGGTGGLG TLLGGHNGNI GLNGATGGIG STTLTNATVP LQLVNTTEPV VFISLNGGQM VPVLLDTGST GLVMDSQFLT QNFGPVIGTG TAGYAGGLTY NYNTYSTTVD FGNGLLTLPT SVNVVTSSSP GTLGNFLSRS GAVGVLGIGP NNGFPGTSSI VTAMPGLLNN GVLIDESAGI LQFGPNTLTG GITISGAPIS TVAVQIDNGP LQQAPVMFDS GGINGTIPSA LASLPSGGFV PAGTTISVYT SDGQTLLYSY TTTATNTPFV TSGGVMNTGH VPFAQQPIYV SYSPTAIGTT TFN
|