Gene Information |
Locus tag | TBFG_13914 |
Symbol | |
ID | 5224610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 4370593 |
End bp | 4372782 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640608690 |
Product | hypothetical protein |
Protein accession | YP_001289841 |
Protein GI | 148825087 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
|
|
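The coordinate and length fields in the Gene Information table are mutually consistent, which a short sketch can confirm. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4370593, 4372782)
print(length)                     # 2190
print(protein_length_aa(length))  # 729
```

Both values match the Gene Length and Protein Length rows of the record.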
Plasmid Coverage information |
Num covering plasmid clones | 172 |
Plasmid unclonability p-value | 0.00000201716 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 191 |
Fosmid unclonability p-value | 0.350893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATTA CCAGGCCGAC GGGCAGCTAT GCCAGACAGA TGCTGGATCC GGGCGGCTGG GTGGAAGCCG ATGAAGACAC TTTCTATGAC CGGGCCCAGG AATATAGCCA GGTTTTGCAA AGGGTCACCG ATGTATTGGA CACCTGCCGC CAGCAGAAAG GCCACGTCTT CGAAGGCGGC CTATGGTCCG GCGGCGCCGC CAATGCTGCC AACGGCGCCC TGGGTGCAAA CATCAATCAA TTGATGACGC TGCAGGATTA TCTCGCCACG GTGATTACCT GGCACAGGCA TATTGCCGGG TTGATTGAGC AAGCTAAATC CGATATCGGC AATAATGTGG ATGGCGCTCA ACGGGAGATC GATATCCTGG AGAATGACCC TAGCCTGGAT GCTGATGAGC GCCATACCGC CATCAATTCA TTGGTCACGG CGACGCATGG GGCCAATGTC AGTCTGGTCG CCGAGACCGC TGAGCGGGTG CTGGAATCCA AGAATTGGAA ACCTCCGAAG AACGCACTCG AGGATTTGCT TCAGCAGAAG TCGCCGCCAC CCCCAGACGT GCCTACCCTG GTCGTGCCAT CCCCGGGCAC ACCGGGCACA CCGGGAACCC CGATCACCCC GGGAACCCCG ATCACCCCGG GAACCCCAAT CACACCCATC CCGGGAGCGC CGGTAACTCC GATCACACCA ACGCCCGGCA CTCCCGTCAC GCCGGTGACC CCGGGCAAGC CGGTCACCCC GGTGACCCCG GTCAAACCGG GCACACCAGG CGAGCCAACC CCGATCACGC CGGTCACCCC CCCGGTCGCC CCGGCCACAC CGGCAACCCC GGCCACGCCC GTTACCCCAG CTCCCGCTCC ACACCCGCAG CCGGCTCCGG CACCGGCGCC ATCGCCTGGG CCCCAGCCGG TTACACCGGC CACTCCCGGT CCGTCTGGTC CAGCAACACC GGGCACCCCA GGGGGCGAGC CGGCGCCGCA CGTCAAACCC GCGGCGTTGG CGGAGCAACC TGGTGTGCCG GGCCAGCATG CGGGCGGGGG GACGCAGTCG GGGCCTGCCC ATGCGGACGA ATCCGCCGCG TCGGTGACGC CGGCTGCGGC GTCCGGTGTC CCGGGCGCAC GGGCGGCGGC CGCCGCGCCG AGCGGTACCG CCGTGGGAGC GGGCGCGCGT TCGAGCGTGG GTACGGCCGC GGCCTCGGGC GCGGGGTCGC ATGCTGCCAC TGGGCGGGCG CCGGTGGCTA CCTCGGACAA GGCGGCGGCA CCGAGCACGC GGGCGGCCTC GGCGCGGACG GCACCTCCTG CCCGCCCGCC GTCGACCGAT CACATCGACA AACCCGATCG CAGCGAGTCT GCAGATGACG GTACGCCGGT GTCGATGATC CCGGTGTCGG CGGCTCGGGC GGCACGCGAC GCCGCCACTG CAGCTGCCAG CGCCCGCCAG CGTGGCCGCG GTGATGCGCT GCGGTTGGCG CGACGCATCG CGGCGGCGCT CAACGCGTCC GACAACAACG CGGGCGACTA CGGGTTCTTC TGGATCACCG CGGTGACCAC CGACGGTTCC ATCGTCGTGG CCAACAGCTA TGGGCTGGCC TACATACCCG ACGGGATGGA ATTGCCGAAT AAGGTGTACT TGGCCAGCGC GGATCACGCA ATCCCGGTTG ACGAAATTGC ACGCTGTGCC ACCTACCCGG TTTTGGCCGT GCAAGCCTGG GCGGCTTTCC ACGACATGAC GCTGCGGGCG GTGATCGGTA CCGCGGAGCA GTTGGCCAGT TCGGATCCCG GTGTGGCCAA GATTGTGCTG 
GAGCCAGATG ACATTCCGGA GAGCGGCAAA ATGACGGGCC GGTCGCGGCT GGAGGTCGTC GACCCCTCGG CGGCGGCTCA GCTGGCCGAC ACTACCGATC AGCGTTTGCT CGACTTGTTG CCGCCGGCGC CGGTGGATGT CAATCCACCG GGCGATGAGC GGCACATGCT GTGGTTCGAG CTGATGAAGC CCATGACCAG CACCGCTACC GGCCGCGAGG CCGCTCATCT GCGGGCGTTC CGGGCCTACG CTGCCCACTC ACAGGAGATT GCCCTGCACC AAGCGCACAC TGCGACTGAC GCGGCCGTCC AGCGTGTGGC CGTCGCGGAC TGGCTGTACT GGCAATACGT CACCGGGTTG CTCGACCGGG CCCTGGCCGC CGCATGCTGA
|
Protein sequence | MSITRPTGSY ARQMLDPGGW VEADEDTFYD RAQEYSQVLQ RVTDVLDTCR QQKGHVFEGG LWSGGAANAA NGALGANINQ LMTLQDYLAT VITWHRHIAG LIEQAKSDIG NNVDGAQREI DILENDPSLD ADERHTAINS LVTATHGANV SLVAETAERV LESKNWKPPK NALEDLLQQK SPPPPDVPTL VVPSPGTPGT PGTPITPGTP ITPGTPITPI PGAPVTPITP TPGTPVTPVT PGKPVTPVTP VKPGTPGEPT PITPVTPPVA PATPATPATP VTPAPAPHPQ PAPAPAPSPG PQPVTPATPG PSGPATPGTP GGEPAPHVKP AALAEQPGVP GQHAGGGTQS GPAHADESAA SVTPAAASGV PGARAAAAAP SGTAVGAGAR SSVGTAAASG AGSHAATGRA PVATSDKAAA PSTRAASART APPARPPSTD HIDKPDRSES ADDGTPVSMI PVSAARAARD AATAAASARQ RGRGDALRLA RRIAAALNAS DNNAGDYGFF WITAVTTDGS IVVANSYGLA YIPDGMELPN KVYLASADHA IPVDEIARCA TYPVLAVQAW AAFHDMTLRA VIGTAEQLAS SDPGVAKIVL EPDDIPESGK MTGRSRLEVV DPSAAAQLAD TTDQRLLDLL PPAPVDVNPP GDERHMLWFE LMKPMTSTAT GREAAHLRAF RAYAAHSQEI ALHQAHTATD AAVQRVAVAD WLYWQYVTGL LDRALAAAC
|
| |
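The gene and protein sequences above can be cross-checked by translating the coding sequence. The sketch below is a minimal, dependency-free illustration, not the pipeline used to produce this record. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the amino-acid assignments of translation table 11, which are the same as those of the standard code; table 11 differs in which start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering; the amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGTATTA CCAGGCCGAC GGGCAGCTAT"))  # MSITRPTGSY
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` applies the same check to the whole record.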