Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3875 |
Symbol | |
ID | 4649192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4145470 |
End bp | 4146495 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639807341 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_954662 |
Protein GI | 120404833 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.33215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA TCCTGAAATC CACGAGACGC TGGCGGACAG CGGCCGCACT CACGCTGTCG GCGGCGGTGC TGGCCGCCTG CGGCGGCGGC GCCAGTGATG TGGTGGGCGA GGAAGGCGGC AACAGCGACG CAGAGACCAC GCTGACGCTG GTGGCGTTCG CGGTTCCCGA GCCGGGCTGG TCGAAGGTGT CACCTGCGTT CGCCGCCACC GAGGAGGGCA AGGGCGTCGC GGTCACGGGT TCCTACGGTG CCTCGGGAGA CCAGTCCCGC GCCGTCGAGT CCGGCAAGCC CGCCGACATC GTGAACTTCT CGGTCGAACC CGACATCACC CGCCTGGTCA AGGCCGGCAA GGTCGACGAG AACTGGAACG CCGGCCCCAA CAAGGGCATC GCGTTCGGCT CGGTGGTCAG CTTCGCGGTG CGCCCCGGCA ACCCCAAGAA CATCCGCACC TGGGACGACC TGCTGCAGCC GGGCATCGAG GTCATCACGC CGAGTCCGCT CAGCTCCGGC GCGGCGAAGT GGAACCTGCT CGCGCCGTAC GCCTACGCCA GCAACGGCGG CAAGGATCCG CAGGCCGGCA TCGACTTCGT CAACAAGTTG GTCACCGAGC ACGTCAAGCT GCGTCCCGGC TCGGGCCGTG AGGCCACCGA CGTGTTCCGC CAGGGCAGCG GTGACGTGCT GCTGGCCTAC GAGAACGAGG CGCTGAACTT CGACCTGGAA CACGTCAACC CGGCGCAGAC CTTCAAGATC GAGAACCCGA CCGCGGTGGT GAACACCAGC CAGCACCCGG ACCAGGCCCA GGCGTTCGTG AACTTCCAGT TCACCCCGGA AGCCCAGAAG CTGTGGGCGG AAGCCAACTT CCGGCCGGTC GACCCGGCGG TGCTCGCCGA GTTCGCCGAC AAGTTCCCCA CGCCGGAGAA GCTGTGGACC ATCGAGGACC TGGGTGGCTG GTCTAAGGTC GACTCCGAGC TGTTCGACAA GGAGAACGGC ACGATCACGA AGATCTACAA GCAGGCCACT GGATGA
|
Protein sequence | MRKILKSTRR WRTAAALTLS AAVLAACGGG ASDVVGEEGG NSDAETTLTL VAFAVPEPGW SKVSPAFAAT EEGKGVAVTG SYGASGDQSR AVESGKPADI VNFSVEPDIT RLVKAGKVDE NWNAGPNKGI AFGSVVSFAV RPGNPKNIRT WDDLLQPGIE VITPSPLSSG AAKWNLLAPY AYASNGGKDP QAGIDFVNKL VTEHVKLRPG SGREATDVFR QGSGDVLLAY ENEALNFDLE HVNPAQTFKI ENPTAVVNTS QHPDQAQAFV NFQFTPEAQK LWAEANFRPV DPAVLAEFAD KFPTPEKLWT IEDLGGWSKV DSELFDKENG TITKIYKQAT G
|
| |