Gene Mvan_3875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3875 
Symbol 
ID4649192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4145470 
End bp4146495 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID639807341 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_954662 
Protein GI120404833 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.33215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA TCCTGAAATC CACGAGACGC TGGCGGACAG CGGCCGCACT CACGCTGTCG 
GCGGCGGTGC TGGCCGCCTG CGGCGGCGGC GCCAGTGATG TGGTGGGCGA GGAAGGCGGC
AACAGCGACG CAGAGACCAC GCTGACGCTG GTGGCGTTCG CGGTTCCCGA GCCGGGCTGG
TCGAAGGTGT CACCTGCGTT CGCCGCCACC GAGGAGGGCA AGGGCGTCGC GGTCACGGGT
TCCTACGGTG CCTCGGGAGA CCAGTCCCGC GCCGTCGAGT CCGGCAAGCC CGCCGACATC
GTGAACTTCT CGGTCGAACC CGACATCACC CGCCTGGTCA AGGCCGGCAA GGTCGACGAG
AACTGGAACG CCGGCCCCAA CAAGGGCATC GCGTTCGGCT CGGTGGTCAG CTTCGCGGTG
CGCCCCGGCA ACCCCAAGAA CATCCGCACC TGGGACGACC TGCTGCAGCC GGGCATCGAG
GTCATCACGC CGAGTCCGCT CAGCTCCGGC GCGGCGAAGT GGAACCTGCT CGCGCCGTAC
GCCTACGCCA GCAACGGCGG CAAGGATCCG CAGGCCGGCA TCGACTTCGT CAACAAGTTG
GTCACCGAGC ACGTCAAGCT GCGTCCCGGC TCGGGCCGTG AGGCCACCGA CGTGTTCCGC
CAGGGCAGCG GTGACGTGCT GCTGGCCTAC GAGAACGAGG CGCTGAACTT CGACCTGGAA
CACGTCAACC CGGCGCAGAC CTTCAAGATC GAGAACCCGA CCGCGGTGGT GAACACCAGC
CAGCACCCGG ACCAGGCCCA GGCGTTCGTG AACTTCCAGT TCACCCCGGA AGCCCAGAAG
CTGTGGGCGG AAGCCAACTT CCGGCCGGTC GACCCGGCGG TGCTCGCCGA GTTCGCCGAC
AAGTTCCCCA CGCCGGAGAA GCTGTGGACC ATCGAGGACC TGGGTGGCTG GTCTAAGGTC
GACTCCGAGC TGTTCGACAA GGAGAACGGC ACGATCACGA AGATCTACAA GCAGGCCACT
GGATGA
 
Protein sequence
MRKILKSTRR WRTAAALTLS AAVLAACGGG ASDVVGEEGG NSDAETTLTL VAFAVPEPGW 
SKVSPAFAAT EEGKGVAVTG SYGASGDQSR AVESGKPADI VNFSVEPDIT RLVKAGKVDE
NWNAGPNKGI AFGSVVSFAV RPGNPKNIRT WDDLLQPGIE VITPSPLSSG AAKWNLLAPY
AYASNGGKDP QAGIDFVNKL VTEHVKLRPG SGREATDVFR QGSGDVLLAY ENEALNFDLE
HVNPAQTFKI ENPTAVVNTS QHPDQAQAFV NFQFTPEAQK LWAEANFRPV DPAVLAEFAD
KFPTPEKLWT IEDLGGWSKV DSELFDKENG TITKIYKQAT G