Gene Mvan_4539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4539 
Symbol 
ID4648750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4877009 
End bp4878835 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content72% 
IMG OID639808009 
Productextracellular solute-binding protein 
Protein accessionYP_955320 
Protein GI120405491 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.874043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCGC CGCCGGCGCC GCAGAGCACC GACACCACCG AGGTGACCCC GCCGCCGCCG 
ATGAAGGCGA CGCAGATCAT CGTGGCGATC GACTCGATCG GGCCCGGATT CAACTCCCAT
CTGTTGTCCG ACCAGTCACC GGTCAACGCG GCGATCAGCT CGCTGGTGCT GCCCAGCTCG
TTCCGACCGA TCCCGGATTC CCGCACCCCG ACGGGTTCGC GCTGGGAGCT GGACACCTCC
CTGCTGGAGT CGGCCGAGGT CACCGGCGAG GACCCGTTCA CCGTCACCTA CAAGATCCGG
CCGGAGGCGC AGTGGACCGA CAACGCGCCG ATCGGCGCGG ACGACTACTG GTACCTGTGG
CGGCAGATGG TCAGCCAGCC CGGCGCTGCC GACCCGGCCG GCTACGACCT CATCACCGGC
GTGCAGTCCG TCGAGGGCGG CAAGACGGCC GTCGTGACGT TCGCGCAGCC CTACCCGGCG
TGGCGTGAGA TGTTCAACGA CATCCTGCCC GCCCACATCG TCAAGGACGT CCCCGGCGGA
TTCGGCGCAG GCCTGGCGCA GGCGCTGCCG GTCACCGGAG GGCAGTTCCG CGTCGACACC
ATCGACCCGC AGCGTGACGA GATCCTGCTC GCGCGTAACG ACAGATATTG GGGCACGCCC
GCCACACCGG ACCTGATCCT GTTCCGCCGC GGCGGGGCGC CCGCCGCGCT GGCCGATTCC
ATCCGCAACG GGGACACCCA GGTCGCCCAG GTGCATGGCG GGTCGGCGGT GTTCGCCCAG
CTGTCCGCGA TCCCCGACGT GCGCACCGCC CGGATCGTCA CCCCGCGCGT CATGCATCTG
ACGCTGCGGG CCCAGCAGCC GATGCTGGCC GACGCGCTGG TCCGCAAGGC GGTCCTCGGC
CTGCTCGACG TCGACCTGCT GGCCGCCGTC GGCGCCGGCG ACGACAACAC CGTCACCCTG
GCCCAGGCCC AGGTGCGCTC GCCGTCGGAC CCCGGTTACG TCCCGACCGC CCCGCCCGCG
ATGACGCGGG AGGATGCCAT GACCCTGCTG GCCGAGGCCG GGTATCAGGT GGACCCCGTG
CAGGTGCCGA CCTCACCGCC GCCGGCGCCC GGTGCGCCGG AGAGCAACCG CGGCCGGCTC
ACCAAGGACG GCGAGCCGCT GACGCTGGTG CTCGGCGTCG CCGCCAATGA CCCGACGGCG
GTCGCGGTCG CCAACACCGC CGCCGATCAG CTGCGCAGCG TCGGGATCGC GGCGACGGTG
GCGGCACTCG ACCCGGTGGT GCTGTACGGA GACGCCATGG TCAACAACCA GATCGATGCG
GTAATCGGCT GGCACCCGGC AGGCGGTGAC CTCGCGACGT CGCTGGCGTC GCGCTACGGC
TGCCCGGCGC TGGAGGCCAC CGCAGTCGAA ACCACCACCG GGGCACCGGC GCCGACCTCC
GACCCGCCCA GACCGAGCGG AACGTCCGGA CCGCGCGGCC CGTCCGATCC GCCGACGACG
TCGACAACGC CGACGACAGC GACGCAGACT TCGCCCGCCC CCGAGCCGGA CTCCGACCAG
CTCGTCCAGG CGCCGAGCAA CATCACCGGA ATCTGCGACC CGCACATTCA GCCCAGGATC
GACGCCGCAC TGCACGGCAC CGCCGACATC GCCGAGGTCA TCGACGAGGT CGAGCCCAGG
CTGTGGGAGA TGTCCACGGT GCTGCCGATC CTGCAGGACA CCACGATCGT CGCGGCCGGC
CCCAGCGTGC AGCACGTCAG CCTGACCGGC GCTGTGCCGG TCGGCATCGT CGGCGACGCA
GGCAGCTGGG TCAAGCTGCC GCAGTGA
 
Protein sequence
MSPPPAPQST DTTEVTPPPP MKATQIIVAI DSIGPGFNSH LLSDQSPVNA AISSLVLPSS 
FRPIPDSRTP TGSRWELDTS LLESAEVTGE DPFTVTYKIR PEAQWTDNAP IGADDYWYLW
RQMVSQPGAA DPAGYDLITG VQSVEGGKTA VVTFAQPYPA WREMFNDILP AHIVKDVPGG
FGAGLAQALP VTGGQFRVDT IDPQRDEILL ARNDRYWGTP ATPDLILFRR GGAPAALADS
IRNGDTQVAQ VHGGSAVFAQ LSAIPDVRTA RIVTPRVMHL TLRAQQPMLA DALVRKAVLG
LLDVDLLAAV GAGDDNTVTL AQAQVRSPSD PGYVPTAPPA MTREDAMTLL AEAGYQVDPV
QVPTSPPPAP GAPESNRGRL TKDGEPLTLV LGVAANDPTA VAVANTAADQ LRSVGIAATV
AALDPVVLYG DAMVNNQIDA VIGWHPAGGD LATSLASRYG CPALEATAVE TTTGAPAPTS
DPPRPSGTSG PRGPSDPPTT STTPTTATQT SPAPEPDSDQ LVQAPSNITG ICDPHIQPRI
DAALHGTADI AEVIDEVEPR LWEMSTVLPI LQDTTIVAAG PSVQHVSLTG AVPVGIVGDA
GSWVKLPQ