Gene Mvan_2544 details


Gene Information

Locus tag: Mvan_2544
Symbol:
ID: 4645425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 2683523
End bp: 2684917
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 67%
IMG OID: 639806029
Product: extracellular solute-binding protein
Protein accession: YP_953361
Protein GI: 120403532
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
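
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2683523, 2684917)
print(length, protein_length_aa(length))  # prints: 1395 464
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.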


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGTGTG ACACACTCTC ACACGTACCG AATGTGCTGC CCGTGGCCCT CATCGGAGAC 
CACAACTTCA TAGACGAGGA CGTGATGAAA CGAAATCAGT TCGCCCGCGG GCGACGACGT
CGTGCGATCG CACTGGCCAC CGCGCCTCTG ATCGCGGCGT CGTTGTTGTC CGGGTGCGGC
AGTCAGGGCG GTCCCCCGAC GCTGACGTGG TACATCCTTC CCGACAACGG AGGCTCCGTG
GCCCGCGCGG AGCAATGCGC CGAGGCCTCC AACGGCGCCT ACCAGGTCCG GATCGAATCG
TTGCCGAGCA CCGCGACCGC CCAACGTGAG CAGATGGTGC GCCGCCTCGC CGCAGGCGAT
TCGTCGATCG ACCTGGTCAG CATGGATGTG GTGTTCACCG CCGAGTTCGC CAACGCCGGC
TTCCTGCGCC CGTACACCGA GGAGGAGACC AGCCGGCTCA CCGCAGGCAT GCTCCCGGCG
CCGGTCGAGA CCGGCATGTG GGAGGACACG CTTTACGGAG CGCCCTACAA GTCCAACACC
CAGTTGCTGT GGTACCGCAA GTCTCATGCC GCCGCTGCGG GCGTGGACCC CGCCAGTCCG
ACGTTCACGT GGGACGAGAT GCTCAAAGCC GCTGAGCAAC AACAGAAGAA GATTGCCGTC
CAGGCGCAGC GCTACGAGGG TTACACGGTG TGGATCAACG CACTGGTGCT CTCGGGCGGT
GGCGAGTTGC TTCAGGACGT GGAGGCCGGC CGCAACGCCA AACCCTCCAT GGCCACCCCG
CCCGGCGAGA AGGCCGCCGA GATCGTCGGC AACCTGGGCC GGTCCAGCGC GGCGCCGACC
GACCTGTCCA ACGCCTCGGA AGAGCAGGCA CGCGCCAACT TCCAATCCGA TCAGGGCATG
TTCATGGTCA ACTGGCCCTA CGTGCTGGCG GCGGCCCGCA GCGCCGCCGA AGAGGGCACC
TTGCCGCAGG CGGTCGTCGA CGACATCGGT TGGGCGCGCT ACCCGAGGGT CTCGCCGGAC
CGGCCCAGTG CGCCGCCGCT GGGTGGTGCG AACCTCGGCA TCGGCGCCTA CACCAAACAT
CCGGACCAGG CCGTTGCGCT GGTGGAGTGC ATCAACGCAG AGCCCAAGGC CACCCAGTAC
ATGCTCGACG AGAGTGAGCC CTCGCCGTAC GCCGCGTCGT ACGACAATCC TGAGATCCGG
GAGACCTACG AGAATGCCGA CCTGATCCGG GAGTCCATCG GAGAAGGCGG CCCCCGTCCG
CCGACCCCGT TCTATACCGA CATCTCGGGC GCCATCCAGC AGACCTGGCA CCCACCCGCC
TCGGTCAACG CTGAAACCCC GGAAAGGACA GATCAATTCA TGGCCGACGT GCTGGCGGGG
AGGCGGCTGC TGTGA
 
Protein sequence
MRCDTLSHVP NVLPVALIGD HNFIDEDVMK RNQFARGRRR RAIALATAPL IAASLLSGCG 
SQGGPPTLTW YILPDNGGSV ARAEQCAEAS NGAYQVRIES LPSTATAQRE QMVRRLAAGD
SSIDLVSMDV VFTAEFANAG FLRPYTEEET SRLTAGMLPA PVETGMWEDT LYGAPYKSNT
QLLWYRKSHA AAAGVDPASP TFTWDEMLKA AEQQQKKIAV QAQRYEGYTV WINALVLSGG
GELLQDVEAG RNAKPSMATP PGEKAAEIVG NLGRSSAAPT DLSNASEEQA RANFQSDQGM
FMVNWPYVLA AARSAAEEGT LPQAVVDDIG WARYPRVSPD RPSAPPLGGA NLGIGAYTKH
PDQAVALVEC INAEPKATQY MLDESEPSPY AASYDNPEIR ETYENADLIR ESIGEGGPRP
PTPFYTDISG AIQQTWHPPA SVNAETPERT DQFMADVLAG RRLL
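
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any calculation. The following is a minimal sketch of how the blocked gene sequence could be cleaned and its GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and the way the database rounds its reported percentage is not stated in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage with the first two blocks of the gene sequence in this record.
fragment = clean_sequence("GTGAGGTGTG ACACACTCTC")
print(len(fragment), gc_fraction(fragment))
```

Passing the full Gene sequence block through clean_sequence should yield a string whose length matches the Gene Length field.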