Gene Mvan_5320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5320 
Symbol 
ID4644445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5697841 
End bp5698872 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content69% 
IMG OID639808795 
Productperiplasmic solute binding protein 
Protein accessionYP_956097 
Protein GI120406268 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.723333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTTCCT CGTTCCGGGG CGTGCGCGTC CTCGCCGCGA CCCTCGCCGT CTCCCTGCCC 
TTCGTCCTGT CCGCCTGCGG CGGCGACGAC ACCTCCGCCA TGGACACCTC GGCCGCCCCG
GACTGCCCGA CGACGCCCGT GAACGTGGTG GTCAGCGTCG ACCAGTGGGG AAACATCGTG
TCGCAGCTCG GCGGTGCGTG CGCCGAGGTC ACCACGGTGC TGGCCGGCTC GGCAGTCGAC
CCCCACGACT ACGAGCCCGC ACCGTCGGAC GCCGCGGACT TCGAGGGTGC GCAGCTCGTC
GTCCTCAACG GCGGCCATTA CGACGAGTGG GCTGCCAAGC TGGCCGCGAC GTCGGCGGCG
GACGCACCGG TGGTCAACGC CGTCGAGCTC AGCGGCGGTC ACGCCGAGCA AGGCCAAGAG
CACGCGGGCG AGGACGGGCA CGGTCACGCC GAGGAGGGGC AAGAGCACGC GGGCGAGCAA
GGCCAAGAGC TCGGCGACGA GGGCAACCCA CACGTCTGGT ACAACCCGAC CGCGGTGACC
GAGTTCGCCG AAGCCGTCAC CGCCCAACTC GGCAAGCTCT CGCCCGACGC GGCGGGGTAC
TTCGCCGAAC GTCACGCCGA GTTCGCCGAG TCGATGAAGC CGTATGACGA GGTGATCGCT
GCGATCAAGG CGGGTGCGAC CGGCAGGACC TACGCGGCCA CCGAGAGCGT GTTCGGCGAT
ATGGCCACCG CGCTCGGGTT GGTGGATCGG ACGCCGCAGG GCTACCAGGT CGCCGCGGCC
AACGAGAGCG ACCCGTCGCC GGCAGACCTC GACGCCTTCC TGCAGCTGCT CGCCGACCGT
GGTGTCGACG TACTGATCTA CAACACCCAG ACCGAGGGTT CGGTGCCCGA GCAGATCCGC
TCGGCGGCCG AACAGGCGGG CATCGCCGTC GTCGACGTGA CCGAAACATT GCCCTCGGAC
GCCAAGTCGT TCCAGGATTG GCAGGTGGCA CAACTCGATT CCCTGGCCAA GGCCCTCGAT
GTCAGGACCT GA
 
Protein sequence
MISSFRGVRV LAATLAVSLP FVLSACGGDD TSAMDTSAAP DCPTTPVNVV VSVDQWGNIV 
SQLGGACAEV TTVLAGSAVD PHDYEPAPSD AADFEGAQLV VLNGGHYDEW AAKLAATSAA
DAPVVNAVEL SGGHAEQGQE HAGEDGHGHA EEGQEHAGEQ GQELGDEGNP HVWYNPTAVT
EFAEAVTAQL GKLSPDAAGY FAERHAEFAE SMKPYDEVIA AIKAGATGRT YAATESVFGD
MATALGLVDR TPQGYQVAAA NESDPSPADL DAFLQLLADR GVDVLIYNTQ TEGSVPEQIR
SAAEQAGIAV VDVTETLPSD AKSFQDWQVA QLDSLAKALD VRT