Gene Mvan_0122 details


Gene Information

Locus tag: Mvan_0122
Symbol:
ID: 4648054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 136098
End bp: 137135
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 639803633
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_950979
Protein GI: 120401150
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4521] ABC-type taurine transport system, periplasmic component
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
            [TIGR01729] taurine ABC transporter, periplasmic binding protein
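
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 136098
END_BP = 137135
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```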


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.592711
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCA AAGCCCTTCT CGTGGTTCTC GTCTCGGCCG TGCTGGCCCT GGCGGGCTGC 
TCGGTGGACA ACGGAGGGCA GCACGGCGAC GACTCAGGCA AGCCGACCAT CCGGATCGGC
TACCAGACGT TCCCGAGCGG CGACCTGATC GTCAAGAACA ACAAGTGGCT CGAAGAAGCG
TTGCCCGACT ACAACATCAA GTGGACGAAG TTCGACTCGG GCGCCGACGT GAACACGGCC
TTCGTGGCAG GAGAACTCGA CTTCGGTGCG CTGGGCTCCA GCCCCGTCGC GCGCGGCCTG
TCCGAGCCGC TGAACATCCC GTACAAGGTC GCGTTCGTGC TCGACGTCGC CGGCGACAAC
GAGGCCCTGG TGGTGCGTAA CGGGGCAGGC GTCGACACCA TCGCGCAACT GAAGGGCAGG
CGTATCGGCA CCCCGTTCGC GTCCACCGCG CACTACAGCC TGCTGGCCGC GCTCGACCAG
AATGGCTTGT CGGCCAACGA TGTTCAGCTA ATCGACCTGC AACCGCAGGC CATCCTCGCG
GCCTGGGAGC GCGGGGACAT CGACGCCGCC TACACCTGGC TGCCGACCCT GGACGAGCTG
CGCAAGACCG GCCGGGATCT GATCACCAGT CGTCAGCTCG CCGATGCCGG CAAGCCCACG
CTCGATCTGG CGACCGTCAG CGACGAGTTC GCGTCCGCCC ACCCCGAGGC CGTCGATGTG
TGGCGGCAGC AGCAGGGACG CGCGCTTGAC CTCATCCGGG AGGATCCGCA GGCTGCCGCC
GAAGCCATCG CCGCCGAGAT CGGCCTGACC CCGCAGGATG TGGCCGGTCA ACTCAAGCAG
ATGGTGTTCC TCACCCCGCA GGACATCTCA TCCACGGAAT GGCTTGGCAC TGAGGGTAAT
CCAGGCAACC TCGCGGTGAA CCTGGAATCC GCTTCGCAGT TCCTGGCCGA TCAGTCGCAG
ATCCCGGCCG CGGCGCCGTT GAAGACGTTC CAGGACGCCG TCTACACGAA AGGCCTACCG
GGTGCCCTCA ACGAATGA
 
Protein sequence
MKLKALLVVL VSAVLALAGC SVDNGGQHGD DSGKPTIRIG YQTFPSGDLI VKNNKWLEEA 
LPDYNIKWTK FDSGADVNTA FVAGELDFGA LGSSPVARGL SEPLNIPYKV AFVLDVAGDN
EALVVRNGAG VDTIAQLKGR RIGTPFASTA HYSLLAALDQ NGLSANDVQL IDLQPQAILA
AWERGDIDAA YTWLPTLDEL RKTGRDLITS RQLADAGKPT LDLATVSDEF ASAHPEAVDV
WRQQQGRALD LIREDPQAAA EAIAAEIGLT PQDVAGQLKQ MVFLTPQDIS STEWLGTEGN
PGNLAVNLES ASQFLADQSQ IPAAAPLKTF QDAVYTKGLP GALNE
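
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only and is not the tool that produced this record. It builds the codon map from the standard compact encoding; translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons it permits, which this sketch does not model. The `translate` helper is a hypothetical name, and it assumes the input is in frame and contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAACTCA AAGCCCTTCT CGTGGTTCTC"))  # MKLKALLVVL
```

Passing the full gene sequence in the same way should yield the 345-residue protein listed above, since the final codon TGA is a stop.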