Gene Mvan_3388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3388 
Symbol 
ID4644998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3605074 
End bp3606255 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content66% 
IMG OID639806866 
Productextracellular solute-binding protein 
Protein accessionYP_954191 
Protein GI120404362 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.645484 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGTG AGATCGACCC GCAACTGTTG GCCCGACTGA ACGCGCGCCG GACCTCCCGC 
CGCCGGTTCA TCGGTGGCGG CGCCGCAGCC GCCGCGGGCC TGGCCCTCGG TTCGTCGTTC
CTGGCGGCGT GCGGGTCCGA CAGTGGAACG TCGAGCACCA CCTCGGAGGC CAGTGGCCCC
GCCAGCGGCA CCCTGCGCAT CTCGAACTGG CCGTTGTACA TGGCCGACGG TTTCGTCGCC
GCATTCCAGA CCGCCTCCGG CATCACCGTC GACTACAAAG AGGACTTCAA CGACAACGAG
CAGTGGTTCG CCAAGGTCAA GGAGCCGTTG TCGCGCAAGC AGGACATCGG CGCCGACCTG
GCCGTTCCGA CGTCGTTCCT TGCGGTGCGG CTGCATCAGC TCGGCTGGCT CAACGACATC
AGCGACGAAG GTGTGCCGAA CAAGAAGAAC ATCCGTCCGG ACCTGCTCGA GGCCAGCGTC
GACCCGGGCC GCAAGTTCAG CGCCCCGTAC ATGTCGGGCC TGGTCGGCCT TGCCTACAAC
CGCGCCGCCA CCGGCCGCGA CATCAAGACG ATCGACGACC TGTGGGATCC GGCGTTCAAG
GGCCGGGTCA GCCTGTTCTC CGACGCCCAG GACGGCCTCG GCATGATCAT GCTCTCGCAG
GGCAACTCGC CGGAGAACCC CTCCATGGAG TCGGTCCAGA AGGCGGTCGA TCTGGTCCGT
GAGCAGAACG ACAAGGGCCA GATCCGCAGG TTCACCGGCA ACGACTACGC GGACGACCTT
GCTGCGGGCA ACGTCGCCGT GGCACAGGCG TATTCGGGTG ACGTGGTCCA GCTTCAGGCG
GACAACCCCG ATCTGCAGTT CATCGTTCCG GAGTCCGGTG CGACGACATT CGTCGACACG
ATGGTGATCC CCTACACGAC GCAGAACCAG AAGGCCGCCG AGGCGTGGAT CAACTACGTA
TACGACAGGG CCAATTACGC GAAGCTGGTG TCGTACGTCC AGTACGTTCC GGTGCTGTCG
GACATGACCG AGGAACTGGA GAAGATCGAT CCGGCCGCTG CGGCCAACCC ACTGATCAAC
CCGCCCGCCG ACGTCCTGGC GAAGTCCAAG GGCTGGGCCG CACTCACCGA CGAGCAGACG
CAGGAGTACA ACACCGCGTA CGCCGCCGTC ACCGGCGGCT GA
 
Protein sequence
MSREIDPQLL ARLNARRTSR RRFIGGGAAA AAGLALGSSF LAACGSDSGT SSTTSEASGP 
ASGTLRISNW PLYMADGFVA AFQTASGITV DYKEDFNDNE QWFAKVKEPL SRKQDIGADL
AVPTSFLAVR LHQLGWLNDI SDEGVPNKKN IRPDLLEASV DPGRKFSAPY MSGLVGLAYN
RAATGRDIKT IDDLWDPAFK GRVSLFSDAQ DGLGMIMLSQ GNSPENPSME SVQKAVDLVR
EQNDKGQIRR FTGNDYADDL AAGNVAVAQA YSGDVVQLQA DNPDLQFIVP ESGATTFVDT
MVIPYTTQNQ KAAEAWINYV YDRANYAKLV SYVQYVPVLS DMTEELEKID PAAAANPLIN
PPADVLAKSK GWAALTDEQT QEYNTAYAAV TGG