Gene Mflv_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3137 
Symbol 
ID4974458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3320977 
End bp3322158 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640457360 
Productextracellular solute-binding protein 
Protein accessionYP_001134402 
Protein GI145223724 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.713269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGTG ACATCGACCC TCAGTTGCTC GCCCAGTTGA CGGCACGACG GACCTCGCGC 
CGTCGCTTCA TCGGTGGCAG CGCCGCCGCG GCGGCGGGTC TGACGCTCGG CGCGTCGTTC
CTGGCGGCCT GCGGATCGTC GGACACCGGA ACCTCGGGCA CCACCGACGA CGGCGGACCC
GCCAGCGGCA CCCTGCGGAT CTCGAACTGG CCGCTGTACA TGGCGGACGG GTTCGTCGCC
GCATTCCAGA CCGCGTCGGG CATCACGGTC GACTACAAAG AGGACTACAA CGACAACGAA
CAGTGGTTCG CCAAGGTCAA GGAGCCGCTG TCCCGCAAGC AGGACATCGG CGCCGACCTC
GTGGTTCCCA CGGAGTTCAT GGCCATCCGG CTGCACCAGC TCGGCTGGCT CAACGACATC
AGCGACGAGG GCGTCCCGAA CAAGAAGAAC CTGCGCCCCG ACCTCATGGA GGCCAGCGCC
GACCCGGGCC GCAAGTTCAG CGCCCCGTAC ATGTCCGGCC TCGTCGGGCT TGCCTACAAC
AGGGCCGCGA CCGGACGCGA CATCAGCTCG ATCGACGACC TGTGGGATCC CGCGTTCAAG
GGCCGGGTGA GCCTGCTGTC CGACACCCAG GACGGGCTCG GGATGATCAT GCTGTCCCAG
GGCAATTCGC CCGAGAACCC GTCGACCGAA TCGGTACAGC GCGCCGTCGA CCTCGTGCGT
GAGCAGAACG ACCGGGGCCA GATCCGCCGC TTCACCGGCA ACGACTACGC CGACGACCTC
GCCGCGGGGA ACATCGCTGT GGCACAGGCC TATTCGGGTG ACGTGGTGCA GCTGCAGGCC
GACAACCCGG ACCTGCAGTT CATCGTCCCG CAGTCCGGTG GCACCACGTT CCTCGACACC
ATGGTGATCC CGTACACCAC GCAGAACCAG AAGGCCGCCG AGGCGTGGAT CGACTACGTC
TACGACCGCG CCAACTACGC CAAGCTGGTG GCCTTCACCC AGTTCGTTCC GGTGCTGTCC
GAGATGACCG AGGAACTGGA GAAGGTCGAT CCCGCCGCGG CCAGCAACCC GTTGATCAAC
CCGCCGGCCG ACGTCCTGGA GCGGTGCAAG AGCTGGGCCG CGCTGACCGA CGAGCAGACG
CAGGAGTTCA ACACCGCGTA CGCCGCAGTC ACCGGTGGCT GA
 
Protein sequence
MSRDIDPQLL AQLTARRTSR RRFIGGSAAA AAGLTLGASF LAACGSSDTG TSGTTDDGGP 
ASGTLRISNW PLYMADGFVA AFQTASGITV DYKEDYNDNE QWFAKVKEPL SRKQDIGADL
VVPTEFMAIR LHQLGWLNDI SDEGVPNKKN LRPDLMEASA DPGRKFSAPY MSGLVGLAYN
RAATGRDISS IDDLWDPAFK GRVSLLSDTQ DGLGMIMLSQ GNSPENPSTE SVQRAVDLVR
EQNDRGQIRR FTGNDYADDL AAGNIAVAQA YSGDVVQLQA DNPDLQFIVP QSGGTTFLDT
MVIPYTTQNQ KAAEAWIDYV YDRANYAKLV AFTQFVPVLS EMTEELEKVD PAAASNPLIN
PPADVLERCK SWAALTDEQT QEFNTAYAAV TGG