Gene Mflv_3851 details

Gene Information

Locus tag: Mflv_3851
Symbol:
ID: 4975167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 4113571
End bp: 4114881
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 640458075
Product: extracellular solute-binding protein
Protein accession: YP_001135111
Protein GI: 145224433
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
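
The start, end, gene length and protein length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which is stated in the record. The helper names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 4113571
end_bp = 4114881
gene_length_bp = 1311
protein_length_aa = 436


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the lengths reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the table: 4114881 - 4113571 + 1 = 1311 bp, and 1311 / 3 = 437 codons, one of which is the stop codon, leaving 436 aa.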


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0718166
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAA ATCAGTTCGC CCGCGGGCGA CGGCGTCGCG CAGTCGCGCT GGCCAGTGCG 
CCTCTGGTGG CCGCGTCGTT GTTGTCCGGG TGCGGCGCCC AGAGCGGTCC CCCGACCTTG
ACGTGGTACA TCCTTCCCGA CAACGGCGGC TCGGTCGCGC GTGCCGAGCA GTGCGCCGAG
GCGTCCAACG GCGCCTATCA GGTGCGTATC GAGTCGCTGC CGAGCACCGC GACCGCCCAG
CGCGAACAGA TGGTTCGCCG TCTGGCCGCC GGTGATTCGT CGATCGACCT CGTCAGCATG
GACGTGGTCT TCACCGCGGA GTTCGCCAAC GCCGGGTTCC TGCGCCCGTA CACCGCCGAG
GAGACCAGCC GGTTGACCGC CGGGATGCTG CCCGCCCCGA TCGAGACCGG TATGTGGGAG
GACACCCTCT ACGGCGCGCC GTACAAGTCG AACGCGCAGC TGCTCTGGTA CCGCAAGTCC
GCGGCGGCCG CGGCCGGCGT CGACCCCGCC TCTCCGACCT TCACCTGGGA CGAGATGCTC
AAAGCCGCTG TGGGGCAGCA GAAGAAGATC GCGGTGCAGG CCCAGCGGTA CGAGGGCTAC
ACCGTGTTGA TCAACGCCCT GGTGCTCTCG GGCGGCGGTG CGCTCCTCGA GGACGTGGAA
GCCGGACGCG ACGCCAAACC GTCGCTGAAC ACTCCGCCCG GGCTCAAGGC CGCCGAGATC
GTCGGCACCC TCGGTCGCTC TCCCGCCGCG CCCACCGACA TGTCGAACGC GTCGGAAGAG
CAGGCGCGCG CCAACTTCCA GTCCGATCAG GGCATGTTCA TGGTCAACTG GCCGTACGTG
CTGGCCGCCG CGCGCAGCGC CGCCGAGGAG GGCACGCTGC CGCAGGAGGT CGTCGACGAC
ATCGGCTGGG CCCGCTACCC CAGGGTGTCC CCGGATATGC CGAGCGCGCC GCCGCTGGGC
GGTGCGAACC TCGGCATCGG CGCGTACACG GAGTATCCGG AGGAGGCCGT CGCGCTGGTC
GAGTGCATCA ACGCTGAACC GAAGGCCACT CAGTACATGC TCGACGAGAG TGAGCCGTCA
CCGTATGCGG CGTCCTACGA CAACCCCGAG ATCCGGGAGA CCTACGAGAA CGCCGACCTG
ATCCGGGAGT CGATCGGCGA CGGCGGCCCC CGTCCACCCA CGCCGTTCTA CACCGACATC
TCGGGCGCGA TCCAGCAGAC GTGGCATCCG CCCGCGTCGG TGACTTCCGA AACTCCGGAA
AGGACAGACC AATTCATGGC TGACGTGCTG GCGGGGAGGC GACTGCTGTG A
 
Protein sequence
MKRNQFARGR RRRAVALASA PLVAASLLSG CGAQSGPPTL TWYILPDNGG SVARAEQCAE 
ASNGAYQVRI ESLPSTATAQ REQMVRRLAA GDSSIDLVSM DVVFTAEFAN AGFLRPYTAE
ETSRLTAGML PAPIETGMWE DTLYGAPYKS NAQLLWYRKS AAAAAGVDPA SPTFTWDEML
KAAVGQQKKI AVQAQRYEGY TVLINALVLS GGGALLEDVE AGRDAKPSLN TPPGLKAAEI
VGTLGRSPAA PTDMSNASEE QARANFQSDQ GMFMVNWPYV LAAARSAAEE GTLPQEVVDD
IGWARYPRVS PDMPSAPPLG GANLGIGAYT EYPEEAVALV ECINAEPKAT QYMLDESEPS
PYAASYDNPE IRETYENADL IRESIGDGGP RPPTPFYTDI SGAIQQTWHP PASVTSETPE
RTDQFMADVL AGRRLL
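
Both sequences above are printed in space-separated groups of 10 residues, so the grouping has to be removed before measuring length or GC content. The following is a minimal sketch of that step. The function names are hypothetical, and the GC calculation is a plain G+C count over the sequence length, which is not necessarily how the GC content in this record was computed.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for grouping, and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# The first two groups of the gene sequence, used as a small example.
example = clean_sequence("ATGAAACGAA ATCAGTTCGC")
print(len(example), gc_fraction(example))
```

Passing the full gene sequence block through clean_sequence and then through len and gc_fraction allows a comparison with the Gene Length and GC content fields of the record.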