Gene M446_5587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5587 
Symbol 
ID6133330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6127720 
End bp6129090 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content58% 
IMG OID641645710 
Productextracellular solute-binding protein 
Protein accessionYP_001772324 
Protein GI170743669 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTGCT CGGTAAGTCG AGAAATCAAG ATCAAAGCAA CCGTTTCACG AGTGTATGTA 
CACATCCTCA GCGGCCTCAT CGGCCTGATA CTGACAATGT CACCCGTCAC TCCCCACGCG
GCCACCGAGC TTCAGCTCTG GCACTCTCTC GACGGCGCGA ACGGCGCCCT GGTGGCCCGG
ATAGCGGAAG AATTCAACGC CTCGCAGAGT ACCTACCGGG TCATCACGAT TTACAAGGGC
CCGTACGCTG AGACGATCAA TGGTGGGATC GCGGCTTACC GAGCGGGCGT CGCACCTCAC
CTCCTGCAGG TCTTCGAGGT CGGGAACGGA ACAATGGCAG CGGCGATCGG CGCCGTGAAG
CCCGTCTCGG AAGTGCTGAG TGCGGCTGGA AGCACGGTCG CGCCTGACGC ATTCCTGCCA
GTCATCGCGA CCAATTACCT CGCCCGTGAT GGCAGTATGC TGTCGCTGCC CTTTAACATC
TCGGCAATGG TAATGTGGGT CAATCTCGAC CGATTTCAAC AGGCAGGTCT TGATCCTCAG
CGCCTTCCCG AGACGTGGCC GCAAGTATTC GTAGCTGCTC GGGCGTTAAA ACATGCAGAG
CCGTCCAAGT GTGCGCTCTC GACCGCATGG CCCACATGGG CTCATATCGA GCAGCTGTCG
GCATGGCACG GCCAGCCGGT CGCAAGTCGC GCGAACGGCC TCGACGGATT CGATGCTGAG
CTCGTCTTTA ACAAGCCGCT CCAAGTACGT CACTTGCAGA ACATCGTTGA TCTGAACCGC
GAGGGTGCGT TCAGCTACTC AGGCCGCACG AATCGCGGGG AGGCACAATT TATTTCGGGC
GAGTGCGGTA TCTTTTTGAC TTCCTCCAGT ATGTACGGGA CGATTGCAGC GCGAGGGCAT
TTTCGCTGGG CGATCCAACC GATGCCTTAC TATCCCGATC TTGTCGATAC GCCTAAAAAC
GCTATCCTGG GCGGTGGATC TCTGTACGTC ATGCAGGGAA AGACGTCGCA GGAATATGCT
GGCGTAGCTG CATTCTTGAA TTTTGTATTG AGTTTCCCTG TGCAATCCCT CATCTACCGC
AATTCCGGCT ACATGCCAGT TACCCGCGCA GCCTACGCCG ATGCTCAGGC GTCCGGATTT
TACAAGCGTT ACCCTATGCT TCAGGTCGCG GTTCGCGAGA TAACGGATCG AGAGCCGACG
CGTGAAACTT CGGGCCTGCG GCTGGGCAAT ATGCTGCAGA TCCGCGATAT TTGGGCGGAT
GAGATTGAGG CGGCGCTCGC CGGTACGAAA GCACCGCAGC AAGCTCTGGA CGATGCAGTG
GCTCGCGGCA ACGCTGTGTT GCGGCTGTTC CAACGCAGAA CCTCTCGATA A
 
Protein sequence
MHCSVSREIK IKATVSRVYV HILSGLIGLI LTMSPVTPHA ATELQLWHSL DGANGALVAR 
IAEEFNASQS TYRVITIYKG PYAETINGGI AAYRAGVAPH LLQVFEVGNG TMAAAIGAVK
PVSEVLSAAG STVAPDAFLP VIATNYLARD GSMLSLPFNI SAMVMWVNLD RFQQAGLDPQ
RLPETWPQVF VAARALKHAE PSKCALSTAW PTWAHIEQLS AWHGQPVASR ANGLDGFDAE
LVFNKPLQVR HLQNIVDLNR EGAFSYSGRT NRGEAQFISG ECGIFLTSSS MYGTIAARGH
FRWAIQPMPY YPDLVDTPKN AILGGGSLYV MQGKTSQEYA GVAAFLNFVL SFPVQSLIYR
NSGYMPVTRA AYADAQASGF YKRYPMLQVA VREITDREPT RETSGLRLGN MLQIRDIWAD
EIEAALAGTK APQQALDDAV ARGNAVLRLF QRRTSR