Gene M446_3428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3428 
Symbol 
ID6132055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3804262 
End bp3805539 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content72% 
IMG OID641643598 
Productextracellular solute-binding protein 
Protein accessionYP_001770250 
Protein GI170741595 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.390518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.012309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGA CCCTTTCCCT GGCCGCGATG GCGGCGGCGG CGATGCTCGC CGCCAGCGCC 
GCGTCGGCGG CATCGTTGAA GATCCTGGAC CACGGCAGCC GCGGCGCCGC CGAACTCGAC
GCCATCGCGG CCCAGGTCGC GGCCTGGAAC CGATCGCACC CGGACATCCC GGCCGAACTC
GTGACCCTGC CCAAGCCGAT CGAGAACCAG ACGGTTCAGG CGAAGGCCCT GGCCGGCACC
TGGCCGGACA TCCTCGACTT CGACGGGCCG AGCTTCGCGA ACGCGGCCTG GGCCGGCCTG
CTGGCGCCCC TCGACGACCT GCTCCCGCCC GACCTGATGC GCGCCCTGCT GCCGTCGATC
CGCGCACAGG GCCTCTACGC CCCGGACGGC AAGATCTACG CCCTCGGCCA GTTCGATTCC
GGGCTCGGAC TCTGGGCCTC CCGCTCGGCC CTGCGCCAGG CCGGGATCCG GATCCCCAGC
GGCCTCGACG ATGCCTGGAC CGGCGAGGAG TTCGAGGCCG CGCTCGCCGC CCTCAAGCGC
GCCGGCTACC CGACGCCCCT CGACATGAAG CTGAATTACG GCGTCGGCGA GTGGTACACC
TACGGCTTCG CGCCGATCCT GCAATCGTAC GGCGGGGATC TGATCAACCG CACGACCTGG
CAGGCCGAAG GGACGATCAA TTCCGAGGCG TCGATCGCGG CGTTGGGCCG GATCCAGTCC
TGGATGAAGG CCGGGTACAT CGTGCCCGCC TCGGAAGGCG ACGACGCCTT CTACGGCAAG
CGGAGCGCGG CCCTGGCTTT GGTCGGGCAC TGGATGTGGC CGACCCACAG CGCCGCCCTC
GGCTCCGACC TGGTGCTGCT GCCGATGCCG CGCTTCGGCG CGCGCCACGT CACCGGGATG
GGGAGCTGGA ACTGGGGGAT CTGGTCGGGC TCCCCGAACA AGGAGGCGGC CGCCAAGTTC
CTGGAATTCC TGATGTCCGA GCCGGCGATG GAGGCGGTGG CCGGGGCGGC GGGCGCGATC
CCGTCGCGCC AGGCGGCGGC GGAGCGCAAC CCGCTCTTCC GCCAGGGCGG GCCGATGGCG
CTCTACCGCG AGCAGCTGAC CCGCATCGCG GTGCCCCGCC CGCCGCATCC GGCCTACCCG
GTGATCTCGC GGGCCTTCGC CGCCGCGGTC AATTCGGTCA TGAAGGGCGA GGATCCGAAG
CGGGCGCTCG ACCGGGCGGC GGCGGCGATC GACCAGGAGA TCGAGCAGAC CAACGGCTAC
AAGCCCTTCG GCGGCTGA
 
Protein sequence
MKRTLSLAAM AAAAMLAASA ASAASLKILD HGSRGAAELD AIAAQVAAWN RSHPDIPAEL 
VTLPKPIENQ TVQAKALAGT WPDILDFDGP SFANAAWAGL LAPLDDLLPP DLMRALLPSI
RAQGLYAPDG KIYALGQFDS GLGLWASRSA LRQAGIRIPS GLDDAWTGEE FEAALAALKR
AGYPTPLDMK LNYGVGEWYT YGFAPILQSY GGDLINRTTW QAEGTINSEA SIAALGRIQS
WMKAGYIVPA SEGDDAFYGK RSAALALVGH WMWPTHSAAL GSDLVLLPMP RFGARHVTGM
GSWNWGIWSG SPNKEAAAKF LEFLMSEPAM EAVAGAAGAI PSRQAAAERN PLFRQGGPMA
LYREQLTRIA VPRPPHPAYP VISRAFAAAV NSVMKGEDPK RALDRAAAAI DQEIEQTNGY
KPFGG