Gene M446_1317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1317 
Symbol 
ID6134637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1449677 
End bp1450957 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID641641598 
Productextracellular solute-binding protein 
Protein accessionYP_001768269 
Protein GI170739614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.12099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGAG GGAGGGTGGC CGCGCTGGTC CTCGGGGCCG CGCTGGCGGC CGGCGGCGCG 
CGGGCCGCGG AGCCGACGCA GATCACCATG TGGTCGAACT GGCCCGACGA GCCCGCCAAG
CGCGAGTGGG TGAGCGCCCG GGTCAAGGAA TTCGAGGCCG CGAATGCCCA GTGCCGGGTG
AAGCTGAGCT TCATCCCCAA GGCCGACATC TACACGCAGG CCAAGTCCGC CGTGCGCACC
GGTCAGGCGC CGGACGTCTT CTACATGGAG CCGGACCAGC CCGAATTCCT GCAGGGCGGC
TTCCTCGAAC CGCTCGAGAC CCGCGTCGAC ACGGGGGCGA TCGAGGAATG GGCGAAGCCC
GCCTGGACCG CGAAGGGCCA CCTCTACGGC CTGCCGGTCG AGGCCTACAC GGTCGAACTC
TACTACAACC GGGATCTCGT CAGGAAGGTC GGCGTCGCGG TGCCGGAATC CGGCCAGCTG
ACGCAGGGCG CCTTCGCCGA CCTCGTCAGG AAGGGCGTGG CCGCGGGCGT GACGCCCGTG
GCGCAGGGCG TCGGCGACCG GCCCTTCCCG GGCGGCCTGC TGCTGTTCGA GTCGCTGCTG
CGCAAGCTCG GGACCGAGGA TTACGGCAAG CTCCTCAGCG GGGACCTGTC CTTCCGCGAT
CCCCGGGTCC TCGCGGTGAT GACGTGGTTC AAGGACCTGG TCGATGCCGG GGCCTATCCG
AAGAGCTTCT CGACCCTGAA GCTCGGGGAG TCGCACTACT ACTTCTACCA GAAGCCCGGC
GCGCTGGTTT TCCCCGACCC GAGCTGGTTC ACCGGCCGCG CCTTCGCGCC GCCGGAGAGC
GGCGGCATGC CGGCGGATTT TCCCCTCGGC ATCATGCAGT TCCCGGCCAT GGACGAGGGC
CGGTGCCCGA CCTGCAAGAC GCTCTCGGTC GCGGGCAGCT TCGTGGTCTA CGCGCGCAGC
CGCAACAAGG ACTGCGCCGG CGCGCTGCTC AGGTCCATCG GCAGCGTCGA GAACGGCACC
AAGTGGATGG AGCAGGTCTC GCTCCAGACC GGCCTCAAGT CCGACCCGAG CCGGATCAAG
TCCGCCCACG AGGATTATTT CCGCCAGCTG CAGGCCCGCA ACCAGGGCGT GACGTACTTC
TTCGGCACGC CGCTGTTCCA CTACCGGGCG AAGTGCGCCG AGACCTACAC CCAGGTGATC
AATAACGCGC TGCCGGCCGG GCTGATCTCC GTGAAGGACG CGGCCGAGCG GATGGACGCC
GCCTGCCACA AGGGAAGCTG A
 
Protein sequence
MGRGRVAALV LGAALAAGGA RAAEPTQITM WSNWPDEPAK REWVSARVKE FEAANAQCRV 
KLSFIPKADI YTQAKSAVRT GQAPDVFYME PDQPEFLQGG FLEPLETRVD TGAIEEWAKP
AWTAKGHLYG LPVEAYTVEL YYNRDLVRKV GVAVPESGQL TQGAFADLVR KGVAAGVTPV
AQGVGDRPFP GGLLLFESLL RKLGTEDYGK LLSGDLSFRD PRVLAVMTWF KDLVDAGAYP
KSFSTLKLGE SHYYFYQKPG ALVFPDPSWF TGRAFAPPES GGMPADFPLG IMQFPAMDEG
RCPTCKTLSV AGSFVVYARS RNKDCAGALL RSIGSVENGT KWMEQVSLQT GLKSDPSRIK
SAHEDYFRQL QARNQGVTYF FGTPLFHYRA KCAETYTQVI NNALPAGLIS VKDAAERMDA
ACHKGS