Gene M446_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1969 
Symbol 
ID6134356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2195368 
End bp2197134 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content67% 
IMG OID641642200 
Productextracellular solute-binding protein 
Protein accessionYP_001768868 
Protein GI170740213 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.386461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGCC ACAGCCGCGC CCTGCTCCAC GGTGCGAGCG CGCTCGCGCT CGGCGCCGCC 
CTCGCGCTCG GCCCCGCCGC CCCCGCGCGG GCCGGCATGG AGGAGGCCAG GCGCTGGGTC
GAGACCGAGT TCCAGCCCTC GACGCTGTCC AAGGACGAGC AGCTCAAGGA GATGCAGTGG
TTCGTCGACG CGGCGAAGCC CTTCGTCGGC CAGGAGATCA ACGTCGTCTC CGAGACCCTC
ACCACGCACG AGTACGAGGC CAAGACCCTG GCCAAGGCCT TCACGGAGAT CACCGGGATC
AGGATCCGCC ACGACGTCAT CCAGGAGGGC GACGTCGTCG AAAAGATCCA GACGCAGATG
CAGTCGGGCA AGAACATCTA CGACGGCTGG ATCAACGATT CCGACTTCAT CGGCACCCAC
GCCCGCTACA ACCAGACCGT CAACCTGACC GACTGGATGG CCGGCGCGGG CCGGGACGTC
ACCCTGCCGA GCCTCGACGT CGAGGATTTC ATCGGCAAGT CGTTCGGCAC CTGGACCGAT
GGCAAGCTGT TCCAGCTGCC CGACCAGCAA TTCGCCAACC TGTACTGGTT CCGTTACGAC
TGGTTCCAGC GCCCCGACCT CAAGGAGAAG TTCAAGGCCA AGTACGGCTA CGAACTCGGC
GTGCCGGTGA ACTGGTCGGC CTACGAGGAC ATCGCGGAAT TCTTCACCAA CGACGTGAAG
GAGATCGATG GCCAGCGGGT CTACGGCCAT ATGGATTACG GCAAGAAGGA CCCGTCGCTG
GGCTGGCGGT TCACCGACGC GTGGCTCTCG ATGGCCGGCA ACGGCGACAA GGGCATCCCG
AACGGCAAGC CGGTGGACGA GTGGGGCATC CGCCTCGACG GCTGCCGGCC GGTCGGCTCC
TCGGTCGAGC GCGGCGGGGA CACCAACGCC CCGGCCTCCG TCTACGCCGT GACCAAGTAC
GTCGAGTGGC TGAAGAAGTA CGCGCCGCCG CAGGCCGCCG GCATGACCTT CTCGGAATCC
GGGCCGGTGC CGGCGCAGGG CAACGTCGCC CAGCAGATCT TCTGGTACAC CGCCTTCACG
GCCGACATGG TCAAGCCCGG CCTGCCGGTG GTGAACCCGG ACGGCTCGCC GAAATGGCGC
GTCGCGCCCT CGCCGCACGG CGCCTACTGG AAGGAGGGCA TGAAGCTCGG CTACCAGGAT
GCCGGCTCGG TCACGCTGCT CAACTCCACC CCGGTCGAGC GCCGCAAGGC CGCCTGGCTG
TACCTCCAGT TCATCAACTC GAAATCGGTG AGCCTGAAGA AGAGCCACGT CGGCCTCACC
TTCACGCGCG AGAGCGACAT CTGGGACAAG TCCTTCACCG AGCGGGCGCC GAGGCTCGGC
GGGCTGATCG AGTTCTACCG CTCGCCGGCC CGGGTGCAGT GGACGCCGAC CGGCGTCAAC
GTGCCGGACT ACCCGAAGCT GGCGCAGCTC TGGTGGCAGA ACATCGGCGA CGCCTCCTCG
GGCGCCAAGA CCCCGCAGGC GGCCATGGAC GCGCTCGCGG CCGCCCAGGA CGACGTGATG
GCCCGGCTCG AACGCTCCAA GGTCCAGGGC GAGTGCGGGC CGAAGCTCAA CCCCAAATCC
TCGGCCGAGG AATGGTACAA GAGGGCCGAG ACGAGCGGCA CCATCGCGCC CCAGCGCAAG
CTCTCCACCG AGAAGCCGAA GGGCGAGACG GTGGATTACG ACACGCTGAT CAAGAGCTGG
CCGGCCTCGC CGCCGCGCCG CAGCTGA
 
Protein sequence
MTRHSRALLH GASALALGAA LALGPAAPAR AGMEEARRWV ETEFQPSTLS KDEQLKEMQW 
FVDAAKPFVG QEINVVSETL TTHEYEAKTL AKAFTEITGI RIRHDVIQEG DVVEKIQTQM
QSGKNIYDGW INDSDFIGTH ARYNQTVNLT DWMAGAGRDV TLPSLDVEDF IGKSFGTWTD
GKLFQLPDQQ FANLYWFRYD WFQRPDLKEK FKAKYGYELG VPVNWSAYED IAEFFTNDVK
EIDGQRVYGH MDYGKKDPSL GWRFTDAWLS MAGNGDKGIP NGKPVDEWGI RLDGCRPVGS
SVERGGDTNA PASVYAVTKY VEWLKKYAPP QAAGMTFSES GPVPAQGNVA QQIFWYTAFT
ADMVKPGLPV VNPDGSPKWR VAPSPHGAYW KEGMKLGYQD AGSVTLLNST PVERRKAAWL
YLQFINSKSV SLKKSHVGLT FTRESDIWDK SFTERAPRLG GLIEFYRSPA RVQWTPTGVN
VPDYPKLAQL WWQNIGDASS GAKTPQAAMD ALAAAQDDVM ARLERSKVQG ECGPKLNPKS
SAEEWYKRAE TSGTIAPQRK LSTEKPKGET VDYDTLIKSW PASPPRRS