Gene M446_6664 details


Gene Information

Locus tag: M446_6664
Symbol:
ID: 6134985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 7329051
End bp: 7330409
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 72%
IMG OID: 641646751
Product: extracellular solute-binding protein
Protein accession: YP_001773350
Protein GI: 170744695
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
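
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are inclusive coordinates and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 7329051
end_bp = 7330409
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
coding_codons = gene_length_bp // 3 - 1

print(span, coding_codons)  # 1359 452
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.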


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0689323
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGGG CAAATGATCG CGCGCAAGCG CGTGCCGCGA CCTCCTGGTG GAGGGCCGCG 
CGCCTCGCGG CCGCACTCCT CCTCGGCGGC GCGGCCGGCG CGGCGGCGGC GGACCTCGAC
CTGTTCTTCC CGGTGCCGGT CGACGGGGCC CTGGCCAGGA CCATGACGGG GCTGGTCAAG
GAGTTCAACG AGGCCCATCC CGGCACCCGG GTGACGCCCG TCTTCACGGG CTCCTACGAC
GACACGCTGC TGAAGACCCG GGCGGCGATC AAGGCCGGCA AGCCGCCGGG CGCCGTCATC
ATGTCGGCGA ACTTCCTCAC CGACCTCGCC ATCGAGCGCG AGATCGCGCC CTTCGACGAC
CTGATCGCGG CCGAGGGCGG GACCCCGGAC GCCTTCATGG ACCAGTTCTT CCCGGCGCTG
AAGGGCAACG CCGTCGTCGA GCGCAAGGTC TACGGGGTGC CGTTCCACAA CTCGACCCCC
CTGCTCTACT ACAATGTCGA GCAGTTCCGC GAGGCCGGCC TCGACCCCGA CGCGCCGCCG
CGGACCTGGG ACGCGCTCGC CGCCGCCGCC CGCAAGCTCA CGCGGCGCGA GGGCGGCCGG
GTCACCCGCT GGGGGATCAT GATGCCGTCC AACTACGATT ACGGCGGCTG GATCCTGCAG
GCGCTCACCC TGTCGAATGG CGGGCGCTGG TACAACGAGG AGTATGGCGG CGAGGTCTAC
TACGACACGC CGACGGTGCT GGGCGCCCTG AGCTTCTGGG CCGACCTCGT GCACAGGGCC
AAGGTGCATC CGGCCGGCGA GATCAAGGGC CCGGCGGTCA CGGCCGCGTT CCTGTCGGGC
CAGGCCGCGA TGATGATCAT CTCGACCGGT TCCCTCACCT TCATCCGCGA CAGCGCCAAG
TTCCCGTTCC GGGTCGCCTT CGTGCCGATG AACGTGCGCC CGGCGGTGCC GATCGGCGGC
GCCTCCCTGG TCCAGCCGAC CGGCCTCGAC CCCGAGACCC GCAAGGCCGG CTGGACCCTG
ATCCGATGGC TGACCTCGCC GGCGATCTCG GGCCGCTGGA GCCGGGCGAC CGGCTACTTC
GCGCCGAACC GGGCCGCCTA CGACCTGCCC GAGATGCGGG CCTTCCTCGC CGGGAACCCG
GACGCGAAGA TCGCCGTCGA CCAGCTCGCC AACGCCAAGC CCTGGTTTGC CACCTACCGC
ACGGTGCCGG TGCGCAAGGC GATCGAGGAC GAGCTGCAGG CGGTGCTCGC CGGCAAGCGC
CAGCCGAAGG AGGCCCTCGC CGCCGCCCAG ACGAGCGCCG ACGCGATCCT GCGCCCCTAC
GTCGAGGACA CGGCGCTGCG GCTGCCGGGC GCGGAGTGA
 
Protein sequence
MKRANDRAQA RAATSWWRAA RLAAALLLGG AAGAAAADLD LFFPVPVDGA LARTMTGLVK 
EFNEAHPGTR VTPVFTGSYD DTLLKTRAAI KAGKPPGAVI MSANFLTDLA IEREIAPFDD
LIAAEGGTPD AFMDQFFPAL KGNAVVERKV YGVPFHNSTP LLYYNVEQFR EAGLDPDAPP
RTWDALAAAA RKLTRREGGR VTRWGIMMPS NYDYGGWILQ ALTLSNGGRW YNEEYGGEVY
YDTPTVLGAL SFWADLVHRA KVHPAGEIKG PAVTAAFLSG QAAMMIISTG SLTFIRDSAK
FPFRVAFVPM NVRPAVPIGG ASLVQPTGLD PETRKAGWTL IRWLTSPAIS GRWSRATGYF
APNRAAYDLP EMRAFLAGNP DAKIAVDQLA NAKPWFATYR TVPVRKAIED ELQAVLAGKR
QPKEALAAAQ TSADAILRPY VEDTALRLPG AE
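
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: the function names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon-to-amino-acid mapping is that of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The sketch also assumes the sequence contains only A, C, G, and T.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above (30 bases, 10 codons).
first_blocks = "ATGAAGAGGG CAAATGATCG CGCGCAAGCG"
print(translate(first_blocks))  # MKRANDRAQA
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.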