Gene M446_2974 details


Gene Information

Locus tag: M446_2974
Symbol:
ID: 6129920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3298924
End bp: 3300516
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 71%
IMG OID: 641643165
Product: extracellular solute-binding protein
Protein accession: YP_001769820
Protein GI: 170741165
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
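
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3298924
END_BP = 3300516


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1593 530
```

Under those assumptions the result agrees with the reported 1593 bp and 530 aa.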


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0342651
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGAC GGACCCGACC GGCCCCCCGC CTCGCCCGCA CCGCCCCGCG CGGTCGCGGA 
GCGGCGGCCC TGCTCCTTGC CGGGACGGCC TTCCTGGCCG GAACGGCCTT CCTGGCCGGG
GCGGCGGGCC CGGCCCTCGC CGGCAAGGCC AACGACACCC TGGTCTACGC CTCGGACAGC
GAGCCCGAGA ACGTCAGCCC CTATCACAAC AGCGCCCGCG AGGGGGTCAT CCTGGCCCGC
AACGCCTGGG ACACGCTGCT CTACCGCGAT CCCGCCACCG GCACCTACCA GCCGATGCTG
GCGACCGCCT GGACATGGGC CGATCCCGTC ACCCTCGACC TCACCATCCG GGAGGGCGTG
GTCTTCCACA ACGGCGATCC GCTCACGCCG GAGGACGTCG CCTTCACCTT CAACTACGTG
CTCACGCCCG AGGCGCGCAC GGTCACCAAG CAGAACGTCG ACTGGATGAA GTCCACCGAG
GTGCTGGGCC CGCACACGGT GCGCATCCAC CTGAAGGCGC CGTTCCCGGC CGCGCTCGAA
TACCTCGCGG GTCCGACCCC GATCTTCCCG GCGGCCTATT TCAAGAAGGT GGGGCTCGAC
GGCTTCGCCA AGGCGCCGGT CGGCACGGGG CCCTACCGGA TCGTCAGCGT CGAGAGCGGG
CGCGGCGTCA AGCTCGAGCG CTTCGAGAAG TACTGGTCCG GCAGCCCGAT CGGGCGGCCG
AAGATCGGCA AGCTCGAATT CCGGGTCATC CCGGACGCCG ACAGCCGGAT GGCCGAGCTC
GTCACCGGCG GCATCGACTG GATCTGGCGC GTGCCGAGCG ACCAGGCCGA TCAGCTGCGC
GCGGCGCCCG GGATCACGGT GCTGAGCGCC GAGACGATGC GGGTCGGCTT CCTGCAATTC
GACGTCGGCG GCCGGGCGAT GGAGAAGTCG CCCCTCAGGG ACGTGCGGGT GCGCCGGGCG
ATCTCCTACG CGATCGACCG CAAGGCGATG GTCGACAACC TCGTGCGCGG CGGCGCGCGC
GTGATGAACG TGCTGTGCTT CTCCGGGCAG TTCGGCTGCG TCGAGGAGGG CGCCCCGCGC
TACGCCTACG ATCCCGCCAA GGCCAAGGCG CTGCTCAAGG AGGCGGGCTA CCCGGACGGG
TTCGAGATCG ACCTCGCGGC CTATCGCGAG CGCGATTACG CCGAGGCCGT GATCGGCTAC
CTGCGGGCGG TCGGCATCCG GGCGCGGCTC AACTACCTGC GCTACGCCGC CTTCCGGGAC
GCGCTGCGCG GCGGCAAGGT CTCGATCGGC TTCCAGACCT GGGGCTCGTT CTCGGTCAAC
GACGTCTCGG CCTTCACGGG CGTGTATTTC CGCGGCGGCG ACGAGGATCT GACCCGCGAC
CCGGCGGTGA TCGCGGCGCT CCAGGCCGGC GACACCGCGA GCGACCCGGG CGAGCGCAAG
GCGAAGTACG CCGAGGCGCT CTCGCGCATC GCCGGCGAGG CCTACGCGCT GCCGATGTTC
TCCTACCCGT CGAACTACGC CTTCACCCAG GACCTGAACT TCACGGCGCA GCCCGACGAG
GTGCCGCGCT TCTACGCCGC CTCCTGGAAG TGA
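
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be used for any calculation. The following is a minimal sketch of that clean-up together with a GC-content calculation of the kind behind the GC content field. The helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Small illustrative inputs; paste the full gene sequence to check the record.
print(clean_sequence("ATGCTGAGAC GGACCCGACC"))  # ATGCTGAGACGGACCCGACC
print(gc_percent("GGCC AATT"))  # 50.0
```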
 
Protein sequence
MLRRTRPAPR LARTAPRGRG AAALLLAGTA FLAGTAFLAG AAGPALAGKA NDTLVYASDS 
EPENVSPYHN SAREGVILAR NAWDTLLYRD PATGTYQPML ATAWTWADPV TLDLTIREGV
VFHNGDPLTP EDVAFTFNYV LTPEARTVTK QNVDWMKSTE VLGPHTVRIH LKAPFPAALE
YLAGPTPIFP AAYFKKVGLD GFAKAPVGTG PYRIVSVESG RGVKLERFEK YWSGSPIGRP
KIGKLEFRVI PDADSRMAEL VTGGIDWIWR VPSDQADQLR AAPGITVLSA ETMRVGFLQF
DVGGRAMEKS PLRDVRVRRA ISYAIDRKAM VDNLVRGGAR VMNVLCFSGQ FGCVEEGAPR
YAYDPAKAKA LLKEAGYPDG FEIDLAAYRE RDYAEAVIGY LRAVGIRARL NYLRYAAFRD
ALRGGKVSIG FQTWGSFSVN DVSAFTGVYF RGGDEDLTRD PAVIAALQAG DTASDPGERK
AKYAEALSRI AGEAYALPMF SYPSNYAFTQ DLNFTAQPDE VPRFYAASWK
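
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons; since this gene starts with ATG, that difference is not handled here. The function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCTGAGAC GGACCCGACC GGCCCCCCGC"))  # MLRRTRPAPR
print(translate("TCCTGGAAGTGA"))  # SWK
```

Both fragments match the corresponding start and end of the listed protein sequence.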