Gene M446_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3922 
Symbol 
ID6134894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4369701 
End bp4371281 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content74% 
IMG OID641644080 
Productextracellular solute-binding protein 
Protein accessionYP_001770722 
Protein GI170742067 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.824445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0767976 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGAC GCCAAGTCCT GACACTCGGG GGCGCCGCCC TGGCCTGTCC GGCCCTCGCC 
CGCGCGGCGA GCGAGACGAC GCTGCGCTTC GTGCCCTACG CCGACCTGGC GCTGCTCGAC
CCGATCATCA CCACGAACTA CGTCACGCGG ACGCACGCGC TCCTGGTCTT CGACACGCTC
TACGGGACCG ACGCGCAGTT CCGGCCTCAG CCCCAGATGG TGGCGGGGCA CGAGGTCGAG
GCCGACGGGC GCCTCTGGCG GCTGACCCTG CGGGAGGGCC TGCGCTTCCA CGACGGCAGC
CCGGTCCTCG CCCGCGACGC GGTGGCGAGC CTCAGGCGCT GGGCCGTGCG GGACGCCTTC
GGCGGTGCGC TCTTCGCGGC CCTGGACGAG ATCTCGGCAC CCTCCGACCG CGTGGTGCAG
TTCCGCATGA GGCGGCCCTT CCCGCTGCTG CCCCAGGCGC TGGCCAAGCC GACCTCGTAC
GTGCCGGTCA TCATGCCCGA GCGCCTCGCC GCGACGCCCG CGACGAGCGC GGTGCCGGAG
ATGGTCGGCA GCGGTCCCTA CCGCTTCGTC GCGCAGGAGC GGGTCCCGGG GGCGCTCGCG
GTCTACCGTC GCTTCGCCGA GTACCGGCCG CGGGAGGGGG GCGAGGCGAG CTTCACGGCC
GGGCCGCGGA TCGCCCATTT CGAGCGGGTC GAGTGGCGCA CCATGCCCGA TCCCGCCACC
GCGGCGAGCG CGCTGCGGGC CGGCGAGGTC GACTGGATCG AGCAGCCCGC GATCGACCTC
GTGCCGCAGC TCGCGCGCGC CCGGGGCGTC ACGGTGGCGG TGGTCGAGCC GGCGGGGCTG
ATCGGGCAGA TCCGCTTCAA CCACCTGCAG CCGCCCTTCG ACAACCCGGC CATCTGCCGG
GCCTTCCTGG GGGCGGTCGA CCAGACCGAG ATGATGGACG CGGTGGCCGG CACCGATCCG
GCGATCCTGC GCGGGCCGGC CGGCATCTTC ACGCCGGGCG GGCCGATGGC CTCCGAGGCC
GGGATGGAGA TCCTGACCGG TCCCCGCGAC ATCGCCCGCA GCCGGCGCGA ACTGGAAGCG
GCCGGCTACC GCGGCGAGCC GGTGGTGCTC CTCGCCGGCA CGGACGTGCC GCGGATTAAC
GCGGTCTGCG AGGTCATGGC GGAGGTCTGC CGCCGGCTCG GCGTCGCCCT CGACTACGTC
GCCACCGATT GGGGCACGGT CAACCAGCGC ATCCTCAACC CGAAGCCCCT GGACCAGGGC
GGCTGGAGCC TGTTCGGCAT CTTCTCCGGC GGGCTCGATC ACCTCTCGCC GGCCTACCAC
CTCGCGACCC GCGGCATCGG CCGGGCCGGC GTGCCGAGCT GGCTCACCGA CGCCCCGCTG
GAGGAGCTCC GCGACGCGTG GTTCGCGGCG CCCGACCTCG CCGCCCAGCA GGCGATCGCG
GCGAAGATCC AGGCCCGCGC CCTCGCGGTC GGCGCCTACA TTCCCTGCGG CCGCTACGTC
CAGCCGACGG CCTACCGGTC GGAGCTGACC GGGATGCTCA CCGGGCTGCC CCTGTTCACC
AACCTGCGGC GGGGCGGGTA G
 
Protein sequence
MNRRQVLTLG GAALACPALA RAASETTLRF VPYADLALLD PIITTNYVTR THALLVFDTL 
YGTDAQFRPQ PQMVAGHEVE ADGRLWRLTL REGLRFHDGS PVLARDAVAS LRRWAVRDAF
GGALFAALDE ISAPSDRVVQ FRMRRPFPLL PQALAKPTSY VPVIMPERLA ATPATSAVPE
MVGSGPYRFV AQERVPGALA VYRRFAEYRP REGGEASFTA GPRIAHFERV EWRTMPDPAT
AASALRAGEV DWIEQPAIDL VPQLARARGV TVAVVEPAGL IGQIRFNHLQ PPFDNPAICR
AFLGAVDQTE MMDAVAGTDP AILRGPAGIF TPGGPMASEA GMEILTGPRD IARSRRELEA
AGYRGEPVVL LAGTDVPRIN AVCEVMAEVC RRLGVALDYV ATDWGTVNQR ILNPKPLDQG
GWSLFGIFSG GLDHLSPAYH LATRGIGRAG VPSWLTDAPL EELRDAWFAA PDLAAQQAIA
AKIQARALAV GAYIPCGRYV QPTAYRSELT GMLTGLPLFT NLRRGG