Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3922 |
Symbol | |
ID | 6134894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 4369701 |
End bp | 4371281 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641644080 |
Product | extracellular solute-binding protein |
Protein accession | YP_001770722 |
Protein GI | 170742067 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.824445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0767976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAC GCCAAGTCCT GACACTCGGG GGCGCCGCCC TGGCCTGTCC GGCCCTCGCC CGCGCGGCGA GCGAGACGAC GCTGCGCTTC GTGCCCTACG CCGACCTGGC GCTGCTCGAC CCGATCATCA CCACGAACTA CGTCACGCGG ACGCACGCGC TCCTGGTCTT CGACACGCTC TACGGGACCG ACGCGCAGTT CCGGCCTCAG CCCCAGATGG TGGCGGGGCA CGAGGTCGAG GCCGACGGGC GCCTCTGGCG GCTGACCCTG CGGGAGGGCC TGCGCTTCCA CGACGGCAGC CCGGTCCTCG CCCGCGACGC GGTGGCGAGC CTCAGGCGCT GGGCCGTGCG GGACGCCTTC GGCGGTGCGC TCTTCGCGGC CCTGGACGAG ATCTCGGCAC CCTCCGACCG CGTGGTGCAG TTCCGCATGA GGCGGCCCTT CCCGCTGCTG CCCCAGGCGC TGGCCAAGCC GACCTCGTAC GTGCCGGTCA TCATGCCCGA GCGCCTCGCC GCGACGCCCG CGACGAGCGC GGTGCCGGAG ATGGTCGGCA GCGGTCCCTA CCGCTTCGTC GCGCAGGAGC GGGTCCCGGG GGCGCTCGCG GTCTACCGTC GCTTCGCCGA GTACCGGCCG CGGGAGGGGG GCGAGGCGAG CTTCACGGCC GGGCCGCGGA TCGCCCATTT CGAGCGGGTC GAGTGGCGCA CCATGCCCGA TCCCGCCACC GCGGCGAGCG CGCTGCGGGC CGGCGAGGTC GACTGGATCG AGCAGCCCGC GATCGACCTC GTGCCGCAGC TCGCGCGCGC CCGGGGCGTC ACGGTGGCGG TGGTCGAGCC GGCGGGGCTG ATCGGGCAGA TCCGCTTCAA CCACCTGCAG CCGCCCTTCG ACAACCCGGC CATCTGCCGG GCCTTCCTGG GGGCGGTCGA CCAGACCGAG ATGATGGACG CGGTGGCCGG CACCGATCCG GCGATCCTGC GCGGGCCGGC CGGCATCTTC ACGCCGGGCG GGCCGATGGC CTCCGAGGCC GGGATGGAGA TCCTGACCGG TCCCCGCGAC ATCGCCCGCA GCCGGCGCGA ACTGGAAGCG GCCGGCTACC GCGGCGAGCC GGTGGTGCTC CTCGCCGGCA CGGACGTGCC GCGGATTAAC GCGGTCTGCG AGGTCATGGC GGAGGTCTGC CGCCGGCTCG GCGTCGCCCT CGACTACGTC GCCACCGATT GGGGCACGGT CAACCAGCGC ATCCTCAACC CGAAGCCCCT GGACCAGGGC GGCTGGAGCC TGTTCGGCAT CTTCTCCGGC GGGCTCGATC ACCTCTCGCC GGCCTACCAC CTCGCGACCC GCGGCATCGG CCGGGCCGGC GTGCCGAGCT GGCTCACCGA CGCCCCGCTG GAGGAGCTCC GCGACGCGTG GTTCGCGGCG CCCGACCTCG CCGCCCAGCA GGCGATCGCG GCGAAGATCC AGGCCCGCGC CCTCGCGGTC GGCGCCTACA TTCCCTGCGG CCGCTACGTC CAGCCGACGG CCTACCGGTC GGAGCTGACC GGGATGCTCA CCGGGCTGCC CCTGTTCACC AACCTGCGGC GGGGCGGGTA G
|
Protein sequence | MNRRQVLTLG GAALACPALA RAASETTLRF VPYADLALLD PIITTNYVTR THALLVFDTL YGTDAQFRPQ PQMVAGHEVE ADGRLWRLTL REGLRFHDGS PVLARDAVAS LRRWAVRDAF GGALFAALDE ISAPSDRVVQ FRMRRPFPLL PQALAKPTSY VPVIMPERLA ATPATSAVPE MVGSGPYRFV AQERVPGALA VYRRFAEYRP REGGEASFTA GPRIAHFERV EWRTMPDPAT AASALRAGEV DWIEQPAIDL VPQLARARGV TVAVVEPAGL IGQIRFNHLQ PPFDNPAICR AFLGAVDQTE MMDAVAGTDP AILRGPAGIF TPGGPMASEA GMEILTGPRD IARSRRELEA AGYRGEPVVL LAGTDVPRIN AVCEVMAEVC RRLGVALDYV ATDWGTVNQR ILNPKPLDQG GWSLFGIFSG GLDHLSPAYH LATRGIGRAG VPSWLTDAPL EELRDAWFAA PDLAAQQAIA AKIQARALAV GAYIPCGRYV QPTAYRSELT GMLTGLPLFT NLRRGG
|
| |