Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0497 |
Symbol | |
ID | 6129238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 586456 |
End bp | 588075 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641640819 |
Product | extracellular solute-binding protein |
Protein accession | YP_001767494 |
Protein GI | 170738839 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00026016 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTGAGT TGTCCCTCTC CCGCCGGCGC TTCGTCGCGG GCGCCGCGGC CCTGGCGGCC CTGGGTCCGA CCGGGTCCGC GCTCGCCCAG GGAGCAGCCT CCGGGAGCCT GACCTACGGC ATCTCGATGT TCGACCTGCC CCTCACCACC GGCCAGCCGG ACCGGGGTGC GGGCGGCTAC CAATTCACCG GGCTCACCCT CTACGACCCG CTGGTCGCCT GGGAACTCGA CGTGGCCGAC CGGCCCGGCA AGCTGATCCC GGGCCTCGCC ACCGCCTGGG AGAGCGATCC GGCCGACCGC CGGAACTGGA TCTTCCGCCT GCGCGAGGGC GTGACCTTCC ACGACGGCTC CGCCTTCGAC GCGGACGCGG TGATCTGGAA CTTCGAGAAG GTGCTCAACG ACAAGGCCGC GCACTACGAC CAGCGGCAGG CCTCGCAGGT GCGCCCGCGC CTGCCCTCGG TGGCCTCCTA CCGGAAGCTC GACGCCATGA CCGTGCAGGT CACCACCAAG GCGGTCGACG CGCTGTTCCC CTACCAGATG CTGTGGTTCC TGATCTCCTC GCCCGCCCAG TACGAGGCGG TGGGGCGCGA CTGGACCAAG TTCGCCTTCC AGCCCTCCGG CACCGGGCCC TACCGCATGG GCCAGCTCGT GCCGCGGGTG CGGCTCGACC TCGTGCCCAA CGAGACCTAC TGGAACAGGA AGCGGATGCC GAAGCTCGCG CGGCTGACGC TGACCTGCAT CCCCGACGCG CTCGCCCGCG CGAACGCGCT GCTCAGCGGC ACCGTCGACC TGATCGAGAC GCCCGCCCCC GACGCGGTGC CGCGGCTCAA GGCGGCGGGC ATGCGGGTCG TCGGCAACGA CACGCCGCAC GTCTGGAACT ACCACCTGTC GATACTGGAG GGCAGCCCCT GGCGGGACCT GCGCCTGCGC CGGGCGGCGA ACCTCGCCAT CGACCGCGAG GGCGTGGTGG CGCTGATGGG CGGGCTCGCC ACCCCGGCGG TCGGGCAGGT GCAGCCGTCG AGCCCCTGGT TCGGCAAGCC GAGCTTCGCG ATCCGCACCG ACGTCGAGGC CGCCCGCAAG CTCGTCGAGG AGGCCGGCTA CTCGGTCCGG AACCCGCTGC GCACCAAGTT CATCATCCCG ACCGGCGGCT CGGGCCAGAT GCTGTCGCTG CCGATCAACG AGTTCGTGCA GCAGAGCTGG GCCGAAATCG GCATCGCGGT GGAGTTCCAG CCGGTGGAGC TGGAGGTCGC CTACACGGCC TGGCGCCAGG GCGCGGCCGA TCCCTCGCTG AAGGGCGTCA CCGGCGGCAA CATCGCCTAC GTGACCTCGG ACCCGCTCTA CGCGATCCTG CGCTTCTACT CCTCCAGGCA GATCGCCCCG ACCGGCGTGA ACTGGAGCCA CTACCGCAAC CCCGAGGTCG ATGCCCTCTG CGACAAGGTC CAGGCGAGCT TCGACCCGGC CGAGCAGGAC CGGCTCCTCG CGCGCATCCA CGAGATCGTG GTGGACGACG CCGTGCAGGT CTGGGTGGTG CACGACACCA ACCCGCACGC CCTCTCGGCC AAGGTGAAGG GCTATACCCA GGCGCAGCAC TGGTTCCAGG ACCTCACCAC GCTGGCCTGA
|
Protein sequence | MSELSLSRRR FVAGAAALAA LGPTGSALAQ GAASGSLTYG ISMFDLPLTT GQPDRGAGGY QFTGLTLYDP LVAWELDVAD RPGKLIPGLA TAWESDPADR RNWIFRLREG VTFHDGSAFD ADAVIWNFEK VLNDKAAHYD QRQASQVRPR LPSVASYRKL DAMTVQVTTK AVDALFPYQM LWFLISSPAQ YEAVGRDWTK FAFQPSGTGP YRMGQLVPRV RLDLVPNETY WNRKRMPKLA RLTLTCIPDA LARANALLSG TVDLIETPAP DAVPRLKAAG MRVVGNDTPH VWNYHLSILE GSPWRDLRLR RAANLAIDRE GVVALMGGLA TPAVGQVQPS SPWFGKPSFA IRTDVEAARK LVEEAGYSVR NPLRTKFIIP TGGSGQMLSL PINEFVQQSW AEIGIAVEFQ PVELEVAYTA WRQGAADPSL KGVTGGNIAY VTSDPLYAIL RFYSSRQIAP TGVNWSHYRN PEVDALCDKV QASFDPAEQD RLLARIHEIV VDDAVQVWVV HDTNPHALSA KVKGYTQAQH WFQDLTTLA
|
| |