Gene M446_0497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0497 
Symbol 
ID6129238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp586456 
End bp588075 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content71% 
IMG OID641640819 
Productextracellular solute-binding protein 
Protein accessionYP_001767494 
Protein GI170738839 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00026016 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTGAGT TGTCCCTCTC CCGCCGGCGC TTCGTCGCGG GCGCCGCGGC CCTGGCGGCC 
CTGGGTCCGA CCGGGTCCGC GCTCGCCCAG GGAGCAGCCT CCGGGAGCCT GACCTACGGC
ATCTCGATGT TCGACCTGCC CCTCACCACC GGCCAGCCGG ACCGGGGTGC GGGCGGCTAC
CAATTCACCG GGCTCACCCT CTACGACCCG CTGGTCGCCT GGGAACTCGA CGTGGCCGAC
CGGCCCGGCA AGCTGATCCC GGGCCTCGCC ACCGCCTGGG AGAGCGATCC GGCCGACCGC
CGGAACTGGA TCTTCCGCCT GCGCGAGGGC GTGACCTTCC ACGACGGCTC CGCCTTCGAC
GCGGACGCGG TGATCTGGAA CTTCGAGAAG GTGCTCAACG ACAAGGCCGC GCACTACGAC
CAGCGGCAGG CCTCGCAGGT GCGCCCGCGC CTGCCCTCGG TGGCCTCCTA CCGGAAGCTC
GACGCCATGA CCGTGCAGGT CACCACCAAG GCGGTCGACG CGCTGTTCCC CTACCAGATG
CTGTGGTTCC TGATCTCCTC GCCCGCCCAG TACGAGGCGG TGGGGCGCGA CTGGACCAAG
TTCGCCTTCC AGCCCTCCGG CACCGGGCCC TACCGCATGG GCCAGCTCGT GCCGCGGGTG
CGGCTCGACC TCGTGCCCAA CGAGACCTAC TGGAACAGGA AGCGGATGCC GAAGCTCGCG
CGGCTGACGC TGACCTGCAT CCCCGACGCG CTCGCCCGCG CGAACGCGCT GCTCAGCGGC
ACCGTCGACC TGATCGAGAC GCCCGCCCCC GACGCGGTGC CGCGGCTCAA GGCGGCGGGC
ATGCGGGTCG TCGGCAACGA CACGCCGCAC GTCTGGAACT ACCACCTGTC GATACTGGAG
GGCAGCCCCT GGCGGGACCT GCGCCTGCGC CGGGCGGCGA ACCTCGCCAT CGACCGCGAG
GGCGTGGTGG CGCTGATGGG CGGGCTCGCC ACCCCGGCGG TCGGGCAGGT GCAGCCGTCG
AGCCCCTGGT TCGGCAAGCC GAGCTTCGCG ATCCGCACCG ACGTCGAGGC CGCCCGCAAG
CTCGTCGAGG AGGCCGGCTA CTCGGTCCGG AACCCGCTGC GCACCAAGTT CATCATCCCG
ACCGGCGGCT CGGGCCAGAT GCTGTCGCTG CCGATCAACG AGTTCGTGCA GCAGAGCTGG
GCCGAAATCG GCATCGCGGT GGAGTTCCAG CCGGTGGAGC TGGAGGTCGC CTACACGGCC
TGGCGCCAGG GCGCGGCCGA TCCCTCGCTG AAGGGCGTCA CCGGCGGCAA CATCGCCTAC
GTGACCTCGG ACCCGCTCTA CGCGATCCTG CGCTTCTACT CCTCCAGGCA GATCGCCCCG
ACCGGCGTGA ACTGGAGCCA CTACCGCAAC CCCGAGGTCG ATGCCCTCTG CGACAAGGTC
CAGGCGAGCT TCGACCCGGC CGAGCAGGAC CGGCTCCTCG CGCGCATCCA CGAGATCGTG
GTGGACGACG CCGTGCAGGT CTGGGTGGTG CACGACACCA ACCCGCACGC CCTCTCGGCC
AAGGTGAAGG GCTATACCCA GGCGCAGCAC TGGTTCCAGG ACCTCACCAC GCTGGCCTGA
 
Protein sequence
MSELSLSRRR FVAGAAALAA LGPTGSALAQ GAASGSLTYG ISMFDLPLTT GQPDRGAGGY 
QFTGLTLYDP LVAWELDVAD RPGKLIPGLA TAWESDPADR RNWIFRLREG VTFHDGSAFD
ADAVIWNFEK VLNDKAAHYD QRQASQVRPR LPSVASYRKL DAMTVQVTTK AVDALFPYQM
LWFLISSPAQ YEAVGRDWTK FAFQPSGTGP YRMGQLVPRV RLDLVPNETY WNRKRMPKLA
RLTLTCIPDA LARANALLSG TVDLIETPAP DAVPRLKAAG MRVVGNDTPH VWNYHLSILE
GSPWRDLRLR RAANLAIDRE GVVALMGGLA TPAVGQVQPS SPWFGKPSFA IRTDVEAARK
LVEEAGYSVR NPLRTKFIIP TGGSGQMLSL PINEFVQQSW AEIGIAVEFQ PVELEVAYTA
WRQGAADPSL KGVTGGNIAY VTSDPLYAIL RFYSSRQIAP TGVNWSHYRN PEVDALCDKV
QASFDPAEQD RLLARIHEIV VDDAVQVWVV HDTNPHALSA KVKGYTQAQH WFQDLTTLA