Gene M446_5343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5343 
Symbol 
ID6134678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5870593 
End bp5872200 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content74% 
IMG OID641645473 
Productextracellular solute-binding protein 
Protein accessionYP_001772095 
Protein GI170743440 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0289508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCGG CCATGCGCCG CCGCGACCTC CTCGCCGCGA GCGGCGGGGC CATCCTCGGG 
GCCGCGGCGC GCGGCGCGCG CGCGGCCGGG CCGGCCCTCT ACCGGCGCGG CAACGACGCC
GATCCGGAAA CCCTCGATCC GCACCGCTCC TCGACCCTCG CGGAGGCCAC GATCCTCCTC
GACCTCTTCG AGGGCCTGAC CTGCTACGAC GCGGCCGGGC GGATCGTGCC CGGGGCGGCG
GAGAGCTGGA CCACCTCCGA GGACGGCCTC ACCTGGAGCT TCACCCTGCG CCCGGACGGG
CGCTGGTCGA ACGGCGACCC CGTGGTGGCG GAGGATTTCC TGGCCGGGTT CCGCCGCATC
CTCGACCCGG CGACCGGCGC CAAATACGCC AACGTGCTGT TCCCGATCCG CGGGGCGGAG
GCGGCGAACC GGGGCGCCGC GCCGGTGGAG CGGATCGGCG TCGCGGCGCC GGATCCGCGC
CGGGTGGTGA TCACGCTCGC CCAGCCGACC CCCTACCTCC TCGAACTGAT GACCCATCAG
GCGAGTTCGC CGATCCACCG GCCAACGCTC GCCCGGCACG GCGACGCCTT CAGCCGGCCC
GGGCTGCTCG TCTCGAACGG CGCCTACCGG CTGACGGATT TCGTGCCGAA CGACCGGGTC
ACGGCGGCGC GCAACCCGCA TTTCCGCGAG GCCGGGCGCG TGCGGATCCC CGAGATCGCC
TACATCCCGA CCCCGGACCT CGCGGCGGCG GTGCGGCGCT TCGCGGCCGG CGAGATCGAT
TCCCTGAGCG ATCTGCCGGC CGACCAGATC CGGGCCCTGA AGGCCCGCTT CGGCGAGCAG
GTCTGGCTCG CGCCGGCGCT CGGCGTGCTG GTGCTGATGG TCAACCTGCG CCGGGCGCCG
CTCGGCGACC GGCGGATCCG CCAGGCCCTG TCGCTCGCCA TCGACCGGGA ATTCCTGGCC
GAGGCGGTCT GGGGCGAGTC GATGCTGCCG GCCTACTCGC TGACGCCGGC GGGGCTCGAC
AACGCCCTGC CGCCCCCCGA GATGCCGGGC CGGGACCTCT CGCCCCTGGA GCGGGAGGAC
CGCGCGGTGG CGCTGCTGCG CGAGGCCGGA TACGGGCCGG GAGGGCGGCC GCTGCCCGTC
GAGATCCGCT ACAACACCAC CGACAACAAC CGGAACACCA TGGTGGCGAT CGCCGACATG
TGGCAGCCGC TCGGGGTCAG CACGCGCCTC GTCAACACCG ATGCCAAGAC CCACTTCGCG
CATCTGCGCG ACGGCGGACC GTTCGACCTC GCCCGCTACG CCTGGATCGC CGATTACGCG
GACCCGCAGA ACTTCCTGTT CCTGGTCGAG AGCGACAACG ACGGCTTCAA CTCGGGCCAC
TACGCCAACC CGGCCTACGA CGCGCTGATG CGCGAGGCGG CCGGGACCGT CGATCTCGCG
CGGCGGGCCG GGCTGCTGCA CCGGGCGGAG GAGGTCTTCC TCGCCGACCT GCCCTGGATC
CCGCTGCTGC ACTACCGGCA CAAGCACATG GTCTCGCCGC GGCTGCGCGG CATCGTGCCC
AACCTGCGCG GCGTCTCCCC GACCCGCTAC CTCTGGCTGG ACGGATGA
 
Protein sequence
MRPAMRRRDL LAASGGAILG AAARGARAAG PALYRRGNDA DPETLDPHRS STLAEATILL 
DLFEGLTCYD AAGRIVPGAA ESWTTSEDGL TWSFTLRPDG RWSNGDPVVA EDFLAGFRRI
LDPATGAKYA NVLFPIRGAE AANRGAAPVE RIGVAAPDPR RVVITLAQPT PYLLELMTHQ
ASSPIHRPTL ARHGDAFSRP GLLVSNGAYR LTDFVPNDRV TAARNPHFRE AGRVRIPEIA
YIPTPDLAAA VRRFAAGEID SLSDLPADQI RALKARFGEQ VWLAPALGVL VLMVNLRRAP
LGDRRIRQAL SLAIDREFLA EAVWGESMLP AYSLTPAGLD NALPPPEMPG RDLSPLERED
RAVALLREAG YGPGGRPLPV EIRYNTTDNN RNTMVAIADM WQPLGVSTRL VNTDAKTHFA
HLRDGGPFDL ARYAWIADYA DPQNFLFLVE SDNDGFNSGH YANPAYDALM REAAGTVDLA
RRAGLLHRAE EVFLADLPWI PLLHYRHKHM VSPRLRGIVP NLRGVSPTRY LWLDG