Gene M446_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3191 
Symbol 
ID6134078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3532361 
End bp3533368 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID641643379 
Productbasic membrane lipoprotein 
Protein accessionYP_001770031 
Protein GI170741376 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0191984 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC CGGGACGACA GCTGAGCCGG CGCACGCTGC TGGCGGCGGG CGGGGCGGCC 
CTCGCGGGGC TCGGATGCGG CGGCGCCCGC GCCGCGAAGG CGAAGCCCAT CGCCGTCGCC
CTCGTCGCCT CGGTGCCGAT CGAGCAGCAA TGGATCAGCC GCATCCACCT CGCCCTCAAG
GCGGCGCAGG CGCGCGGCGA CATCACCTAC GCGTATTCCG AGAACGTCGC CAACACCGAT
GCCGAGCGCG TCCTGCGCGA ATACGCGGAG GGCAGGAAGG ACCTGATCAT CGGCGAGGCC
TTCGGCCTGG AGCGGCCGGC GCGCCGGATC GCCGCCGACT ACAAGTCCAC CGCCTTCCTG
ATGGGCTCGT CCTTCCCGGC GCAGGCGCCC AACCTCTCGG TCTTCGACAA CTACATCCAG
GACGCCTCGT ACCTGACCGG GATCGTGGCC GGGAAGGAGA CGAGGACCAA CATCATCGGC
ATGGTCGGCG GCTACGCCAT CCCGGAGGTG AACCGGCTGA TGCACGCCTT CATGGCCGGC
GCCCGGTCCG TCAATCCGGA CGTCAAGTTC CTGGTCTCCT TCATCAATTC CTGGTACGAC
CCGCCCAAGG CGAAGGAGAC CGCCTTCGCC ATGATCGAGC GCAGGGCCGA CGTGCTCTAC
GCGGAGCGCT TCGGCGTCTC CGACGCCGCC AAGGAGCGCG GCGTCAAGGC GATCGGCAAC
GTCATCGACA CGGCCGCGCA ATACCCGGGC ACGGTGATCG CCTCGGCCCT CTGGCACATG
GAGCCGACGA TCGACCGCGC CGTCGCCAAG GTCATCGAGG GCAGCTTCAC GGCGGAGGAT
TACGGCCCCT GGAGCCACAT GGCCAAGGGC GGCTGCTCGC TCGCCCCCCT CGATCCGAAG
CTGGTGCCGC AGCCGGTGAT CGATCTCGTC CTCGCCAAGG AGAAGGAGAT CCGCGGCGGT
TCCTTCACGG TGGCGGTGAA CGATTCCGAG CCGAAATCCA GCGCCTGA
 
Protein sequence
MTDPGRQLSR RTLLAAGGAA LAGLGCGGAR AAKAKPIAVA LVASVPIEQQ WISRIHLALK 
AAQARGDITY AYSENVANTD AERVLREYAE GRKDLIIGEA FGLERPARRI AADYKSTAFL
MGSSFPAQAP NLSVFDNYIQ DASYLTGIVA GKETRTNIIG MVGGYAIPEV NRLMHAFMAG
ARSVNPDVKF LVSFINSWYD PPKAKETAFA MIERRADVLY AERFGVSDAA KERGVKAIGN
VIDTAAQYPG TVIASALWHM EPTIDRAVAK VIEGSFTAED YGPWSHMAKG GCSLAPLDPK
LVPQPVIDLV LAKEKEIRGG SFTVAVNDSE PKSSA