Gene M446_5132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5132 
Symbol 
ID6131048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5636849 
End bp5639134 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content78% 
IMG OID641645267 
Productcell wall anchor domain-containing protein 
Protein accessionYP_001771892 
Protein GI170743237 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.137427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTCA CGCCCCTCCC GGCGGCGCGC GCCCGCGCGC CGCGCGCCGA CGCTCGGCCG 
ACCGCATGGT CGACCGCTCG GCCGACCTCT TGGCCGACCT CTTGGCCGAC CTCTTGGCCG
ACCTCTTGGC CGACCGCTCG GCCGACCTCT TGGCCGACCG CTTGGCCGGC CGCCCTGCGC
CGGGCGCTCG CCCTCTCGGC GCCCGTCCTC CTCGGCCTGC TCGCCTGGCT CGGCGGCCTC
GACGGCGCCC GCGCCGCACC GGGCGGCGGA GCGCTGCTGC TGCGCGGCCC GGCCCGCGAG
GCCGCGCCGG TCGAGGCGCC GCGCCTGCGG ACCGACATCG CCGTGACGGT GAGTGGCGCC
ACCGCCCGCG CCACGCTCAC CCAGGTCTTC CGCAACACCA CCGACCAGTG GGTCGAGGGC
ACCTACGTCT TCCCGCTGCC GGAGGACGCC GCCGTCGACA CGATGACGCT CGTCGTCGGC
GATCGCGTCA TCGCGGGGGA GATCCGCGCG CGCGAGGCCG CCCGCACCGC CTACGAGGCC
GCGCGCGAGA CCGGCCGCGC CGCCGCCCTC ACCGAGCAGG AGCGCCCGAA CCTGTTCACC
ACCAGCGTGG CCAATATCGG CCCCGGCGAG ACCGTGCTGG TGCAGATCGC GTTCCAGCAG
CCGGTGCGGC TGTCGGGCGG CACCCACGCC CTGCGCCTGC CCCTGGTCGT CGCGCCCCGC
TACAGCCCGG CGCCCGGCTT GCTCCAGCCG GCCGCCGAGG GGCCGGCGCG CGACCCGGTG
CCCGACCGGG CGCGGATCGC CCCGCCGGTC CTCGATCCGG CCGTGCACGG GCCCGTCAAC
CCGGTGACGC TCACCGTCAC CCTGCGGGCC GGCTTCCCCC TCGGGACGGT GGAGAGCGCC
ACCCACGCGA TCCGCGTCGA GGAGACCGGC CCCGACAGCC GCCGGGTGAC CCTCGCGGAC
GGCCCCGTGC CGGCGGACCG CGACTTCGCG CTGACCTGGC GCGCCGCTCC CAGCGCCGCG
CCCGCGGTCG GGCTCTTCCG CGAGCGGGTC GGGGAGGACG AGTACCTGCT CGCCGTGGTG
ACGCCGCCCG AGGGGCGGGC GCCGGCGCGG CGGCCCCGCG AGGTCACCTT CGTGATCGAC
AATTCCGGCT CCATGGCCGG CGCCTCGATG CGGCAGGCCA AGGCGAGCCT GCTCGTGGCC
CTCGACCGGC TCGGCCCGGC CGACCGCTTC AACGTGATCC GCTTCGACGA CACCATGGAC
CTGCTCTTCC CGGCCCCGGT CCCGGCCGAC GAGGCGCATC GCGACGCCGC CCGCCGCTTC
GTGGCGGCCC TGGAGGCGCG GGGCGGGACC GAGATGCTGC CGCCCCTGCG GGCCGCCCTC
GCCGACCCGC ATCCCGAGGA GGGCGACCGC GTGCGCCAGA TCGTGTTCCT GACCGACGGC
GCGATCGGCA ACGAGGAGCA GATCTTCTCC GCGATCAGCG CCGGGCGGGG CCGCTCGCGC
CTGTTCATGA TCGGCATCGG CTCGGCCCCG AACGGGCACC TGATGACCCA CGCGGCGGAA
CTCGGCGGCG GCAGCTACAC GGCGATCGGC ACGATCGACC AGGTGGCGGA GCGCACGGCC
GAGCTGCTCG CCAAGCTGGA GAGCCCGGTC GTCACCGACC TCGCGGCCGC CTTCTCGGAG
CCCGGCGTCG AGGCGACCCC GCGCCTCCTG CCCGACCTCT ACCGGGGCGA GCCGGTGGTC
CTCGCCGCCC GCCTGCGGGA GGCGACCGGC ACGCTGACCC TGCGCGGGCG GATCGGCGAG
GCGCCCTGGC AGCAGGTGCT GACCCTCGCC GAGGCGCGGG AGGGCAGCGG CATCTCGAAG
CTCTGGGCGC GGGCGAAGAT CGGCGAGGCC GAGACCGCCC GCCTCACCGG CCGCATGAGC
GCCGAGGCCG CCGACGCCGC GATCCTGCGG CTCGCCCTGG CGCACCGGCT GACGACCCGG
CTCACCAGCC TCGTCGCCCT CGACGTCACC CCGCGGCGAC CGCCGGGCGT CGCCCTCACG
GCCGCCGACC TGCCCCTGAA CCTGCCGGCG GGCTGGGACT TTTCGGCTCT GTTCGGCGGC
GAGGGGCGGA TGCCGCGCGC GCGGCGGGCC GAGGCTCCCG TCCCGCGCGC CGCGCAGGAG
GGCCGCGGGG TCGACCTGCC GCAGACCGGG ACCGACGCGC CGGCCCTGCT CTGGCTCGGC
CTCGTGCTGG CCGGCCTCGG CGCCGGGCTG CTCGGGCGCG GCGCGGGCTC GCGGAGGCCC
GCGTGA
 
Protein sequence
MLLTPLPAAR ARAPRADARP TAWSTARPTS WPTSWPTSWP TSWPTARPTS WPTAWPAALR 
RALALSAPVL LGLLAWLGGL DGARAAPGGG ALLLRGPARE AAPVEAPRLR TDIAVTVSGA
TARATLTQVF RNTTDQWVEG TYVFPLPEDA AVDTMTLVVG DRVIAGEIRA REAARTAYEA
ARETGRAAAL TEQERPNLFT TSVANIGPGE TVLVQIAFQQ PVRLSGGTHA LRLPLVVAPR
YSPAPGLLQP AAEGPARDPV PDRARIAPPV LDPAVHGPVN PVTLTVTLRA GFPLGTVESA
THAIRVEETG PDSRRVTLAD GPVPADRDFA LTWRAAPSAA PAVGLFRERV GEDEYLLAVV
TPPEGRAPAR RPREVTFVID NSGSMAGASM RQAKASLLVA LDRLGPADRF NVIRFDDTMD
LLFPAPVPAD EAHRDAARRF VAALEARGGT EMLPPLRAAL ADPHPEEGDR VRQIVFLTDG
AIGNEEQIFS AISAGRGRSR LFMIGIGSAP NGHLMTHAAE LGGGSYTAIG TIDQVAERTA
ELLAKLESPV VTDLAAAFSE PGVEATPRLL PDLYRGEPVV LAARLREATG TLTLRGRIGE
APWQQVLTLA EAREGSGISK LWARAKIGEA ETARLTGRMS AEAADAAILR LALAHRLTTR
LTSLVALDVT PRRPPGVALT AADLPLNLPA GWDFSALFGG EGRMPRARRA EAPVPRAAQE
GRGVDLPQTG TDAPALLWLG LVLAGLGAGL LGRGAGSRRP A