Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5132 |
Symbol | |
ID | 6131048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5636849 |
End bp | 5639134 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641645267 |
Product | cell wall anchor domain-containing protein |
Protein accession | YP_001771892 |
Protein GI | 170743237 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.137427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCTCA CGCCCCTCCC GGCGGCGCGC GCCCGCGCGC CGCGCGCCGA CGCTCGGCCG ACCGCATGGT CGACCGCTCG GCCGACCTCT TGGCCGACCT CTTGGCCGAC CTCTTGGCCG ACCTCTTGGC CGACCGCTCG GCCGACCTCT TGGCCGACCG CTTGGCCGGC CGCCCTGCGC CGGGCGCTCG CCCTCTCGGC GCCCGTCCTC CTCGGCCTGC TCGCCTGGCT CGGCGGCCTC GACGGCGCCC GCGCCGCACC GGGCGGCGGA GCGCTGCTGC TGCGCGGCCC GGCCCGCGAG GCCGCGCCGG TCGAGGCGCC GCGCCTGCGG ACCGACATCG CCGTGACGGT GAGTGGCGCC ACCGCCCGCG CCACGCTCAC CCAGGTCTTC CGCAACACCA CCGACCAGTG GGTCGAGGGC ACCTACGTCT TCCCGCTGCC GGAGGACGCC GCCGTCGACA CGATGACGCT CGTCGTCGGC GATCGCGTCA TCGCGGGGGA GATCCGCGCG CGCGAGGCCG CCCGCACCGC CTACGAGGCC GCGCGCGAGA CCGGCCGCGC CGCCGCCCTC ACCGAGCAGG AGCGCCCGAA CCTGTTCACC ACCAGCGTGG CCAATATCGG CCCCGGCGAG ACCGTGCTGG TGCAGATCGC GTTCCAGCAG CCGGTGCGGC TGTCGGGCGG CACCCACGCC CTGCGCCTGC CCCTGGTCGT CGCGCCCCGC TACAGCCCGG CGCCCGGCTT GCTCCAGCCG GCCGCCGAGG GGCCGGCGCG CGACCCGGTG CCCGACCGGG CGCGGATCGC CCCGCCGGTC CTCGATCCGG CCGTGCACGG GCCCGTCAAC CCGGTGACGC TCACCGTCAC CCTGCGGGCC GGCTTCCCCC TCGGGACGGT GGAGAGCGCC ACCCACGCGA TCCGCGTCGA GGAGACCGGC CCCGACAGCC GCCGGGTGAC CCTCGCGGAC GGCCCCGTGC CGGCGGACCG CGACTTCGCG CTGACCTGGC GCGCCGCTCC CAGCGCCGCG CCCGCGGTCG GGCTCTTCCG CGAGCGGGTC GGGGAGGACG AGTACCTGCT CGCCGTGGTG ACGCCGCCCG AGGGGCGGGC GCCGGCGCGG CGGCCCCGCG AGGTCACCTT CGTGATCGAC AATTCCGGCT CCATGGCCGG CGCCTCGATG CGGCAGGCCA AGGCGAGCCT GCTCGTGGCC CTCGACCGGC TCGGCCCGGC CGACCGCTTC AACGTGATCC GCTTCGACGA CACCATGGAC CTGCTCTTCC CGGCCCCGGT CCCGGCCGAC GAGGCGCATC GCGACGCCGC CCGCCGCTTC GTGGCGGCCC TGGAGGCGCG GGGCGGGACC GAGATGCTGC CGCCCCTGCG GGCCGCCCTC GCCGACCCGC ATCCCGAGGA GGGCGACCGC GTGCGCCAGA TCGTGTTCCT GACCGACGGC GCGATCGGCA ACGAGGAGCA GATCTTCTCC GCGATCAGCG CCGGGCGGGG CCGCTCGCGC CTGTTCATGA TCGGCATCGG CTCGGCCCCG AACGGGCACC TGATGACCCA CGCGGCGGAA CTCGGCGGCG GCAGCTACAC GGCGATCGGC ACGATCGACC AGGTGGCGGA GCGCACGGCC GAGCTGCTCG CCAAGCTGGA GAGCCCGGTC GTCACCGACC TCGCGGCCGC CTTCTCGGAG CCCGGCGTCG AGGCGACCCC GCGCCTCCTG CCCGACCTCT ACCGGGGCGA GCCGGTGGTC CTCGCCGCCC GCCTGCGGGA GGCGACCGGC ACGCTGACCC TGCGCGGGCG GATCGGCGAG GCGCCCTGGC AGCAGGTGCT GACCCTCGCC GAGGCGCGGG AGGGCAGCGG CATCTCGAAG CTCTGGGCGC GGGCGAAGAT CGGCGAGGCC GAGACCGCCC GCCTCACCGG CCGCATGAGC GCCGAGGCCG CCGACGCCGC GATCCTGCGG CTCGCCCTGG CGCACCGGCT GACGACCCGG CTCACCAGCC TCGTCGCCCT CGACGTCACC CCGCGGCGAC CGCCGGGCGT CGCCCTCACG GCCGCCGACC TGCCCCTGAA CCTGCCGGCG GGCTGGGACT TTTCGGCTCT GTTCGGCGGC GAGGGGCGGA TGCCGCGCGC GCGGCGGGCC GAGGCTCCCG TCCCGCGCGC CGCGCAGGAG GGCCGCGGGG TCGACCTGCC GCAGACCGGG ACCGACGCGC CGGCCCTGCT CTGGCTCGGC CTCGTGCTGG CCGGCCTCGG CGCCGGGCTG CTCGGGCGCG GCGCGGGCTC GCGGAGGCCC GCGTGA
|
Protein sequence | MLLTPLPAAR ARAPRADARP TAWSTARPTS WPTSWPTSWP TSWPTARPTS WPTAWPAALR RALALSAPVL LGLLAWLGGL DGARAAPGGG ALLLRGPARE AAPVEAPRLR TDIAVTVSGA TARATLTQVF RNTTDQWVEG TYVFPLPEDA AVDTMTLVVG DRVIAGEIRA REAARTAYEA ARETGRAAAL TEQERPNLFT TSVANIGPGE TVLVQIAFQQ PVRLSGGTHA LRLPLVVAPR YSPAPGLLQP AAEGPARDPV PDRARIAPPV LDPAVHGPVN PVTLTVTLRA GFPLGTVESA THAIRVEETG PDSRRVTLAD GPVPADRDFA LTWRAAPSAA PAVGLFRERV GEDEYLLAVV TPPEGRAPAR RPREVTFVID NSGSMAGASM RQAKASLLVA LDRLGPADRF NVIRFDDTMD LLFPAPVPAD EAHRDAARRF VAALEARGGT EMLPPLRAAL ADPHPEEGDR VRQIVFLTDG AIGNEEQIFS AISAGRGRSR LFMIGIGSAP NGHLMTHAAE LGGGSYTAIG TIDQVAERTA ELLAKLESPV VTDLAAAFSE PGVEATPRLL PDLYRGEPVV LAARLREATG TLTLRGRIGE APWQQVLTLA EAREGSGISK LWARAKIGEA ETARLTGRMS AEAADAAILR LALAHRLTTR LTSLVALDVT PRRPPGVALT AADLPLNLPA GWDFSALFGG EGRMPRARRA EAPVPRAAQE GRGVDLPQTG TDAPALLWLG LVLAGLGAGL LGRGAGSRRP A
|
| |