Gene Information |
Locus tag | Mext_0150 |
Symbol | |
ID | 5833768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 163251 |
End bp | 166871 |
Gene Length | 3621 bp |
Protein Length | 1206 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641365933 |
Product | outer membrane adhesin like protein |
Protein accession | YP_001637647 |
Protein GI | 163849604 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
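The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Mext_0150.
length = gene_length_bp(163251, 166871)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions the result matches the Gene Length (3621 bp) and Protein Length (1206 aa) rows.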
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTG TGAGCAATGG CGCAGCGACG TCCGTTTCCA ACACGCCGCA GGCCAAGGAC GACTACGTCA CTACGGCTGA AGACGTAGGC ACAAAAATTA ACGTCCTCGC GAACGACTCC GGCGGTACGG CCAAATCCCT CTACTCCATC AGCCAGGGCG ACCCGCTGGC TGTCGCTAAG ACCGCCACAA GCAGTCTGGG TGCCACCATT TACATCACCA GCGACGGCCA GGTTTACTAT GACCCGGCTA GCGCAACGCG AATCCAGGCG CTGGCGCAAG GCGAGACAGC GGTCGACACC TTCGTCTACG AGGAGCGACT CGGGAACGGC ACGATCAGCA CTGCAACCGT GTACGTGACC GTCAAGGGCA CCAATGATGG GCCAATACTG GCGGCTGATA CAGCCACGCA CGCAATGACG GAGATCGCGG GTGCGACATC TGGCTCGGGT ACCGAGACAG CTTCCACTAC GCTGACATTC ACTGATAGAG ACCTGAGCGA CACACACACA GTGAGCTATG GTGCTCCGAG TGTCACCTGG ACGGGCGGAG CCCACATCCC GGCCGATACG GCAGCTGCCC TGGCAAGTGC GCTGACGCTC ACCAAAATGG ATTCGACCGG TAGTGGCACC GGCTCAGTCA AGGTCGACTT TGCCCTAGCC GATAAGTTGG CTGATTTTCT TGGGGTACAC GAGACATTGA CGGTGACGTA TCAAATCACT GTCAGAGATA GCCAAGGAGC CAGCTCAGTT CAGCCAGTGA CTCTCACGCT GACTGGCACC AATGACGACG CGTTGATCAC GGCCGCTACG GCTGGCTCCG ACAGAGGGAC CGTGACCGAA GATGGCAATG TGGCTGCTGA GGGCGTGCTG TCGTTCACGG ATGCAGATCT GAACGATGCG CATACAGTCA GCGTCATGCC GTCAGGGGCC GCGCTAGGCA CACTCACGGT CAACAAGACG GCTGATTTGA ACGGCGTCGG CTCGGTCAGT TGGAGCTACA CTGTCGATAG CACCAAGGTC CAATACCTGG CCGAGGGGGA GACCAAAGTC GAGTCTTTCC AGATCCTGTT GAGCGATGGT ACCTCGACGG TAAGCAAAAC CGTTTCAATC ACGATTACCG GCACGAACGA TGCGCCGGTG GTCACCCCAG CGAGCGTGGG CGACAGTGCC GGAACGGCAA CACCGCTGCT CGAAACCGAC GCAGGTCTGG CGACGTCCGG CACTCTCACG GTCATAGATC TCGACGTGGT GGATCAGGTC TCGGTGGCTG TCTCAGGCGT GACCCATTCC GGGCCGACCG GCAGCCTGAC AAATGCAGAG CTCCTGAGCT ACTTCCATAT CACACCAGGA ACCATTGTCG ACGGCAGACA TACAAGCGGT CAATTCACCT GGACGTTCAA CTCCGGTGCG CAGGCTTTCG ATTTCCTCGC TCAAAGCGAA GTGCTCGAGC TGCAATACCG CATCACTCCT GATGATGGGC ATGCTCCCAC CGGCACAGGC GACGGTGTGG TGACGATCAG GATCCAGGGC ACCAACGATG CGGCTGTGAT CGGCGGCGCT GCGGCCGGTG CAGTCACCGA GGACGCGGTC ACCACGACCT CAGGCCAGCT GACCATCACG GACGTAGATA CCGGCGAGGC TCACTTCCAG GCCATCACCG CCGGGGCTCT GACCAAGACC TACGGCTCGT TCACGTTCAA CGAGACCACG GGTGCGTGGA GCTATGCGCT CGACCACGCC AAAGCCGACA GCCTGGCCAA GGACCAGGTG GTGCACGACA CCCTGAACGT GACCTCGTTC 
GACGGCACCG ACACCCAGCT CATCGACGTC ACCATCACCG GCACCAATGA TGCGGCGACG ATCGTGGCCA GCGGTGGCGA CCACGGCAAC GTCACTGAGG ACACACCCGG CCAGGGCCTC ACACAGGGTT CGCTGAGCGT GCAGGACGTG GATGCGGGCG AGAACCGCTT CCAAGCTGTG GCGGCTGGCG CCCTTGAGGG CCAGTACGGC AGCTTCACGT TCGACGCGGA CACCGGCGCT TGGAAGTACA CGCTCGACCA TGCCAAGGCG GACAAGCTGA CCGGCACGGA CGTGAAGCAC GACACCCTGA CGGTAACCTC GTTCGACGGC ACCGACACGC ACGTCATCGA TGCGACCGTC TACGGCACCA ACGATGCGGC TGTGATCGGC GGCGCTGCGG CCGGTGCAGT CACCGAGGAC GCGGTCACCA CGACCTCAGG TCAGCTGACC ATCACGGACG TAGATACCGG CGAGGCTCAC TTCCAGGCCA TCACCGCCGG GGCTCTGACC AAGACCTACG GCTCGTTCAC GTTCAACGAG ACCACGGGTG CGTGGAGCTA TGCGCTCGAC CACGCCAAAG CCGACAGCCT GGCCAAGGAC CAGGTGGTGC ACGACACCCT GAACGTGACC TCGTTCGACG GCACCGACAC CCAGCTCATC GACGTCACCA TCACCGGCAC CAATGATGCG GCGACGATCG TGGCCAGCGG TGGCGACCAC GGCAACGTCA CTGAGGACAC ACCCGGCCAG GGCCTCACAC AGGGTTCGCT GAGCGTGCAG GACGTGGATG CGGGCGAGAA CCGCTTCCAA GCTGTGGCGG CTGGCGCCCT TGAGGGCCAG TACGGCAGCT TCACGTTCGA CGCGGACACC GGCGCTTGGA AGTACACGCT CGACCATGCC AAGGCGGACA AGCTGACCGG CACGGACGTG AAGCACGACA CCCTGACGGT AACCTCGTTC GACGGCACCG ACACGCACGT CATCGATGCG ACCGTCTACG GCACCAACGA TGCGGCTGTG ATCGGCGGCG CTGCGGCCGG TGCAGTCACC GAGGACGCGG TCACCACGAC CTCAGGTCAG CTGACCATCA CGGACGTAGA TACCGGCGAG GCTCACTTCC AGGCCATCAC CGCCGGGGCT CTGACCAAGA CCTACGGCTC GTTCACGTTC AACGAGACCA CGGGTGCGTG GAGCTATGCG CTCGACCACG CCAAAGCCGA CAGCCTGGCC AAGGACCAGG TGGTGCACGA CACCCTGAAC GTGACCTCGT TCGACGGCAC CGACACCCAG CTCATCGACG TCACCATCAC CGGCACCAAT GATGGATTGA CCCCAAATCT CGTGCTCAAT TTTGACAATA TTTCTAGCGG CCCCGTTCCA GACGGATATG GTGGTCTGAA TTGGAACAGC AATGGGTCAT TCTATAGCGG CACGGCAGAC ATCTTGAATA CTGGATACGG GTACGGGAAT GGCGACAACG ACTCGTTCGT CTTTAACGGG TGGGCAGGAA AATACAGCAG CATAACGAAA ACAAATGGAG GAACTTTCAG CGTCTCTGGG CTAGACATCG CAGATTCCAC CTACGCTCAT TTTAGCGATA CCCCCAATGA TGCTAACACT GTACAATTTG TAGGAATGAA AAACGGCGCT CAGACATATT CGAAGCTCGT CTCGTTAAGC AACGACCATT TCGACCACGT AGACCTAGAT TTCTCTGGTA TAGATCAATT CCAGATCAAC GTAGTTGGAG GCCAGCAAAG CGGTTTAGGT GTTAGCAATA CTGGCTGGTG GGCTATCGAC AACCTAGGGC 
TCATTATATG A
|
Protein sequence | MATVSNGAAT SVSNTPQAKD DYVTTAEDVG TKINVLANDS GGTAKSLYSI SQGDPLAVAK TATSSLGATI YITSDGQVYY DPASATRIQA LAQGETAVDT FVYEERLGNG TISTATVYVT VKGTNDGPIL AADTATHAMT EIAGATSGSG TETASTTLTF TDRDLSDTHT VSYGAPSVTW TGGAHIPADT AAALASALTL TKMDSTGSGT GSVKVDFALA DKLADFLGVH ETLTVTYQIT VRDSQGASSV QPVTLTLTGT NDDALITAAT AGSDRGTVTE DGNVAAEGVL SFTDADLNDA HTVSVMPSGA ALGTLTVNKT ADLNGVGSVS WSYTVDSTKV QYLAEGETKV ESFQILLSDG TSTVSKTVSI TITGTNDAPV VTPASVGDSA GTATPLLETD AGLATSGTLT VIDLDVVDQV SVAVSGVTHS GPTGSLTNAE LLSYFHITPG TIVDGRHTSG QFTWTFNSGA QAFDFLAQSE VLELQYRITP DDGHAPTGTG DGVVTIRIQG TNDAAVIGGA AAGAVTEDAV TTTSGQLTIT DVDTGEAHFQ AITAGALTKT YGSFTFNETT GAWSYALDHA KADSLAKDQV VHDTLNVTSF DGTDTQLIDV TITGTNDAAT IVASGGDHGN VTEDTPGQGL TQGSLSVQDV DAGENRFQAV AAGALEGQYG SFTFDADTGA WKYTLDHAKA DKLTGTDVKH DTLTVTSFDG TDTHVIDATV YGTNDAAVIG GAAAGAVTED AVTTTSGQLT ITDVDTGEAH FQAITAGALT KTYGSFTFNE TTGAWSYALD HAKADSLAKD QVVHDTLNVT SFDGTDTQLI DVTITGTNDA ATIVASGGDH GNVTEDTPGQ GLTQGSLSVQ DVDAGENRFQ AVAAGALEGQ YGSFTFDADT GAWKYTLDHA KADKLTGTDV KHDTLTVTSF DGTDTHVIDA TVYGTNDAAV IGGAAAGAVT EDAVTTTSGQ LTITDVDTGE AHFQAITAGA LTKTYGSFTF NETTGAWSYA LDHAKADSLA KDQVVHDTLN VTSFDGTDTQ LIDVTITGTN DGLTPNLVLN FDNISSGPVP DGYGGLNWNS NGSFYSGTAD ILNTGYGYGN GDNDSFVFNG WAGKYSSITK TNGGTFSVSG LDIADSTYAH FSDTPNDANT VQFVGMKNGA QTYSKLVSLS NDHFDHVDLD FSGIDQFQIN VVGGQQSGLG VSNTGWWAID NLGLII
|
| |
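The GC content row can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is supplied as a string in the same space-separated blocks of ten bases and contains only A, C, G and T. The helper name gc_fraction is hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the Mext_0150 gene sequence, used as a small example.
prefix = "ATGGCTACTG TGAGCAATGG"
print(f"GC fraction of prefix: {gc_fraction(prefix):.2f}")
```

Passing the full gene sequence instead of the prefix would give the value to compare against the GC content field.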