Gene Information |
Locus tag | Mext_1537 |
Symbol | |
ID | 5831309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1717142 |
End bp | 1719868 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367335 |
Product | hypothetical protein |
Protein accession | YP_001639007 |
Protein GI | 163850964 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
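The coordinate and length fields in this record agree with each other, and the arithmetic can be sketched as follows. The function names are illustrative and are not part of any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1717142, 1719868)
print(length)                  # 2727
print(protein_length(length))  # 908
```

Both values match the Gene Length (2727 bp) and Protein Length (908 aa) fields listed above.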
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.161966 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGAGG CCGAACGGAC GAATTCGAGC GGCGGCGATC CACAGGGCGC GACGCGCCAT CGCCTCGACC GGCTGGTGGC GCAGGCGCGG GCCGCGGGTT TGTGGGAGCG GGCCTGGCCG GTGCTGTGGC GCGGACTCGG CGTCGTCCTC GCCTTCCTCG CCGCCTCCTG GCTCGGCCTG TGGCTCGATT TGTCCCAGGT TGGGCGCATG GTCGGGCTCG GGCTCTTCGC CGCCCTGTTC GTCGCCGCTT TGTGGCCGGC GGTTCGGCTC CGCGCCCTGA GCCGGCGCGA GGCGCTCGCC CGGATCGACC GCGAGGCCGC CCGCAAGGGT GGGACCGCGC ATGATCCAGC CTCTTCGATC GAGGACACGC TGGCGGTCGG CCAGAGCGAT CCGGTCACCC GCGCCCTGTG GGCCCTGCAT CAGTCGCGGG CCGCTGCCGC CGTCGCGCGA CTGAAGGCCG GGCGTCCGCG CCCGCACATG CCCGGCCACG ATCCGCTGGC GCTGCGCGCG GGCGTCCTCG TCGCGGCGCT GGCCGCCCTG TTCGTCGCCG GCCCCGAATG GCGCGGGCGC GTCGCCGCCG CCTTCGACTG GCGCGAGCCC CAGGCCGCCG CGCCGAGCTT CCGCGTCGAT GGCTGGATCG ACCCGCCGAT CTACACCCGC GTGCCGCCGC TGATCGTGAC GATGTCGGGC ACGCCGAAGG ACTCGGTGCA GCGCCTGCGC GCGCCGGTCA ATTCCACCCT GATCGTGCGC ATCGCCGGCC AGGGCGAGGC GGAGCTGACC CCGAACGCCG CCCTTGTGCC GGTGGCCAAG GACGAGAAGG ACACCGCCGG CCGGCCCGCT CCGCGGGGCG TCGTGCAGAA CGGCGCGAAG GCCAACGAGA CCCGCGCGAC CCTGCGCGAG GAGCGCTTCC GGCTCGCCGG CGGCACCGCC GAACTCGGCA TCGCCGCCTC CGGCTCGGAG CCGCAACGCC TCGTGATCGA GACGATCCCC GACCAGCCGC CCGAGGTCAG CCGGGTCGGC GACCTCGAAG TGAACGGGCG CGGCACCTTC AATCTCAGCT ACCGCGCCAA GGACGATTAC GGCATCGCCG CCGCCGAAGG GCTGGTCGAG CCGCTGAAAG CCGGGCGCTC GCTGGTGCCG GTGCCCAAGA TCGCGCTGGC GCTCCCCGCC GACGCGACGG GTGAGACCGA CACCAAGACG CTGGTCGATC TCACCGACAA TCCCTGGTCC GGCGCCCGGG TGAAGCTCAC CCTCGTGGTC CGCGACGAGG CCGGGCAGGA GGGCCGCACC GAGACCGCCG AGATCGTGCT GCCGGCCCGG CCCTTCAGCC AGCCGCTCGC CCGCGCGCTG GCCGAGGAGC GCCGCCGCCT CGTCACCGCG CCCGACGCGG ACCGCGCCCG CGTCCAGACC GCGCTCGACG CCCTGCGGAT CGCGCCGGAG CGCTTCACGC CGCAGCCGGG GATCTTCCTC GGCCTCACCA CCGCCGCCAA CCGCCTGCGG GCGGCGAAGT CGGACGAGGA TCTCACCAAC GTCGCCGACC TCCTGTGGGA GATGGCCCTC AAGATCGAGG ACGGCGATCT CTCGGATGCC GAGAAGGCCC TGCGCGCCGC CCAGGACCGC CTCAAGGAGG CGATCGAGCG CAACGCGCCG GACGAGGAGG TCAAGAAGCT CACCGAGGAT CTGAAACAGG CCCTCGACAA GTTCATGAAG GAGTTCGCCC AGCGGGCGAA GCCGCAGAAC CGGCAGCAAT CGGAGCGCCA GCAACAGCAG CAGAACGGCC AGACGGTCAC GCCCGACGAT 
CTCGAGAAGA TGATCAAGGA CATGCAGGAG GCGATGCAGC GCGGCGACAC GGCGGAAGCC CAGCGCCTGC TCGAACAGCT CCGCAATGTG CTCGAAAACC TCCAGAACGC CGAGAACGGC CAGAAGTCCG ACGGCGGCAT GGCCGAGATG AACCGGCAGC TCGACGAGCT CGACAAGATG TCCCGCGAGC AGCAGGACCT GCGCGATGAG ACCTACAAGG AAGGCCAGCA GGGCCAGCAG GGCCAGCAGC GTCCCGGCCA GCGCCAGAGG CCGCAGCCGG GCCAGCAGCA GGGGCAGCAA GGCCAGCGCG GGCAACAGCC CGGCCAGCAG GGCGAGGGCC AGGAAGGCCA ACAGGGGCAG CAGGGCCGCG GCCAGCAGGG GCAGCGCGGC CAGGGTCAGC AAGGTCAGGG CGGCGGGCAG CAGGGCCAGA ATATGGGCCA GCGCCAGCAG GGCCTGCGCG AGCAGCTCCA GGATCTCAAG AACCGGATGA AGCAGCAAGG GCTTCAAGGG GAAGAGGGGC TGTCGGATGC CGAGGAGGCC ATGCGTGAGG CCGAGGAATC GCTCGGCCAG GGCCGCAACG GCGACGCGGT GGACGCGCAG GGCCGCGCCC TCGACGGGCT GAAGCGCGGT GCCGAGGGCA TGCAGAAGCA GATGCAGCAG ATGGCCGAGG GCCAGGAAGG CGAGGGCCAG CAGGAGGGCC AGTCCCAGGG TAAGCAGGGC CGCTCGGGCT CCGCCGACGA CGACCCGCTC GGCCGTCCGA CCCGCGGACG CGACCTCTCG AACGGCAACG TGCGGGTGCC GAATGCCGAC GAATCCGCGG TGCAGCGCGC CCGCCGGATC ATGGAGGAGC TGCGGCGCAA GCTCGGCGAC CCCTCGCGCC CGCAGGAAGA ACTCGATTAT TTCGAGCGTC TGCTCCGCCG GAACTAA
|
Protein sequence | MSEAERTNSS GGDPQGATRH RLDRLVAQAR AAGLWERAWP VLWRGLGVVL AFLAASWLGL WLDLSQVGRM VGLGLFAALF VAALWPAVRL RALSRREALA RIDREAARKG GTAHDPASSI EDTLAVGQSD PVTRALWALH QSRAAAAVAR LKAGRPRPHM PGHDPLALRA GVLVAALAAL FVAGPEWRGR VAAAFDWREP QAAAPSFRVD GWIDPPIYTR VPPLIVTMSG TPKDSVQRLR APVNSTLIVR IAGQGEAELT PNAALVPVAK DEKDTAGRPA PRGVVQNGAK ANETRATLRE ERFRLAGGTA ELGIAASGSE PQRLVIETIP DQPPEVSRVG DLEVNGRGTF NLSYRAKDDY GIAAAEGLVE PLKAGRSLVP VPKIALALPA DATGETDTKT LVDLTDNPWS GARVKLTLVV RDEAGQEGRT ETAEIVLPAR PFSQPLARAL AEERRRLVTA PDADRARVQT ALDALRIAPE RFTPQPGIFL GLTTAANRLR AAKSDEDLTN VADLLWEMAL KIEDGDLSDA EKALRAAQDR LKEAIERNAP DEEVKKLTED LKQALDKFMK EFAQRAKPQN RQQSERQQQQ QNGQTVTPDD LEKMIKDMQE AMQRGDTAEA QRLLEQLRNV LENLQNAENG QKSDGGMAEM NRQLDELDKM SREQQDLRDE TYKEGQQGQQ GQQRPGQRQR PQPGQQQGQQ GQRGQQPGQQ GEGQEGQQGQ QGRGQQGQRG QGQQGQGGGQ QGQNMGQRQQ GLREQLQDLK NRMKQQGLQG EEGLSDAEEA MREAEESLGQ GRNGDAVDAQ GRALDGLKRG AEGMQKQMQQ MAEGQEGEGQ QEGQSQGKQG RSGSADDDPL GRPTRGRDLS NGNVRVPNAD ESAVQRARRI MEELRRKLGD PSRPQEELDY FERLLRRN
|
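The sequences above are printed in 10-character blocks, so the spaces and line breaks have to be removed before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation. The helper names are hypothetical, and the demo input is only the first three blocks of the gene sequence, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above, used only as a short demo input.
prefix = "ATGAGCGAGG CCGAACGGAC GAATTCGAGC"
print(len(clean_sequence(prefix)))  # 30
print(gc_fraction(prefix))          # 0.6
```

Passing the full gene sequence through `gc_fraction` would give a value to compare against the GC content field, subject to how the database rounds that figure.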