Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3534 |
Symbol | |
ID | 5834859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3914419 |
End bp | 3918735 |
Gene Length | 4317 bp |
Protein Length | 1438 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369331 |
Product | hypothetical protein |
Protein accession | YP_001640988 |
Protein GI | 163852945 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATCG CCGTCCGGCA TAACCTCCAG ATCTTCGACC CGGCGGACCC TTCGCTTTGC GAGAGCAGCG CTATCGTGCT GCCGATCGCC GAGGCGCAGG CTGCCATGGG CGAGACCGTC GTCGCCTACC TCGTGCGCGT CGCGTGGCGG TTCGACCTGC CGACTGTCTG CCGGATCAAC GGTGAGTTCT ACGCTCGGGC CGATTGGGAG ACACGCGCGC TCGGCGTCAA CGAAAACGTC GAGTTCGTCA GCCGGCCGCT TGGTGGCGGA TCGAGCGGCG GCTCCACCGG TAAGAGCATC CTGTCCGTCG TCGCCCTTGT CGCGCTCACC GCGGTGGCGC CCTACGCCGT CGGCGCCATC GCCGGCGCAC TGGGCACCGC GGCGCTCGGC ACGGCCTCCG CCCTCACCTT TGCCGGCAAG CTCGCAGCGG CGGTCATCGT CGGCGCGGGC GCGCTCGCCG TCTCGCACTT CCTGAGCCCG AAGTCCGGCG GCAAGACCAA CAGCAGCGAC GCGCTCTATT CGTTCGGCCT TCAGGGCAAC GCGGCCCGGC CGATGCAGCC GATCCCGGTG CTGAACGGTC GCCGCAAGTT CGCCCCCGAT TACGCCGCGC CGACCTACAG CGAGTATGCC GGCGACTCGA TGACCGACTA CGCGCTGTAT GCGCTGACGT GCGGCCGGAT GCAGGTGGAG CAGGTGCTGA TCGGCGACAC CCCGATCTGG CACTATGAGA GCGGCCCGAG CCCCGATTAT CCCGGCATCG AGCTTCAGAT CGTCGAGCCG GGCGAGCAGG TCACGCTGTA TCCGGTCAAC GTGGTCACGG CCGACGAGTC GAGCGGCGCC GAGCTGCAGC AGACCTACAC GCCCGGCTAC ATCGTCAACG CCGCCGGCAC GCGGGCCAAG GAGCTGATCT TCGACCTCGT CTGGCCGGGC GGCGCCTACG TCACCTTCAA GGACCGGACG CTCGCCGCAA CCACGCACGT CGAGATCCAG GCGCGGCAGG TCGATGATGC CGGCGCGCCG ATCACCGGGT GGACGACGAT CGTCAGCGAT CCCTACGTCC AGGCGAAGCA GAGCCAGATC CGGATGACCG TCCGCCGGAT CGTAGAACTG GCCCGGTACG AGGTCCGCGG GCGCCGGGTG AACCCGAGCA TCAACGACAG CGGTATCGAG AAGATCGGCG GCACCGACGA TGTGACGTGG ACGGCCGCTC GCGCGCACAT CGAGGGGCCG CAGGCCTTCC CGCGCGTCAC GACGCTCGCG GTGAAGGGCG TTGCCTCGAA GCAACTCTCC GGCGTCTCGG GCGGGCAGCT GCGCGTCGTC GGCACCCGCA TCCTGCCGGT CTGGCGTGAC GGACAGTTCG TGGAGGAGCC GACGCGCTCC ATCGCCTGGG CCGCCCTCGA TTGGTGGCGC AACAGCGACT ACGCCGCGGG CCTCAGCATC TCCGACATCG ACTTTCAGAG CTTCGTGCGC CACGCCACGC TCTGGGATGC GCTGGGCCAC ACCTTCGACC ATCGCTTCAC CGAGGTGCAG AACCTCGACG ACGTGCTCGA GACGGTGCTG AAGGCCGGCC GCGCCTTCCC GGCCCCGGTA GGCGACAAGC TCACCATCAC GCGGGACGAG CCGCGCGTGC TGCCGCGCAT GCTCTTCACC GACAACGACA TCCTGCGCGA CACGCTGGAG ATCGACTACG CGCTCTCGGA CGAGGCCTGG GCCGACGGCA TGGTCGGCGA GTACGTCGAC GAGACCACCT GGCGCCTCGC CGAGGTCTCG TCCGCGCCCG ACGGCGTGAC CCTGCTGAAG CCAGCCCGCG TCCAGCTGGA AGGCGTGGTC AACCGCAAAC AGGCCGCCGG CATGGTCCGG ATGATGGCCG CCGAGAGCCA GTACCGCCGC ATCACCGTGT CTTGGACCGC CCGCATGGAA GGGCGGCTGC TGAAGCGTGG CGACCTCGTG CGGATCACCA CCGAGGAGCC CGAGACCTGG GGGCAGTCCT GCGAGGTCGT TGGCTTCACT CCGGCCACGC GACGCCTGAC GCTCGACCCA GCGCCGAGAT GGGAGGCGAG TGGCAACCAC TACGTCGAGA TCCGCCGCCG CGACGGCAGC CCGTGGGGGC CGGTGCAGGT CACCCGCGGT GCCAACGACG CGCAGGCGAT TGTCAGCGTG TCCGAGGTGA GCGGCGTGAC GCTGGCGGAT GCGGTGGGCC GCTCCAGCAC GCAGGAGCCG GCATGGCTCG CCTTCTCGCC CGGTCAGCCC CGCAGCTTCC CGGTGCTCAT CACCGACGGC GACCCGGATC AGGACGGCGA GCACATCCAC CTGTCGGGTG TCATGGACGC CGCCGAGGTC TACGCGACCA CCGAGGACGG CGTCCCACCG CTCCTGCAGA TCCCCGACCT GTTCTCGTCG GCGCTGCCGG TCATCGCCGC GCTGATGGCG AACATCGCCC AGCGACAGGC CTCGCTGATT CTGACGGCCG GCTGGCAGCC CGCGAAGAAC GCGGTGAGCT ACGAGGCTCA GGTCTCCTAC GACGGTGGCG GGTCATGGAT CGGCGCCTAC GAGGGCAACC GCACCACGTT CGAAGCCGTG GTCGGCGGGT CCGACCAGAT GCGCGTCCGA GTCCGCGGCG TCACGCCGGC CGGACCCCGT GGTGCTTGGT CGGTGGTGGA GGTTGAGGCG CCAACCCTGG CAATCAGCGG GGATCTGTTC GCGGACCTCT CCATCACCAT CGACAAGCTG TCGGCCGAGA CGAAGAAGGC GATCGAGGAC CTCAACGCAC TCGGAGCGGG CGCGCTGGCT ATGGGCAGCG ACGCCACTGA ACTCGCCCTC AGCATGGCGC GCGACACCAT CCGCGGGGCG CAGCGGCAGC TTGCCGCCTA CATTGAGCAG CTGGCCGAGG CGCAGGCGAC ACAGGAGTTC TATTCCTACG AGCAGCGCCA GCTGATCAAG GTCGGCAACA GCGCGAACTA TGCCGCGATC CTGCGCACCG AGCGCGCCAT GGTCGAGGCG GACGAAGCGC TGGCTGAGAT TACAACGGCC TTGGACGTTC GCCTCGTCGA TGCACAGACC GCTCTCGTCG GTCAGGCTAC CGCCCTGGAT CAACTGAACG CGCGGGTAAC CGTCAACGAG GAGGGCATCA CGGCGACCGC TCAGCGCGCG ACCGCGCTGG AGAGCACCGT CAACAATCCC ACGAACGGCG TCTCAGCTAC CGCTACGGCG GTGGACACCC TGAAGACCAC CGTCACCACA CAGGGCCAGA ACATCACGGC TACCGCCAAC CGGACAACGG CGCTGGAAAG CACGGTCAAC AATCCGACGA CAGGCGTGAC CGCCACGGCG ACCACGCTCG ACACCCTGAA GACCACGGTG ACCCAGCAGG GTGGCACGCT TACCACCACG GCCGAGCGGA CAACGGCGCT TGAGAGCACG GTCAACAACG GCACAACGGG GGTGGCCGCC ACGGCTACTG CGCTGAACAG CCTGACGACG ACGGTTACGC AGCAAGGCGG CACGATCAGC AGCACGTCGC AGCAGCTCAC CAGCCTTGCT AACGAGGTGC GCAACCCGAC GACGGGTCTC AGCGCTACAG CGCAGGCCGT CGACGTGCTG GAAACACAAG TCAACGACGG CACGAATGGT CTGAGCGCTG TTGCGCAGCG GGTCTCGTCG CTGAATACGA CCGTGGGGAA CAACAGCGCC AGCATTACGA GCCTGTTCGG CTCTTACGAC GGCGTGAAGG TCCAGTTCGG CGTCACGGGC ACCATCGACG GCCAGACCGG CGGCTTTGTC CTGAAAGGCA TCAGGAAACT CGATGGCTCG GTCAGCTACT CTATGCTGAT CGACGCGGAT GTGTTCGCTC GATCGATCAC TTCCCCGCTC TTGCAGACGA CAAAGCTGAT CGCGACCTCA GCGCAGATCG GAAATCTCAT CGTCGACAAC ATCCACGTCA AGGACGGCGG CATTTCATCG ATGGTGTCCA ACTTCGCGGA CGCCAAAAAG GTCAGCGCAA GCATCACAGT CCGCACGGGC GGAAAAATCA GGATCTCGAT GAGCCGATCC GGCAATGCCG GTGTCCGCTA TGCGCCTGCG CTTGGGTACA CGTCGGGCAA CTTCCAACTC CGCCGCAATG GGTTCCTCGT CTATGAGGCC CCCGCGGCAG TGTCGTTCCA GTGGGATCCG CAGGCCGGCG GTAATTATCT GGTGGTGATC GCCAGCACAG CGATGGTTGA CGACACGCCG GGCCCAGGCA CGCACACCTA CGAGCTAACC GATACCAACA ACACGCCAGT GGCTGGGGTC TATATCTCGG TTCAGGAGAG CAAGTGA
|
Protein sequence | MHIAVRHNLQ IFDPADPSLC ESSAIVLPIA EAQAAMGETV VAYLVRVAWR FDLPTVCRIN GEFYARADWE TRALGVNENV EFVSRPLGGG SSGGSTGKSI LSVVALVALT AVAPYAVGAI AGALGTAALG TASALTFAGK LAAAVIVGAG ALAVSHFLSP KSGGKTNSSD ALYSFGLQGN AARPMQPIPV LNGRRKFAPD YAAPTYSEYA GDSMTDYALY ALTCGRMQVE QVLIGDTPIW HYESGPSPDY PGIELQIVEP GEQVTLYPVN VVTADESSGA ELQQTYTPGY IVNAAGTRAK ELIFDLVWPG GAYVTFKDRT LAATTHVEIQ ARQVDDAGAP ITGWTTIVSD PYVQAKQSQI RMTVRRIVEL ARYEVRGRRV NPSINDSGIE KIGGTDDVTW TAARAHIEGP QAFPRVTTLA VKGVASKQLS GVSGGQLRVV GTRILPVWRD GQFVEEPTRS IAWAALDWWR NSDYAAGLSI SDIDFQSFVR HATLWDALGH TFDHRFTEVQ NLDDVLETVL KAGRAFPAPV GDKLTITRDE PRVLPRMLFT DNDILRDTLE IDYALSDEAW ADGMVGEYVD ETTWRLAEVS SAPDGVTLLK PARVQLEGVV NRKQAAGMVR MMAAESQYRR ITVSWTARME GRLLKRGDLV RITTEEPETW GQSCEVVGFT PATRRLTLDP APRWEASGNH YVEIRRRDGS PWGPVQVTRG ANDAQAIVSV SEVSGVTLAD AVGRSSTQEP AWLAFSPGQP RSFPVLITDG DPDQDGEHIH LSGVMDAAEV YATTEDGVPP LLQIPDLFSS ALPVIAALMA NIAQRQASLI LTAGWQPAKN AVSYEAQVSY DGGGSWIGAY EGNRTTFEAV VGGSDQMRVR VRGVTPAGPR GAWSVVEVEA PTLAISGDLF ADLSITIDKL SAETKKAIED LNALGAGALA MGSDATELAL SMARDTIRGA QRQLAAYIEQ LAEAQATQEF YSYEQRQLIK VGNSANYAAI LRTERAMVEA DEALAEITTA LDVRLVDAQT ALVGQATALD QLNARVTVNE EGITATAQRA TALESTVNNP TNGVSATATA VDTLKTTVTT QGQNITATAN RTTALESTVN NPTTGVTATA TTLDTLKTTV TQQGGTLTTT AERTTALEST VNNGTTGVAA TATALNSLTT TVTQQGGTIS STSQQLTSLA NEVRNPTTGL SATAQAVDVL ETQVNDGTNG LSAVAQRVSS LNTTVGNNSA SITSLFGSYD GVKVQFGVTG TIDGQTGGFV LKGIRKLDGS VSYSMLIDAD VFARSITSPL LQTTKLIATS AQIGNLIVDN IHVKDGGISS MVSNFADAKK VSASITVRTG GKIRISMSRS GNAGVRYAPA LGYTSGNFQL RRNGFLVYEA PAAVSFQWDP QAGGNYLVVI ASTAMVDDTP GPGTHTYELT DTNNTPVAGV YISVQESK
|
| |