Gene Information |
Locus tag | Mext_4871 |
Symbol | |
ID | 5834251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5443094 |
End bp | 5445055 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370669 |
Product | hypothetical protein |
Protein accession | YP_001642310 |
Protein GI | 163854267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
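The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 5443094
END_BP = 5445055
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the span is 1962 bp, which is 654 codons, or 653 residues once the stop codon is excluded.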
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCGGC CACTTCGGGT AATCGTGATT TCCGCGGCCG CCATCGGCAC CGTTCTCGTC GCCGGCTCCC TGGCCCGCTT CAGTTACGCG CCGATGCCGG AGGCAGGCAC CCTTCTCAAT CCCGAACGCT CTCCGATCGA GACCGCTTTC GACGTCCTCG GCCGCCGCAT CGCACGGGAC GAGGCCGAGC GGCTGAAAGC CACACCGGAG GGACGCACCG TACTGTCGCC GGAGAGCGGC GCCGTGGCGA TCGACGACGC GCTGGTTAAG CGGGGCCGCG AGGCGTTCTA TCGCGAGACG TTCGGCAACG AGGTGTTTCT CTCCGACGTG ATGGGCATGC TCGACGGGGG GCTGACGCCC TTTGAGGTGG CGCGGGCCAT CCTCATGCTC GGCGGCGCCG GTACCACCAA CCTGAAGGTC CGCATGGCCC GCGACGTCAC CGTGGGCGAT CAGGTCTGGA AGACGGGCGA ACTCGTCCCG ACGGGCCTCG ACGTGCCGCG CGGCTCCCCA TTCATCCTCG GCATCCGTAC CTTCTACGAC CGCGGACATT TGCGTATGGG CATCACCTGC GCGCTGTGTC ACACGGCGGT CGATCCGCAA TCGGGCAAGG TGGTGGAGGG CGCGCCCAAC ACCGATCTCA ATGCCGGTCT GCTGATGGCG CTCGCCAGCA ACGCGACCGC CTACTTCATG CATGGCAGCG CCGACCCGTC CGCCCATCCG GGTGATCCGG CCCGCAGCGT CACCACCGAG GACGGTGCGA AGACCGCCCT CCCCGATCCC GCCCGGTTCG AGGCCGCGGC CAAGGTGGAG GTCGCGAGCT GGCCCGCTGG CAGCTTCGAT TCCTCCGCCG ATCGCGAAAC CAACCCGACC TCGATCCCGT CGAGTTTCTC GGCCTACGGC GAGCCCTATA GCTGGAGCGG GCGTGCCGGG ATCGGACCAT TCAAAGGGCT GTCGGCGCTG AACAACAACG TCCACGCGGC CAATTCCGAC ACCACCCAGC AGACGAAAGC CGCCAAGACG CTGTTCGGGC TCGATCCGCA GGTCTATCTC GGCACGGTTC TGCAGGGCGC GGCGGTGGCG GCCCTGCGCT ACGATCCTGC CTCGGGTAAG CGCCCGACGG ATGTGCTGGG GGCGGCCGAC CCGACGCCGG GGGCGCCGGG CCTCAACAGC TACGCCGTGC TGCCGTCGTT CCCCGCCACC AATTACATGA CCGATAACGG CCTGCTGGCC TCGGTCTCCG GGGAGCCGGC CAACTACGCC AACAACGCGA TGTCGGCTTT CCAGAACCTG TTGCGCGCGC CCGAGCCTTC CCTCGATGCC GAGCGGGTGA AGAAAGGGCG GGCGGTGTTC GAGCGCGCCG GCTGCGCCGG CTGCCATACC GGGCCGGCGC TCACCAACCA TCGGGTCATC CCCGTGGGGG AGATCGGCAC GCAGCCGAGC CGCGCGCACT CGACCGTGCG GATGGAGGCG CGCCTCGCCC CGCCGACGAT CTTTGCCACC GACACGCCCT TCCCGCTTCC GCCCGATCCC AAACTCGTGC CGATCCCCCT AGAGGGCGAC GCGCTCAAGC AGGTGCAGCT TGCCTGGGCC CATGCCGGCA CCGGGGGCGG CTACAAGGTG CCGAACCTCG TCGGCCTCGC CTGGAGCGCA CCTTATCTCC ACGATTCCGG CGTCGCGGTC GGTGCCGACG CTGACGCGCA GCTCGGCGTT CCAGGTACGC TCGATGCGGG CATCCCGCCC GATCCGGCCA ACAGCTTGCG GGCGCTGGTC GATCGCAACC TCCGCGCAAA GGTCGTTTCG 
GCGAATAAGG CTTCAGCAAA GGCACGCACC GCGCGCGTCA CGGGCGAGGG GCACGCCTAC TGGGCCGATG CCGAGGCCGG AGTTTCAGGC GAGGAGCAGG CGGATCTCGT CGCCTATCTT CTCTCGGTCA ACCGCCTGAC CGAACCTGTG CCGGTGCCAT GA
|
Protein sequence | MGRPLRVIVI SAAAIGTVLV AGSLARFSYA PMPEAGTLLN PERSPIETAF DVLGRRIARD EAERLKATPE GRTVLSPESG AVAIDDALVK RGREAFYRET FGNEVFLSDV MGMLDGGLTP FEVARAILML GGAGTTNLKV RMARDVTVGD QVWKTGELVP TGLDVPRGSP FILGIRTFYD RGHLRMGITC ALCHTAVDPQ SGKVVEGAPN TDLNAGLLMA LASNATAYFM HGSADPSAHP GDPARSVTTE DGAKTALPDP ARFEAAAKVE VASWPAGSFD SSADRETNPT SIPSSFSAYG EPYSWSGRAG IGPFKGLSAL NNNVHAANSD TTQQTKAAKT LFGLDPQVYL GTVLQGAAVA ALRYDPASGK RPTDVLGAAD PTPGAPGLNS YAVLPSFPAT NYMTDNGLLA SVSGEPANYA NNAMSAFQNL LRAPEPSLDA ERVKKGRAVF ERAGCAGCHT GPALTNHRVI PVGEIGTQPS RAHSTVRMEA RLAPPTIFAT DTPFPLPPDP KLVPIPLEGD ALKQVQLAWA HAGTGGGYKV PNLVGLAWSA PYLHDSGVAV GADADAQLGV PGTLDAGIPP DPANSLRALV DRNLRAKVVS ANKASAKART ARVTGEGHAY WADAEAGVSG EEQADLVAYL LSVNRLTEPV PVP
|
| |
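The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch (not part of the database record), the hypothetical helper `gc_fraction` below shows one way to compute a GC fraction from a sequence in this layout. It is demonstrated only on the first block of the gene sequence, not the full 1962 bp.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (spaces, line breaks) is ignored, so sequences printed in
    10-base blocks can be passed in unchanged.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence shown above.
    first_block = "ATGGGTCGGC"
    print(gc_fraction(first_block))
```

Passing the full gene sequence through the same function should give a value comparable to the GC content field in the table, assuming that field was computed over the gene itself.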