Gene Information |
Locus tag | Mext_1271 |
Symbol | |
ID | 5833650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1409561 |
End bp | 1410595 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367064 |
Product | SMP-30/gluconolactonase/LRE domain-containing protein |
Protein accession | YP_001638744 |
Protein GI | 163850701 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
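The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1409561
end_bp = 1410595

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1035
print(protein_length)  # 344
```

Both results agree with the listed Gene Length (1035 bp) and Protein Length (344 aa).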
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0150049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGATA GGGCGGATGA AAATTTCGCC CCAGACGTAG CACAGGCAAG CTCAGATCAT GGGCTAAGTC TTCGTAAAAC TCGCGGAACT TCTCTTTTCG ACAGACGTCA ACGGTCGTTG ATTTCAGAGG AATGTCCCGA TTTGGCCGTA TCTCCGTCGA TCCGCGTCCT CGATCCGTCC CGCTGCCATC TCGGCGAGGG GCCCAGCTAC GATCCGGCGA CCGACACGGC GTGGTGGGTC GATATCCTAG AGAACCGCCT GTTCGAATTG CCCCTGAGCG ACGGCGCCGG CCCGGCGAAG CTGCACGCCT TGCCGTTCAT GGCGAGCGAC GTCGCCGCCA TCGACGCCGA GCGGCAACTC CTCTCGGCGG AGGACGGGCT CTACATCCGC ACGATCCGCG ACGGGCAACT GAGCCTGTTC TGCCCGCTGG AAGCGGAGGA TGCCGGCACC CGCTCCAATG ACGGCCGGGT CCATCCGAGC GGCGCGCTCT GGATCAGCAC CATGGGCCGC GATGCCGAGA CCGGACGCGG CGCGATCTAC CACGTTGCCG GCACGCGGGT GACGCAGCTG TTCTCCGGCC TCTCGATCCC CAACGGGATC GCCTTCTCGC CGGACGGCGC GACCGGCTAC TTCGTCGATA CCGACGAGGG TATCCTGCGC CGCGTCACGC TCGACCCCGC CACCGGCCTG CCCGCGAGCG CGCCCGAGAC CCATTACGAT CACAGCGACG GAGAGGGCGG GATCGACGGC GCGGCGGTGG ACGCCGAGGG TCTGATCTGG ACCGCGCGCT TCGGCGGCGC CTGCCTCGAC GCCTACAGCC CGGCGGGCGA GCGGGTGCGC ACCGTGTCCG TTCCGGCGCG CCAGCCCACC TGCCCGACCT TCGCCGGCCG CGCCCTCGAC CGGCTCCTCC TGACCACGGC CTACGAGGGC ATGGACGAGG CCGCGCGGGC GGAGGACCCC GAGCACGGGC GCACCCTGCT CGTGGATATC GGCGTCCGCG GCCTGCTGGA GCCGGCCTTC CGCCTCGGCG CCTGA
|
Protein sequence | MHDRADENFA PDVAQASSDH GLSLRKTRGT SLFDRRQRSL ISEECPDLAV SPSIRVLDPS RCHLGEGPSY DPATDTAWWV DILENRLFEL PLSDGAGPAK LHALPFMASD VAAIDAERQL LSAEDGLYIR TIRDGQLSLF CPLEAEDAGT RSNDGRVHPS GALWISTMGR DAETGRGAIY HVAGTRVTQL FSGLSIPNGI AFSPDGATGY FVDTDEGILR RVTLDPATGL PASAPETHYD HSDGEGGIDG AAVDAEGLIW TARFGGACLD AYSPAGERVR TVSVPARQPT CPTFAGRALD RLLLTTAYEG MDEAARAEDP EHGRTLLVDI GVRGLLEPAF RLGA
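The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It assumes translation table 11 as listed in this record; table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the compact standard codon table is used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene begins with ATG. Only the first 30 bases of the gene are used, to keep the example short.

```python
from itertools import product

# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
gene_prefix = "ATGCACGATA GGGCGGATGA AAATTTCGCC"
print(translate(gene_prefix))  # MHDRADENFA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be checked in the same way.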