Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0531 |
Symbol | |
ID | 5832465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 580688 |
End bp | 581725 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641366308 |
Product | hypothetical protein |
Protein accession | YP_001638017 |
Protein GI | 163849974 |
COG category | [C] Energy production and conversion |
COG ID | [COG4313] Protein involved in meta-pathway of phenol degradation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACT TCGGCTATTC CAGGCGCGTC CTGGCATCAG CGACCCTTGC GGCGGCGTGC ATGCTTTTCC CAGGGCAAGC CGAAGCGGCC GAGTCCGCCC TCGGCGTCTA TGTGTTGGGC AACTGCGGTC CTATGGCGGG TGTGACCCCG CCGCCCGGCT TCTACTTCGA GAACGAGACG TACTTCTACC AGGGCAACCT CGGGGGCAAT CGAACCTTCC AGAGCGGCGG CGTGGTGGCC GCGAACGTGA AGCTCGACAT CAGCGCGACC TTCATGACGC CGGTGTGGGT CACGCCCGTC GAGGTCCTGG GTGGAAACCT CGGTTTCTCC ATCACCATCC CGTTCGGGAC GCCCAACGTC AGCGCCGGCG CCGTCCTCTC GTCGCCCCGG ATCGACCGGA TCGTCTCAGG CCGGGAACGC GACGCGATCT TCAACGTCGG CGACATCTAC CTCGCCTCAT TCGTCGGCTG GCACTCCGGC AACCTCCACT GGAGCACGAC GGTCCTCGGG GTCGTCCCAT CCGGCTCCTA CGAGACCGGG CAACTCTCCA ACATCTCCCT CAACCGCCCG GCGCTCGACC TCAGCGCGGC GCTCACCTAT CTTGATCCGG TGCTCGGCTA CGAGCTTTCG GTGGTGCCCG GCGTCACGTT CAACTGGATC AACCCGGCGA CGCAGTATCT CACCGGCACC GAGTTCCACC TGGAATGGTC GGCCTCGAAG TTCCTGAGCA AAGAATTCTC AATCGGCATG GTCGGATATT TCTACGACCA GCTCACGGGT GACAGCGGCC GCGGCGACAG GATCGGGCCC TTCAAGGGAC GGGTCACGGC GCTGGGCGGG CAGATCGGCT ACACGTTCAA GGTGGGCGAG ATCCCGGTCT CCACGAACGT GAGGGTGCTG CGCGAGTTCA ACACCACCAA CCGGTTCGAG GGGACCGCGA CCTATCTGAC GATCACCGCT CCGCTTTGGG TGGCGCCCGG CGCCGCGGTC GCCGAGGCCA AGCCTGCCAT GACGAAACCG GTCATCAGGA AGTTCTGA
|
Protein sequence | MLDFGYSRRV LASATLAAAC MLFPGQAEAA ESALGVYVLG NCGPMAGVTP PPGFYFENET YFYQGNLGGN RTFQSGGVVA ANVKLDISAT FMTPVWVTPV EVLGGNLGFS ITIPFGTPNV SAGAVLSSPR IDRIVSGRER DAIFNVGDIY LASFVGWHSG NLHWSTTVLG VVPSGSYETG QLSNISLNRP ALDLSAALTY LDPVLGYELS VVPGVTFNWI NPATQYLTGT EFHLEWSASK FLSKEFSIGM VGYFYDQLTG DSGRGDRIGP FKGRVTALGG QIGYTFKVGE IPVSTNVRVL REFNTTNRFE GTATYLTITA PLWVAPGAAV AEAKPAMTKP VIRKF
|
| |