Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0023 |
Symbol | |
ID | 4785307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 24591 |
End bp | 26576 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640088570 |
Product | putative cysteine desulfurase |
Protein accession | YP_001019220 |
Protein GI | 124265216 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.493291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT CCGATGCCCT GAGCGGCCCG GTCGCGGCGG CGCAGCCCGC GTTGCCGCCC GGCCTGCCCG ATCCCGCCAC GCTCGCGCGT CTGGCGGGCG AATTCTTCGC CGCGCTGCCC GGCGACGGCG GCGCCCACGG CGGCGTGCCG GTGCCGGCAC AGCCCACCCC GCCCGGCCTG TCGCTGCCCG GTGTGCTGAC CGGCGGTCCG GCCACGCCCA ACGTGCTGCC GCTCGGGGCC GCGGCGCCCG GCGCCAACCT CGTGCCCAGC TCGCCCCAGC ACGCCGCGGC GCACGGTGCG TCGGCCCCGG CGCTGGTGCC GCATGCGGCC GCGCCGAACG GGCTGCCCGA GCACGTGGCC ACCCTCCCGC CGACGCTCGA TGGCCGCCTC GGCAGCCACG CCCTCGGCGT GCCGCAGGTC GTGCCGGCCG CATCGCCGCT GGTCGGGCTG CCGTCGCACC CGTCGGCCGG CGGCGCACCC GCGGCAACCG ACACGCCATC GCCCTACTAC TTCCTGTCGC ACGCCACGTC CGGACCGGGC GCGAGCGAGG CGCGCATCGC GCCGCTCGGC AGTGCACCGA CCGGGCTACC GCAGGAGGCC GACCTGCGCT CGCTGCTGCG CAGCGACGCG CCGTCGTCCG GCAGCGCCCC CGCGGCGTCG CCCGGCGCCT TCTACTTCCT CGACGCGCAG CGCCATCCCG GGCCGCACGG CAGCCACGCG GGCGCCGTGC CGAACGCGGC GGCCTCGGCC CATCCGCCGT TCGACGTGCA CGCGATCCGG CGCGACTTCC CGATCCTGCA GGAGCGCGTC AACGGTCGCC CGCTCGTGTG GTTCGACAAC GCCGCGACGA CGCACAAGCC GCAGTCGGTG ATCGACCGCA TCGCCCATTT CTACGCGCAC GAGAACTCCA ACATCCACCG CGCCGCGCAC GAGCTGGCGG CGCGCGCTAC CGACGCCTAC GAAGGCGCGC GCGAGACGGT GCGGCGCTTC ATCGGCGCGA GCTCGGTGGA AGAGATCGTG TTCGTGCGCG GCACCACCGA GGCCATCAAC CTGGTGGCCA AGAGCTGGGG CGCGCAGAAC ATCGGTGCCG GCGACGAGAT CGTGGTCTCG CACCTCGAGC ACCACGCCAA CATCGTGCCG TGGCAGCAGC TCGCCGCCGA GAAGGGCGCG AAGCTGCGCG TGATCCCGGT CGACGACAGC GGCCAGGTGC GGCTCGACGA ATACCGCAAG CTGCTGAACG ACCGCACGAA GATCGTCTCG GTGACGCAGG TGTCCAACGC GCTGGGCACC GTGGTGCCGG TGAAGGAGAT CGTCGAGCTG GCGCACCGCG CCGGCGCGAA GGCGCTGGTC GACGGCGCGC AGTCGGTCTC GCACCTGCGG GTGAACGTGC AGGCGCTCGA CGCCGACTTC TTCGTGTTCT CCGGCCACAA GATCTTCGGT CCCACCGGCA TCGGCGTGGT CTACGGCAAG CGCGAGGTGC TCGAGGACAT GCCCCCGTGG CAGGGCGGCG GCAACATGAT CGCCGACGTG ACTTTCGAGA AGACGGTCTA CCACGGGCCG CCGACGCGCT TCGAGGCCGG CACCGGCAAC ATCGCCGACG CGGTGGGCCT GGGCGCGGCG CTCGACTACG TGGAGCGCGT GGGCATCGAG AACATCGCGC GCTACGAGCA CGACCTGCTC GACTACGCGA CGCACGCGCT GCGCCCGATC GCCGGCGTAC GGCTGGTGGG CACCGCGCGC GACAAGGCCA GCGTGCTGTC CTTCGTGCTC GACGGCTACA CGACCGACGA GGTGGGCAAG GCGCTCAACG AGGAAGGCAT CGCGGTGCGC ACCGGCCACC ACTGCGCCCA GCCCATCCTG CGCCGCTTCG GACTCGAGGC CACGGTGCGG CCCTCGCTGG CGTTCTACAA CACCTGCGAG GAAGTGGACC GCTTCATCGC GGTGGTGCGC CGGCTGAGCG GCGCGCGGCG CGTGCCGGCA CGCTGA
|
Protein sequence | MSTSDALSGP VAAAQPALPP GLPDPATLAR LAGEFFAALP GDGGAHGGVP VPAQPTPPGL SLPGVLTGGP ATPNVLPLGA AAPGANLVPS SPQHAAAHGA SAPALVPHAA APNGLPEHVA TLPPTLDGRL GSHALGVPQV VPAASPLVGL PSHPSAGGAP AATDTPSPYY FLSHATSGPG ASEARIAPLG SAPTGLPQEA DLRSLLRSDA PSSGSAPAAS PGAFYFLDAQ RHPGPHGSHA GAVPNAAASA HPPFDVHAIR RDFPILQERV NGRPLVWFDN AATTHKPQSV IDRIAHFYAH ENSNIHRAAH ELAARATDAY EGARETVRRF IGASSVEEIV FVRGTTEAIN LVAKSWGAQN IGAGDEIVVS HLEHHANIVP WQQLAAEKGA KLRVIPVDDS GQVRLDEYRK LLNDRTKIVS VTQVSNALGT VVPVKEIVEL AHRAGAKALV DGAQSVSHLR VNVQALDADF FVFSGHKIFG PTGIGVVYGK REVLEDMPPW QGGGNMIADV TFEKTVYHGP PTRFEAGTGN IADAVGLGAA LDYVERVGIE NIARYEHDLL DYATHALRPI AGVRLVGTAR DKASVLSFVL DGYTTDEVGK ALNEEGIAVR TGHHCAQPIL RRFGLEATVR PSLAFYNTCE EVDRFIAVVR RLSGARRVPA R
|
| |