Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2262 |
Symbol | |
ID | 4785101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2420861 |
End bp | 2422081 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640090830 |
Product | cysteine desulfurase |
Protein accession | YP_001021453 |
Protein GI | 124267449 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.626751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATGA CCCCGCACTT TCCGATCTAC ATGGACTACG GCGCCACGAC GCCGGTCGAC CCCCGCGTCG TCGACGCGAT GATTCCCTGG CTGCGTGAAC ACTTCGGCAA TCCAGCCTCG CGCAGCCACG CATGGGGATG GGAGGCCGAA GAGGCGGTCG AGAAGGCACG CGGCGAGGTG GCGGCACTGA TCGGTGCGGA CCCGCGCGAG ATCGTCTGGA CCTCCGGCGC GACCGAGTCC AACAACCTCG CGCTCAAGGG CGCGGCGCAG TTCTACAAGA CGCGCGGCAA GCACCTGATC ACGGTCCGGA CCGAGCACAA GGCGGTGCTC GACACCATGC GCGAACTCGA GCGCCAGGGT TTCGAGGTGA GCTACCTGGA GGTTCAGGAA GACGGTCTGA TCGACCTCGA CCTGCTCAAG ACCACGATCC GGCCTGACAC GATCCTCGTC TCGGTGATGT TCGTCAACAA CGAGATCGGC GTCATCCAGG ACATCCCGGC GATCGGTGCA CTGTGCCGCG AGCGCGGCAT CGTGTTCCAT GTCGATGCGG CCCAGGCGAC CGGCAAGGTC GCGATCGATC TGGCCACGCT GCCGGTCGAC CTGATGAGCT TGGCCTCGCA CAAGACCTAC GGTCCCAAGG GCATCGGTGC GCTGTACGTG CGCCGCAAGC CGCGCGTGCG GCTGGAAGCG CAGATGCATG GCGGCGGCCA CGAGCGTGGC ATGCGCTCCG GCACGCTGCC CACGCATCAG ATCGTCGGCA TGGGCGAGGC CTTCCGCATC GCGCGCGAGG AGATGGGCAC CGAGAGCGAG CGCATTCGCA TGCTCCAGAA GCGCCTGATC GACGGCCTGG CCGACATCGA GCAGACCTTC CTGAACGGCC ACGCCGAGCG GCGCGTGCCG CACAACGTGA ACATGAGCTT CAACTTCGTC GAGGGCGAGT CGCTGATCAT GGGCATCAAG GGTCTCGCGG TGTCGTCGGG TTCGGCCTGC ACCTCGGCCA GCCTGGAGCC GAGCTACGTG CTGCGCGCGC TGGGCCGCAG CGACGAACTG GCGCACTCCA GCCTGCGCAT GACCATCGGC CGCTTCACGA CCGAGGAAGA GATCGATTAC GCCATCGGCA CCATCCGCCA CAACGTGGCC AAACTGCGCG AACTGTCGCC GCTTTGGGAG ATGTACCAGG ACGGCGTCGA TATCAGCACC ATCCAGTGGG CGGCGCACTG A
|
Protein sequence | MDMTPHFPIY MDYGATTPVD PRVVDAMIPW LREHFGNPAS RSHAWGWEAE EAVEKARGEV AALIGADPRE IVWTSGATES NNLALKGAAQ FYKTRGKHLI TVRTEHKAVL DTMRELERQG FEVSYLEVQE DGLIDLDLLK TTIRPDTILV SVMFVNNEIG VIQDIPAIGA LCRERGIVFH VDAAQATGKV AIDLATLPVD LMSLASHKTY GPKGIGALYV RRKPRVRLEA QMHGGGHERG MRSGTLPTHQ IVGMGEAFRI AREEMGTESE RIRMLQKRLI DGLADIEQTF LNGHAERRVP HNVNMSFNFV EGESLIMGIK GLAVSSGSAC TSASLEPSYV LRALGRSDEL AHSSLRMTIG RFTTEEEIDY AIGTIRHNVA KLRELSPLWE MYQDGVDIST IQWAAH
|
| |