Gene Mpe_A2262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2262 
Symbol 
ID4785101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2420861 
End bp2422081 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID640090830 
Productcysteine desulfurase 
Protein accessionYP_001021453 
Protein GI124267449 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR02006] cysteine desulfurase IscS
[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.626751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATGA CCCCGCACTT TCCGATCTAC ATGGACTACG GCGCCACGAC GCCGGTCGAC 
CCCCGCGTCG TCGACGCGAT GATTCCCTGG CTGCGTGAAC ACTTCGGCAA TCCAGCCTCG
CGCAGCCACG CATGGGGATG GGAGGCCGAA GAGGCGGTCG AGAAGGCACG CGGCGAGGTG
GCGGCACTGA TCGGTGCGGA CCCGCGCGAG ATCGTCTGGA CCTCCGGCGC GACCGAGTCC
AACAACCTCG CGCTCAAGGG CGCGGCGCAG TTCTACAAGA CGCGCGGCAA GCACCTGATC
ACGGTCCGGA CCGAGCACAA GGCGGTGCTC GACACCATGC GCGAACTCGA GCGCCAGGGT
TTCGAGGTGA GCTACCTGGA GGTTCAGGAA GACGGTCTGA TCGACCTCGA CCTGCTCAAG
ACCACGATCC GGCCTGACAC GATCCTCGTC TCGGTGATGT TCGTCAACAA CGAGATCGGC
GTCATCCAGG ACATCCCGGC GATCGGTGCA CTGTGCCGCG AGCGCGGCAT CGTGTTCCAT
GTCGATGCGG CCCAGGCGAC CGGCAAGGTC GCGATCGATC TGGCCACGCT GCCGGTCGAC
CTGATGAGCT TGGCCTCGCA CAAGACCTAC GGTCCCAAGG GCATCGGTGC GCTGTACGTG
CGCCGCAAGC CGCGCGTGCG GCTGGAAGCG CAGATGCATG GCGGCGGCCA CGAGCGTGGC
ATGCGCTCCG GCACGCTGCC CACGCATCAG ATCGTCGGCA TGGGCGAGGC CTTCCGCATC
GCGCGCGAGG AGATGGGCAC CGAGAGCGAG CGCATTCGCA TGCTCCAGAA GCGCCTGATC
GACGGCCTGG CCGACATCGA GCAGACCTTC CTGAACGGCC ACGCCGAGCG GCGCGTGCCG
CACAACGTGA ACATGAGCTT CAACTTCGTC GAGGGCGAGT CGCTGATCAT GGGCATCAAG
GGTCTCGCGG TGTCGTCGGG TTCGGCCTGC ACCTCGGCCA GCCTGGAGCC GAGCTACGTG
CTGCGCGCGC TGGGCCGCAG CGACGAACTG GCGCACTCCA GCCTGCGCAT GACCATCGGC
CGCTTCACGA CCGAGGAAGA GATCGATTAC GCCATCGGCA CCATCCGCCA CAACGTGGCC
AAACTGCGCG AACTGTCGCC GCTTTGGGAG ATGTACCAGG ACGGCGTCGA TATCAGCACC
ATCCAGTGGG CGGCGCACTG A
 
Protein sequence
MDMTPHFPIY MDYGATTPVD PRVVDAMIPW LREHFGNPAS RSHAWGWEAE EAVEKARGEV 
AALIGADPRE IVWTSGATES NNLALKGAAQ FYKTRGKHLI TVRTEHKAVL DTMRELERQG
FEVSYLEVQE DGLIDLDLLK TTIRPDTILV SVMFVNNEIG VIQDIPAIGA LCRERGIVFH
VDAAQATGKV AIDLATLPVD LMSLASHKTY GPKGIGALYV RRKPRVRLEA QMHGGGHERG
MRSGTLPTHQ IVGMGEAFRI AREEMGTESE RIRMLQKRLI DGLADIEQTF LNGHAERRVP
HNVNMSFNFV EGESLIMGIK GLAVSSGSAC TSASLEPSYV LRALGRSDEL AHSSLRMTIG
RFTTEEEIDY AIGTIRHNVA KLRELSPLWE MYQDGVDIST IQWAAH