Gene Mmwyl1_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_2049 
Symbol 
ID5369145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp2320950 
End bp2322125 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content43% 
IMG OID640804394 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001340906 
Protein GI152996071 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT ATAAAGACGC AACGAATGCG ATTCGTGCAG GTATTCGTCA AACTCAAGAG 
CAAGAAAACA GCGAAGCCAT TTTTATGACA TCGAGCTTTG CTTATGGCAG TGCAGAAGAG
GCAGCAGGGA AGTTTTCTGG TGAAGAAGAT GGCAACGTTT ATTCTCGTTT TACCAACCCA
ACAGTGGAAT TATTTGAAAA GCGTTTAGCG ACCCTAGAAA AAGGCGAGGC TGCCATTGCA
ACAAGTTCTG GGATGGCGGC GTTAATGACG CTTGCGTATA GTTTGTTAAG CGCAGGTGAT
CGTGTTGTTT GTTCCCGCAA TATTTTTGGC TCAACAGTTA AGTTTTTTAA TGCCTACACA
GCAAAATTTG GTGTTGAAGT TCTGTATGTG GATGCAACAG ATTATGCTGC ATGGGAAGAG
GCAATTAACG AAAATACACG TTTTTGTTAT TTCGAAACAC CTTCTAATCC ACTCTATGAG
GTGGTTGATG TCGCTCGTGT GGCTGCCTTG GCGCATGCTA AAGGCGCGTT GCTTTGTGTT
GATACGGTTT TAGCGACACC GGCACTGCAG AACCCATTGA CTCAAGGTGC TGATATTGTC
ATGCAATCAG CAACTAAGTT TATTGATGGT CAAGGTCGTT GTTTGGGTGG AGCTTTAATT
TCCAGCCAAA AAATAGTTGA TGTCTTCACT GCCTTTATGC GTAGTGCTGG GCCATGTATG
AGCCCATTTA ATGCTTGGGT GTTGTTAAAT GGTTTAGAGA CCTTGTCTTT GCGTATGACG
GCACATTCCG CAAACGCTAT GAAGCTGGCT ACATATTTAG AAACGCACCC TAAAGTACTT
AAAGTAAATT ACGGCGGATT GCCAAGCCAT AAATATCATG AGCTGGCTAA ACAGCAGCAA
AAAGATTTTG GTGGATTGTT GTCTTTTGAA GTGGAGGGTG GTCGTAAAGC TGCTTGGGCC
GTCATTAATG CATCGAAATT GATGTCGATT ACAGGCAACT TGGGTGATAC CAAAACTTTG
GTGACTCATC CGGCTACTAC GACTCATGGG CGATTGACGG ACGATGAAAA AGCCAAGGCT
GGTATTACAG AAGGATTGAT TCGTATCTCT GTTGGTTTGG AAGATATCGA CGATATTATT
GCAGATCTTA AAGAAGCGTT AGACCAGTTG GATTAA
 
Protein sequence
MADYKDATNA IRAGIRQTQE QENSEAIFMT SSFAYGSAEE AAGKFSGEED GNVYSRFTNP 
TVELFEKRLA TLEKGEAAIA TSSGMAALMT LAYSLLSAGD RVVCSRNIFG STVKFFNAYT
AKFGVEVLYV DATDYAAWEE AINENTRFCY FETPSNPLYE VVDVARVAAL AHAKGALLCV
DTVLATPALQ NPLTQGADIV MQSATKFIDG QGRCLGGALI SSQKIVDVFT AFMRSAGPCM
SPFNAWVLLN GLETLSLRMT AHSANAMKLA TYLETHPKVL KVNYGGLPSH KYHELAKQQQ
KDFGGLLSFE VEGGRKAAWA VINASKLMSI TGNLGDTKTL VTHPATTTHG RLTDDEKAKA
GITEGLIRIS VGLEDIDDII ADLKEALDQL D