Gene EcSMS35_3292 details


Gene Information

Locus tag: EcSMS35_3292
Symbol: metC
ID: 6144171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3367330
End bp: 3368517
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 52%
IMG OID: 641618122
Product: cystathionine beta-lyase
Protein accession: YP_001745272
Protein GI: 170680690
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01324] cystathionine beta-lyase, bacterial
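
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3367330
END_BP = 3368517
GENE_LENGTH_BP = 1188
PROTEIN_LENGTH_AA = 395


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the coordinates span 1188 bp, and 1188 bp corresponds to 396 codons, one of which is the stop codon.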


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA AAAAGCTTGA TACTCAACTG GTGAATGCAG GACGCAGCAA AAAATACACT 
CTCGGCGCGG TAAATAGCGT GATTCAGCGC GCTTCTTCGC TGGTCTTTGA CAGTGTGGAA
GCCAAAAAGC ACGCGACACG CAATCGCGCC AATGGCGAGT TGTTCTATGG GCGGCGCGGA
ACGTTAACCC ATTTTTCCTT ACAACAAGCG ATGTGTGAAC TGGAAGGTGG CGCAGGCTGC
GCGCTATTTC CCTGCGGGGC GGCGGCGGTT GCTAATTCGA TTCTTGCTTT TGTCGAACAG
GGCGATCATG TGTTGATGAC CAACACCGCC TATGAACCGA GTCAGGATTT CTGTAGCAAA
ATCCTCAGCA AACTGGGCGT AACGACATCG TGGTTTGATC CACTGATTGG TGCCGATATC
GTTAAGCATC TGCAGCCAAA CACTAAAATC GTGTTTCTGG AATCGCCAGG CTCCATCACC
ATGGAAGTCC ACGACGTTCC GGCGATTGTT GCCGCCGTAC GCAGTGTGGT GCCGGATGCC
ATCATTATGA TCGACAACAC CTGGGCAGCC GGTGTGCTGT TTAAGGCGCT GGATTTTGGC
ATTGATGTTT CCATTCAGGC CGCTACCAAA TATCTGGTTG GGCATTCAGA TGCGATGATT
GGCACTGCGG TGTGCAATGC CCGTTGCTGG GAGCAGCTGC GAGAGAATGC CTATCTGATG
GGCCAGATGG TCGATGCCGA TACCGCCTAT ATAACCAGCC GTGGCCTGCG CACTTTGGGT
GTTCGCCTGC GTCAACATCA TGAAAGCAGT CTGAAAGTGG CTGAATGGCT GGCAGAACAT
CCCCAAGTAG CGCGAGTTAA CCACCCTGCT CTGCCTGGCA GTAAAGGCCA CGAATTCTGG
AAACGAGACT TTACAGGTAG CAGCGGGCTA TTTTCCTTTG TGCTTAAGAA AAAGCTCAAT
GATGAAGAAC TGGCGAACTA TCTGGATAAC TTCAGTTTAT TCAGCATGGC CTACTCGTGG
GGCGGGTATG AATCGTTGAT CCTGGCAAAT CAACCAGAAC ATATCGCCGC CATTCGCCCA
CAAGGCGAGA TCGATTTTAG CGGGACCTTG ATTCGCCTGC ATATTGGTCT GGAAGATGTC
GACGATCTGA TTGCCGATCT GGATGCCGGT TTTGCGCGAA TTGTATAA
 
Protein sequence
MADKKLDTQL VNAGRSKKYT LGAVNSVIQR ASSLVFDSVE AKKHATRNRA NGELFYGRRG 
TLTHFSLQQA MCELEGGAGC ALFPCGAAAV ANSILAFVEQ GDHVLMTNTA YEPSQDFCSK
ILSKLGVTTS WFDPLIGADI VKHLQPNTKI VFLESPGSIT MEVHDVPAIV AAVRSVVPDA
IIMIDNTWAA GVLFKALDFG IDVSIQAATK YLVGHSDAMI GTAVCNARCW EQLRENAYLM
GQMVDADTAY ITSRGLRTLG VRLRQHHESS LKVAEWLAEH PQVARVNHPA LPGSKGHEFW
KRDFTGSSGL FSFVLKKKLN DEELANYLDN FSLFSMAYSW GGYESLILAN QPEHIAAIRP
QGEIDFSGTL IRLHIGLEDV DDLIADLDAG FARIV
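
The gene and protein sequences can be compared by translating the nucleotide sequence codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and the `translate` helper is a hypothetical name introduced here. Only the first 60 and last 18 nucleotides of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence above.
    head = "ATGGCGGACA AAAAGCTTGA TACTCAACTG GTGAATGCAG GACGCAGCAA AAAATACACT"
    # Last 18 bases of the gene sequence above.
    tail = "TTTGCGCGAA TTGTATAA"
    print(translate(head))
    print(translate(tail))
```

The two fragments translate to MADKKLDTQLVNAGRSKKYT and FARIV, which match the start and end of the protein sequence listed above.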