Gene EcHS_A3186 details

Gene Information

Locus tag: EcHS_A3186
Symbol: metC
ID: 5595136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3197067
End bp: 3198254
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 52%
IMG OID: 640922305
Product: cystathionine beta-lyase
Protein accession: YP_001459803
Protein GI: 157162485
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01324] cystathionine beta-lyase, bacterial
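
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(3197067, 3198254)
print(length)                           # 1188
print(expected_protein_length(length))  # 395
```

Both results agree with the Gene Length and Protein Length fields of this record.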


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGACA AAAAGCTTGA TACTCAACTG GTGAATGCAG GACGCAGCAA AAAATACACT 
CTCGGCGCGG TAAATAGCGT GATTCAGCGC GCTTCTTCGC TGGTCTTTGA CAGTGTGGAA
GCCAAAAAGC ACGCGACGCG CAATCGCGCC AATGGCGAGT TGTTCTATGG ACGGCGCGGA
ACGTTAACCC ATTTCTCCTT ACAACAAGCG ATGTGTGAAC TGGAAGGTGG CGCAGGCTGC
GCGCTATTTC CCTGCGGGGC GGCAGCGGTT GCTAATTCCA TTCTTGCTTT TGTCGAACAG
GGCGATCATG TGTTGATGAC CAACACCGCC TATGAACCGA GTCAGGATTT CTGTAGCAAA
ATCCTCAGCA AACTGGGCGT AACGACATCG TGGTTTGATC CGCTGATTGG TGCCGATATC
GTTAAGCATC TGCAGCCAAA CACTAAAATC GTGTTTCTGG AATCGCCAGG CTCCATCACC
ATGGAAGTCC ACGACGTTCC GGCGATTGTT GCCGCCGTAC GCAGTGTGGT GCCGGATGCC
ATCATTATGA TCGACAACAC CTGGGCAGCC GGTGTGCTGT TTAAGGCGCT GGATTTTGGC
ATCGATGTTT CTATTCAAGC CGCCACCAAA TATCTGGTTG GGCATTCAGA TGCGATGATT
GGCACTGCCG TGTGCAATGC CCGTTGCTGG GAGCAGCTAC GGGAAAACGC CTATCTGATG
GGCCAGATGG TCGATGCCGA TACCGCCTAT ATAACCAGCC GTGGCCTGCG CACATTAGGT
GTGCGTTTGC GTCAACATCA TGAAAGCAGT CTGAAAGTGG CTGAATGGCT GGCAGAACAT
CCGCAAGTAG CGCGAGTTAA CCACCCTGCT CTGCCTGGCA GTAAAGGACA CGAATTCTGG
AAACGAGACT TTACAGGTAG CAGCGGGCTA TTTTCCTTTG TGCTTAAGAA AAAACTCAGT
AATGAAGAGC TGGCGAACTA TCTGGATAAC TTCAGTTTAT TCAGCATGGC CTACTCGTGG
GGCGGGTATG AATCGTTGAT CCTGGCGAAT CAACCAGAAC ATATCGCCGC CATTCGCCCA
CAAGGCGAGA TCGATTTTAG CGGGACCTTG ATTCGCCTGC ATATTGGTCT GGAAGATGTC
GACGATCTGA TTGCCGATCT GGACGCCGGT TTTGCGCGAA TTGTATAA
 
Protein sequence
MADKKLDTQL VNAGRSKKYT LGAVNSVIQR ASSLVFDSVE AKKHATRNRA NGELFYGRRG 
TLTHFSLQQA MCELEGGAGC ALFPCGAAAV ANSILAFVEQ GDHVLMTNTA YEPSQDFCSK
ILSKLGVTTS WFDPLIGADI VKHLQPNTKI VFLESPGSIT MEVHDVPAIV AAVRSVVPDA
IIMIDNTWAA GVLFKALDFG IDVSIQAATK YLVGHSDAMI GTAVCNARCW EQLRENAYLM
GQMVDADTAY ITSRGLRTLG VRLRQHHESS LKVAEWLAEH PQVARVNHPA LPGSKGHEFW
KRDFTGSSGL FSFVLKKKLS NEELANYLDN FSLFSMAYSW GGYESLILAN QPEHIAAIRP
QGEIDFSGTL IRLHIGLEDV DDLIADLDAG FARIV
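
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short Python sketch. It assumes the sequences are pasted as shown here, in whitespace-separated blocks of ten characters. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (assignments used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First thirty bases of the gene sequence above.
print(translate("ATGGCGGACA AAAAGCTTGA TACTCAACTG"))  # MADKKLDTQL
# Last eighteen bases, ending in the TAA stop codon.
print(translate("TTTGCGCGAA TTGTATAA"))               # FARIV
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field of this record.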