Gene ECH74115_4318 details

Gene Information

Locus tag: ECH74115_4318
Symbol: metC
ID: 6968159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3995540
End bp: 3996862
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 51%
IMG OID: 643388047
Product: cystathionine beta-lyase
Protein accession: YP_002272485
Protein GI: 209397938
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01324] cystathionine beta-lyase, bacterial
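
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (the record does not state its coordinate convention) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3995540
end_bp = 3996862

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.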


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTTTAC GCAGTAAAAA AGTCACCAGC ACGCCATTTG CGAAAATTTT CTGCTTTATG 
CCAATTCTTC AGGATGCGCC CGCGAATATT CATGCTAGTT TAGACATCCA GACGTATAAA
AACAGGAATC CCGACATGGC GGACAAAAAG CTTGATACTC AACTGGTGAA TGCAGGACGC
AGCAAAAAAT ACACTCTCGG CGCGGTAAAT AGCGTGATTC AGCGCGCTTC TTCGCTGGTA
TTTGAGAGTA TGGAAGCCAA AAAGCACGCG ACGCGCAATC GCGCCAATGG CGAGTTGTTC
TATGGACGGC GCGGAACGTT AACCCATTTT TCCTTACAAC AAGCGATGTG TGAACTGGAA
GGTGGCGCAG GCTGCGCGCT ATTTCCCTGC GGGGCGGCAG CGGTTGCTAA TTCGATTCTT
GCTTTTGTCG AACAGGGCGA TCATGTGTTG ATGACCAACA CCGCCTATGA ACCGAGTCAG
GATTTCTGTA GCAAAATCCT CAGCAAACTG GGCGTAACGA CATCGTGGTT TGATCCGCTG
ATTGGTGCCG ATATCGTTAA GCATCTGCAG CCAAACACTA AAATCGTGTT TCTGGAATCG
CCGGGTTCCA TCACTATGGA AGTCCACGAC GTTCCGGCGA TTGTTGCCGC CGTGCGCAGT
GTGGTGCCGG ATGCCATCAT TATGATCGAC AACACCTGGG CAGCCGGTGT GCTGTTTAAG
GCGCTGGATT TTGGCATCGA TGTTTCTATT CAAGCCGCCA CCAAATATCT GGTTGGGCAT
TCAGATGCGA TGATTGGCAC TGCCGTGTGC AATGCCCGTT GCTGGGAGCA GCTACGGGAA
AATGCCTATC TGATGGGCCA GATGGTCGAT GCCGATACCG CCTATATAAC CAGCCGTGGC
CTGCGCACAT TAGGTGTGCG TTTGCGTCAA CATCATGAAA GCAGTCTGAA AGTGGCCGAA
TGGCTGGCAG AACATCCGCA AGTTGCGCGA GTTAACCACC CTGCCCTGCC TGGCAGTAAA
GGACACGAAT TCTGGAAACG AGACTTTACA GGCAGCAGCG GGCTATTTTC CTTTGTGCTT
AAGAAAAAAC TCAATGATGA AGAGCTGGCG AACTATCTGG ATAACTTCAG TTTATTCAGC
ATGGCCTACT CGTGGGGCGG GTATGAATCG TTGATCCTGG CAAATCAACC AGAACATATC
GCAGCCATTC GCCCACAAGG CGAGATCGAT TTTAGCGGGA CCTTGATTCG CCTGCATATT
GGTCTGGAAG ATGTCGACGA TCTGATTGCC GATCTGGACG CCGGTTTTGC GCGGATTGTA
TAA
 
Protein sequence
MRLRSKKVTS TPFAKIFCFM PILQDAPANI HASLDIQTYK NRNPDMADKK LDTQLVNAGR 
SKKYTLGAVN SVIQRASSLV FESMEAKKHA TRNRANGELF YGRRGTLTHF SLQQAMCELE
GGAGCALFPC GAAAVANSIL AFVEQGDHVL MTNTAYEPSQ DFCSKILSKL GVTTSWFDPL
IGADIVKHLQ PNTKIVFLES PGSITMEVHD VPAIVAAVRS VVPDAIIMID NTWAAGVLFK
ALDFGIDVSI QAATKYLVGH SDAMIGTAVC NARCWEQLRE NAYLMGQMVD ADTAYITSRG
LRTLGVRLRQ HHESSLKVAE WLAEHPQVAR VNHPALPGSK GHEFWKRDFT GSSGLFSFVL
KKKLNDEELA NYLDNFSLFS MAYSWGGYES LILANQPEHI AAIRPQGEID FSGTLIRLHI
GLEDVDDLIA DLDAGFARIV
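
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, in which TTG is one of several permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that translation, not the tool used to produce this record. The function name `translate_cds` is hypothetical, and the example input is only the first 60 bases of the gene sequence with a stop codon appended for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 assigns the same amino acids as the standard code;
# it differs in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete coding sequence, reading the first codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # an initiation codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, plus an illustrative stop codon.
fragment = (
    "TTGCGTTTAC GCAGTAAAAA AGTCACCAGC ACGCCATTTG CGAAAATTTT CTGCTTTATG TAA"
)
print(translate_cds(fragment))
```

The printed peptide can be compared against the first 20 residues of the protein sequence listed above.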