Gene EcolC_1819 details


Gene Information

Locus tag: EcolC_1819
Symbol:
ID: 6064759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2018393
End bp: 2019757
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 54%
IMG OID: 641601233
Product: L-serine dehydratase 1
Protein accession: YP_001724795
Protein GI: 170019841
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form
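
The length fields above agree with the coordinates: the inclusive span from 2018393 to 2019757 covers 1365 bp, and 1365 / 3 gives 455 codons, one of which is the stop codon, leaving 454 residues. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2018393, 2019757)
print(length)                  # 1365
print(protein_length(length))  # 454
```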


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.561411
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.687131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTAGTC TATTCGACAT GTTTAAGGTG GGGATTGGTC CCTCATCTTC CCATACCGTA 
GGGCCTATGA AGGCAGGTAA ACAGTTCGTC GATGATCTGG TCGAAAAAGG CTTACTGGAT
AGCGTTACTC GCGTTGCCGT GGACGTTTAT GGTTCACTGT CGCTGACGGG TAAAGGCCAC
CACACCGATA TCGCCATTAT TATGGGTCTT GCAGGTAACG AACCTGCCAC CGTGGATATC
GACAGTATTC CCGGTTTTAT TCGCGACGTA GAAGAGCGCG AACGTCTGCT GCTGGCACAG
GGACGGCATG AAGTGGATTT CCCGCGCGAC AACGGGATGC GTTTTCATAA CGGCAACCTG
CCGCTGCATG AAAACGGTAT GCAAATCCAC GCCTATAACG GCGATGAAGT CGTCTACAGC
AAAACTTATT ATTCCATCGG CGGCGGTTTT ATCGTCGATG AAGAACACTT TGGTCAGGAT
GCTGCCAACG AAGTAAGCGT GCCGTATCCG TTCAAATCTG CCACCGAACT GCTCGCGTAC
TGTAATGAAA CCGGCTATTC GCTGTCTGGT CTCGCTATGC AGAACGAACT GGCGCTGCAC
AGCAAGAAAG AGATCGACGA GTATTTCGCG CATGTCTGGC AAACCATGCA GGCATGTATC
GATCGCGGGA TGAACACCGA AGGTGTACTG CCAGGCCCGC TGCGCGTGCC ACGTCGTGCG
TCTGCCCTGC GCCGGATGCT GGTTTCCAGC GATAAACTGT CTAACGATCC GATGAATGTC
ATTGACTGGG TAAACATGTT TGCGCTGGCA GTTAACGAAG AAAACGCCGC CGGTGGTCGT
GTGGTAACTG CGCCAACCAA CGGTGCCTGC GGTATCGTTC CGGCAGTGCT GGCTTACTAT
GACCACTTTA TTGAATCGGT CAGCCCGGAC ATCTATACCC GTTACTTTAT GGCAGCGGGC
GCGATTGGTG CATTGTATAA AATGAACGCC TCTATTTCCG GTGCGGAAGT TGGTTGCCAG
GGCGAAGTGG GTGTTGCCTG TTCAATGGCT GCTGCGGGTC TTGCAGAACT GCTGGGCGGT
AGCCCGGAAC AGGTTTGCGT GGCGGCGGAA ATTGGCATGG AACACAACCT TGGTTTAACC
TGCGACCCGG TTGCAGGGCA GGTTCAGGTG CCGTGCATTG AGCGTAATGC CATTGCCTCT
GTGAAGGCGA TTAACGCCGC GCGGATGGCT CTGCGCCGCA CCAGTGCACC GCGCGTCTCG
CTGGATAAGG TCATCGAAAC GATGTACGAA ACCGGTAAGG ACATGAACGC CAAATACCGC
GAAACCTCAC GCGGTGGTCT GGCAATCAAA GTCCAGTGTG ACTAA
 
Protein sequence
MISLFDMFKV GIGPSSSHTV GPMKAGKQFV DDLVEKGLLD SVTRVAVDVY GSLSLTGKGH 
HTDIAIIMGL AGNEPATVDI DSIPGFIRDV EERERLLLAQ GRHEVDFPRD NGMRFHNGNL
PLHENGMQIH AYNGDEVVYS KTYYSIGGGF IVDEEHFGQD AANEVSVPYP FKSATELLAY
CNETGYSLSG LAMQNELALH SKKEIDEYFA HVWQTMQACI DRGMNTEGVL PGPLRVPRRA
SALRRMLVSS DKLSNDPMNV IDWVNMFALA VNEENAAGGR VVTAPTNGAC GIVPAVLAYY
DHFIESVSPD IYTRYFMAAG AIGALYKMNA SISGAEVGCQ GEVGVACSMA AAGLAELLGG
SPEQVCVAAE IGMEHNLGLT CDPVAGQVQV PCIERNAIAS VKAINAARMA LRRTSAPRVS
LDKVIETMYE TGKDMNAKYR ETSRGGLAIK VQCD
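
The protein sequence can be checked against the gene sequence using translation table 11, which shares the standard codon assignments and lets an initiator such as GTG be read as methionine. The sketch below is a hand-rolled illustration, not the tool used to build this record; `translate` and `START_CODONS` are hypothetical names, and the start-codon set is an assumed subset of the table 11 initiators.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (also used by translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGATTAGTC TATTCGACAT GTTTAAGGTG"))  # MISLFDMFKV
```

Passing the full gene sequence, spaces and line breaks included, should give a string that can be compared with the protein sequence once its whitespace is removed.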