Gene EcolC_4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4017 
Symbol 
ID6064575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4418200 
End bp4419129 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content50% 
IMG OID641603428 
Producthomoserine O-succinyltransferase 
Protein accessionYP_001726943 
Protein GI170021989 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1897] Homoserine trans-succinylase 
TIGRFAM ID[TIGR01001] homoserine O-succinyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.149933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC GTGTGCCGGA CGAGCTACCC GCCGTCAATT TCTTGCGTGA AGAAAACGTC 
TTTGTGATGA CAACTTCTCG TGCGTCTGGT CAGGAAATTC GTCCACTTAA GGTTCTGATC
CTTAACCTGA TGCCGAAGAA GATTGAAACT GAAAATCAGT TTCTGCGCCT GCTTTCAAAC
TCACCTTTGC AGGTCGATAT TCAGCTGTTG CGCATCGATT CCCGTGAATC GCGCAACACG
CCCGCAGAGC ATCTGAACAA CTTCTACTGT AACTTTGAAG ATATTCAGGA TCAGAACTTT
GACGGTTTGA TTGTAACTGG TGCGCCGCTG GGCCTGGTGG AGTTTAATGA TGTCGCTTAC
TGGCCGCAGA TCAAACAGGT GCTGGAGTGG TCGAAAGATC ACGTCACCTC GACGCTGTTT
GTCTGCTGGG CGGTACAGGC CGCGCTCAAT ATCCTCTACG GCATTCCTAA GCAAACTCGC
ACCGACAAAC TCTCTGGCGT TTACGAGCAT CATATTCTCC ATCCTCATGC GCTTCTGACG
CGTGGCTTTG ATGATTCATT CCTGGCACCG CATTCGCGCT ATGCTGACTT TCCGGCAGCG
TTGATTCGTG ATTACACCGA TCTGGAAATT CTGGCAGAGA CGGAAGAAGG GGATGCATAT
CTGTTTGCCA GTAAAGATAA GCGCATTGCC TTTGTGACGG GCCATCCCGA ATATGATGCG
CAAACGCTGG CGCAGGAATT TTTCCGCGAT GTGGAAGCCG GACTAGACCC GGATGTACCG
TATAACTATT TCCCGCACAA TGATCCGCAA AATACACCGC GAGCGAGCTG GCGTAGTCAC
GGTAATTTAC TGTTTACCAA CTGGCTCAAC TATTACGTCT ACCAGATCAC GCCATACGAT
CTACGGCACA TGAATCCAAC GCTGGATTAA
 
Protein sequence
MPIRVPDELP AVNFLREENV FVMTTSRASG QEIRPLKVLI LNLMPKKIET ENQFLRLLSN 
SPLQVDIQLL RIDSRESRNT PAEHLNNFYC NFEDIQDQNF DGLIVTGAPL GLVEFNDVAY
WPQIKQVLEW SKDHVTSTLF VCWAVQAALN ILYGIPKQTR TDKLSGVYEH HILHPHALLT
RGFDDSFLAP HSRYADFPAA LIRDYTDLEI LAETEEGDAY LFASKDKRIA FVTGHPEYDA
QTLAQEFFRD VEAGLDPDVP YNYFPHNDPQ NTPRASWRSH GNLLFTNWLN YYVYQITPYD
LRHMNPTLD