Gene EcSMS35_4467 details

Gene Information

Locus tag: EcSMS35_4467
Symbol: metA
ID: 6144772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4559649
End bp: 4560578
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 50%
IMG OID: 641619283
Product: homoserine O-succinyltransferase
Protein accession: YP_001746395
Protein GI: 170681732
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1897] Homoserine trans-succinylase
TIGRFAM ID: [TIGR01001] homoserine O-succinyltransferase
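
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4559649
end_bp = 4560578
reported_gene_length = 930     # bp
reported_protein_length = 309  # aa

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.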


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.214699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0469066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATTC GTGTGCCGGA CGAGCTACCC GCCGTCAATT TCTTGCGTGA AGAAAACGTC 
TTTGTGATGA CAACTTCTCG TGCGTCTGGT CAGGAAATTC GTCCGCTAAA GGTACTTATC
CTTAACCTGA TGCCGAAGAA GATCGAAACT GAAAATCAGT TTCTGCGCCT GCTTTCAAAC
TCACCTTTGC AGGTCGATAT TCAGCTGTTG CGCATTGATT CCCGTGAATC GCGCAACACG
CCCGCAGAGC ATCTGAATAA CTTCTACTGT AACTTTGAAG ATATTCAGGA ACAAAACTTT
GACGGTCTGA TAGTTACCGG CGCGCCGCTG GGCCTGGTGG AGTTTAATGA TGTCGCTTAC
TGGCCGCAGA TCAAACAGGT GCTGGAGTGG TCGAAAGATC ACGTCACCTC GACGCTGTTT
GTCTGCTGGG CGGTACAGGC CGCGCTAAAT ATCCTCTACG GCATTCCTAA GCAAACTCGC
ACCGACAAAC TCTCTGGCGT TTACGAGCAT CATATTCTCC ATCCTCATGC GCTTCTGACG
CGTGGCTTTG ATGATTCATT CCTGGCACCG CATTCGCGCT ATGCTGACTT TCCGGCAGCG
TTGATTCGTG ATTACACCGA TCTGGAAATT CTGGCAGAGA CGGAAGAAGG GGATGCATAT
CTGTTTGCCA GCAAAGATAA GCGCATTGCC TTTGTGACGG GCCATCCCGA ATATGATGCG
CAAACGCTGG CGCAGGAATA TTTCCGCGAT GTGGAAGCCG GACTAGACCC GGAAGTACCG
TATAACTATT TCCCGCACAA TGATCCGCAA AACAAACCGC GAGCGAGCTG GCGTAGTCAC
GGTAATTTGC TGTTTACCAA CTGGCTCAAC TATTACGTCT ACCAGATCAC GCCATACGAT
CTACGGCACA TGAATCCAAC GCTGGATTAA
 
Protein sequence
MPIRVPDELP AVNFLREENV FVMTTSRASG QEIRPLKVLI LNLMPKKIET ENQFLRLLSN 
SPLQVDIQLL RIDSRESRNT PAEHLNNFYC NFEDIQEQNF DGLIVTGAPL GLVEFNDVAY
WPQIKQVLEW SKDHVTSTLF VCWAVQAALN ILYGIPKQTR TDKLSGVYEH HILHPHALLT
RGFDDSFLAP HSRYADFPAA LIRDYTDLEI LAETEEGDAY LFASKDKRIA FVTGHPEYDA
QTLAQEYFRD VEAGLDPEVP YNYFPHNDPQ NKPRASWRSH GNLLFTNWLN YYVYQITPYD
LRHMNPTLD
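
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a hedged example rather than the tool the database used: it assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard table 1 (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing used in the record's 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
first_block = "ATGCCGATTC GTGTGCCGGA CGAGCTACCC GCCGTCAATT TCTTGCGTGA AGAAAACGTC"
last_block = "CTACGGCACA TGAATCCAAC GCTGGATTAA"

print(translate(first_block))  # MPIRVPDELPAVNFLREENV
print(translate(last_block))   # LRHMNPTLD*
```

The two outputs correspond to the first 20 and last 9 residues of the protein sequence, with the trailing `*` marking the TAA stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.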