Gene EcSMS35_4657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4657 
Symbol 
ID6146112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4757810 
End bp4758973 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content52% 
IMG OID641619473 
Productglutathionylspermidine synthase domain-containing protein 
Protein accessionYP_001746581 
Protein GI170679916 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0754] Glutathionylspermidine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.717463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGAC ACAACGTTCC TGTGCGGCGG GATCTGGACC GGATTGCCGC CGATAACGGA 
TTTGACTTTC ATATCATCGA CAATGAAATC TATTGGGATG AGAGTCGGGC TTATCGCTTT
ACTTTGCGCC AGATTGAAGA GCAGATCGAA AAACCGACTG CGGAACTGCA TCAGATGTGC
CTGGAGGTGG TGGATCGTGC GGTTAAAGAT GAAGAGATCC TGACGCAACT GGCGATCCCG
CCGTTGTACT GGGATGTGAT CGCCGAAAGC TGGCACGCAC GCGATCCTTC GTTGTATGGC
CGTATGGATT TTGCCTGGTG TGGTAATGCA CCGGTTAAGT TGCTGGAGTA CAACGCCGAT
ACGCCAACTT CATTGTACGA GTCGGCTTAT TTCCAGTGGT TGTGGCTGGA AGATGCCCGG
CGCAGCGGCG TTATTCCGCG TGATGCCGAT CAGTACAATG CCATTCAGGA ACGCCTGATT
TCGCGCTTTA GTGAGCTTTA CAGTCGGGAA CCGTTTTATT TTTGCTGCTG TCAGGACACC
GATGAAGACA GGAGTACCGT GCTGTACTTG CAGGACTGCG CCCAGCAGGC AGGGCAGGAG
TCGCGGTTTA TCTACATTGA AGATCTCGGT TTGGGCGTCG GCGGAGTACT GACCGATCTT
GATGATAATG TCATCCAGCG TGCATTTAAG CTGTATCCGC TGGAGTGGAT GATGCGTGAC
GATAACGGTC CGCTGCTGCG CAAGCGTCGT GAGCAATGGG TGGAGCCGTT ATGGAAAAGT
ATCTTGAGTA ATAAAGGGCT AATGCCGCTG CTTTGGCGCT TCTTCCCTGG TCATCCTAAC
CTTCTTGCAT CCTGGTTTGA GGGTGAAAAA CCGCAGATTG CCGCTGGCGA AAGCTATGTG
CGTAAACCGA TTTACTCGCG CGAAGGCGGA AACGTCACCA TTTTCGACGG TCAGAATAAC
GTCGTTGACC ACGCTGATGG TGATTACGCC GATGAACCGA TGATCTACCA GGCGTTTCAA
CCTCTGCCGC GATTTGGCGA TAGCTACACA CTCATCGGTA GCTGGATTGT CGATGATGAA
GCGTGCGGAA TGGGGATCCG TGAAGATAAC ACGTTGATCA CCAAAGACAC CTCACGTTTC
GTTCCGCATT ACATTGCTGG ATAA
 
Protein sequence
MLRHNVPVRR DLDRIAADNG FDFHIIDNEI YWDESRAYRF TLRQIEEQIE KPTAELHQMC 
LEVVDRAVKD EEILTQLAIP PLYWDVIAES WHARDPSLYG RMDFAWCGNA PVKLLEYNAD
TPTSLYESAY FQWLWLEDAR RSGVIPRDAD QYNAIQERLI SRFSELYSRE PFYFCCCQDT
DEDRSTVLYL QDCAQQAGQE SRFIYIEDLG LGVGGVLTDL DDNVIQRAFK LYPLEWMMRD
DNGPLLRKRR EQWVEPLWKS ILSNKGLMPL LWRFFPGHPN LLASWFEGEK PQIAAGESYV
RKPIYSREGG NVTIFDGQNN VVDHADGDYA DEPMIYQAFQ PLPRFGDSYT LIGSWIVDDE
ACGMGIREDN TLITKDTSRF VPHYIAG