Gene EcSMS35_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2111 
SymbolputA 
ID6146110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2117924 
End bp2121886 
Gene Length3963 bp 
Protein Length1320 aa 
Translation table11 
GC content56% 
IMG OID641616987 
Producttrifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase 
Protein accessionYP_001744162 
Protein GI170681102 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase
[COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase 
TIGRFAM ID[TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.384648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACCA CCACCATGGG GGTTAAGCTG GACGACGCGA CGCGTGAGCG TATTAAGTCT 
GCCGCGACAC GTATCGATCG CACGCCACAC TGGTTAATTA AGCAGGCGAT TTTTTCTTAT
CTTGAACAAC TGGAAAACAG CGATACTCTG CCGGAGCTAC CTGCGCTGCT TTCTGGCGCG
GCCAATGAGA GCGATGAAGC ACCGACTCCG GCAGAGGAAC CACACCAGCC ATTCCTCGAC
TTTGCCGAGC AAATTTTGCC CCAGTCGGTT TCCCGCGCCG CGATCACCGC GGCCTATCGC
CGCCCGGAAA CCGAAGCGGT TTCGATGCTG CTGGAACAAG CCCGCCTGCC GCAACCAGTT
GCTGAACAGG CGCATAAACT GGCGTATCAA CTGGCCGATA AACTGCGTAA TCAAAAAAAT
GCCAGTGGTC GCGCAGGTAT GGTCCAGGGG TTATTGCAGG AGTTTTCGCT GTCATCGCAG
GAAGGCGTGG CGCTGATGTG TCTGGCAGAA GCGTTGTTGC GTATTCCCGA CAAAGCCACC
CGGGACGCGT TAATTCGCGA CAAAATCAGC AACGGTAACT GGCAGTCACA CATTGGTCGT
AGCCCGTCTT TGTTTGTTAA TGCCGCCACC TGGGGGCTGC TGTTTACCGG CAAACTGGTT
TCCACCCATA ACGAAGCCAG TCTCTCCCGC TCGCTGAACC GCATTATCGG TAAGAGCGGT
GAACCGCTGA TCCGCAAAGG TGTGGATATG GCGATGCGCC TGATGGGAGA GCAGTTCGTC
ACTGGCGAAA CCATCGCGGA AGCGTTAGCC AATGCCCGCA AGCTGGAAGA GAAAGGTTTC
CGTTACTCTT ACGATATGCT GGGCGAAGCC GCGCTGACTG CCGCAGATGC ACAGGCGTAT
ATGGTTTCCT ATCAACAGGC GATTCACGCC ATTGGTAAAG CGTCCAATGG TCGTGGCATC
TATGAAGGTC CAGGCATTTC AATCAAGCTG TCGGCGCTGC ATCCGCGTTA CAGCCGCGCC
CAGTATGACC GGGTAATGGA AGAGCTTTAC CCACGTCTGA AATCACTCAC CCTGCTGGCG
CGTCAGTACG ATATTGGTAT TAACATCGAT GCCGAAGAGG CCGATCGTCT GGAGATCTCC
CTCGATCTGC TGGAAAAACT CTGTTTCGAG CCGGAACTGG CAGGCTGGAA CGGCATCGGT
TTTGTTATTC AGGCTTATCA AAAACGCTGC CCGTTGGTGA TCGATTACCT GATTGATCTC
GCCACCCGCA GCCGTCGCCG TCTGATGATT CGCCTGGTGA AAGGCGCGTA CTGGGATAGT
GAAATCAAGC GTGCACAGAT GGACGGCCTT GAAGGTTATC CGGTTTATAC CCGCAAGGTG
TATACCGACG TTTCTTATCT TGCCTGTGCG AAAAAACTGC TGGCGGTGCC GAATTTAATC
TACCCGCAGT TCGCGACGCA CAACGCCCAT ACGCTGGCAG CGATTTATCA ACTGGCGGGT
CAAAACTACT ACCCGGGTCA GTACGAGTTC CAGTGCCTGC ATGGTATGGG CGAACCACTG
TATGAGCAGG TCACCGGGAA AGTTGCCGAC GGCAAACTTA ACCGTCCGTG TCGTATTTAT
GCTCCGGTCG GTACACATGA AACGCTGCTG GCGTATCTGG TGCGCCGCCT GCTGGAAAAC
GGTGCTAACA CCTCGTTTGT TAACCGTATT GCCGACACCT CTTTGCCACT GGATGAACTG
GTCGCCGATC CAGTCACTGC TGTAGAAAAA CTGGCACAAC AGGAAGGGCA AACTGGATTA
CCGCATCCGA AAATTCCCCT GCCGCGCGAT CTTTACGGTC ACGGGCGCGA CAACTCGGCA
GGGCTGGATC TCGCTAACGA ACACCGTCTG GCCTCGCTCT CTTCAGCCCT GCTCAATAGT
GCACTGCAAA AATGGCAGTC CTTGCCAATG CTGGAACAAC CGGTAGCGGC AGGTGAGATG
TCGCCCGTTA TTAACCCTGC GGAACCGAAA GATATTGTGG GCTATGTGCG TGAAGCCACG
CCGCGTGAAG TAGAACAGGC GCTGGAAAGT GCGGTTAATA ACGCGCCAAT CTGGTTTGCC
ACGCCTCCGG CTGAACGCGC GGCAATTTTG CATCGCGCTG CCGTGCTAAT GGAAAGCCAG
ATGCAGCAAC TGATTGGTAT TCTGGTGCGT GAAGCCGGAA AAACCTTCAG TAACGCCATT
GCCGAAGTGC GCGAAGCGGT CGATTTTCTC CACTACTATG CCGGACAGGT GCGGGATGAT
TTCGCTAACG AAACCCACCG TCCATTAGGG CCTGTGGTAT GTATCAGTCC GTGGAACTTC
CCGCTGGCGA TTTTCACCGG ACAGATCGCT GCCGCACTGG CGGCAGGTAA CAGCGTGCTG
GCAAAACCGG CAGAACAAAC GCCGCTGATT GCCGCGCAAG GGATCGCCAT TTTGCTGGAA
GCGGGTGTAC CGCCAGGCGT GGTGCAATTG CTGCCCGGTC GGGGTGAAAC CGTGGGCGCG
CAACTGACGG GTGATGATCG CGTGCGCGGG GTGATGTTTA CCGGTTCAAC CGAAGTCGCT
ACGTTACTGC AGCGCAATAT CGCCAGCCGC CTGGACGCTC AGGGTCGCCC TATTCCGCTC
ATCGCTGAAA CCGGCGGCAT GAACGCGATG ATTGTCGATT CTTCAGCACT GACCGAACAA
GTCGTAATAG ATGTGCTGGC CTCGGCGTTC GACAGTGCAG GTCAGCGTTG TTCGGCACTG
CGCGTGCTGT GCCTGCAAGA TGAGATTGCC GACCACACGC TGAAAATGCT GCGCGGCGCA
ATGGCCGAAT GCCGGATGGG TAATCCGGGT CGCCTGACCA CCGATATCGG TCCGGTGATT
GATAGCGAAG CGAAAGCCAA TATCGAGCGT CATATTCAGA CCATGCGTAG TAAAGGCCGT
CAGGTGTTCC AGGCGGTGCG GGAAAACAGC GAAGATACCC GTGAATGGCA AAGCGGCACC
TTTGTCGCTC CGACGCTGAT CGAACTGGAT GACTTTGCCG AATTACAAAA AGAGGTCTTT
GGTCCGGTGC TGCATGTGGT GCGTTACAAC CGTAACCAGC TGCCGGCGCT GATCGAGCAG
ATTAACGCTT CCGGTTATGG TCTGACGCTT GGCGTCCATA CGCGCATTGA TGAAACCATC
GCCCAGGTCA CTGGCTCGGC GCATGTCGGT AATCTGTACG TTAACCGTAA TATGGTGGGC
GCAGTGGTTG GGGTGCAACC ATTCGGCGGC GAAGGGTTGT CCGGCACCGG GCCGAAAGCA
GGCGGTCCGC TCTATCTCTA CCGTCTGCTG GCGAATCGCC CAGAAAGTGC GCTGGCAGTG
ACGCTCGCGC GTCAGGACGC AGAGTATCCG GTCGATGCGC AGTTGAAAGC CGCATTGACT
CAGCCGCTAA ATGCACTGCG GGAATGGGCG GCAAACCGTC CAGAATTGCA GGCGTTATGT
ACGCAATATG GCGAGCTGGC GCAGGCAGGA ACACAACGAT TGCTGCCAGG GCCGACGGGT
GAACGCAACA CCTGGACGCT GCTGCCGCGT GAGCGCGTAT TGTGTATCGC GGATGATGAA
CAGGATGCGC TGACTCAGCT CGCCGCCGTG CTGGCGGTGG GCAGCCAGGT ACTGTGGCCG
GATGACACGC TGCATCATCA GTTAGTGAAG GCATTGCCAT CGGCAGTCAG CGAACGTATT
CAACTGGCGA AAGCGGAAAA TATAACCGCT CAACCCTTTG ATGCGGTGAT CTTCCACGGT
GATTCGGATC AGCTTCGCGC ATTGTGTGAA GCCGTTGCCG CGCGGGATGG TGCAATTGTT
TCGGTGCAGG GTTTTGCCCG TGGCGAAAGC AATATCCTTC TGGAACGGCT GTATATCGAG
CGTTCGCTGA GTGTGAATAC CGCTGCCGCT GGCGGTAACG CCAGTTTAAT GACGATCGGT
TAA
 
Protein sequence
MGTTTMGVKL DDATRERIKS AATRIDRTPH WLIKQAIFSY LEQLENSDTL PELPALLSGA 
ANESDEAPTP AEEPHQPFLD FAEQILPQSV SRAAITAAYR RPETEAVSML LEQARLPQPV
AEQAHKLAYQ LADKLRNQKN ASGRAGMVQG LLQEFSLSSQ EGVALMCLAE ALLRIPDKAT
RDALIRDKIS NGNWQSHIGR SPSLFVNAAT WGLLFTGKLV STHNEASLSR SLNRIIGKSG
EPLIRKGVDM AMRLMGEQFV TGETIAEALA NARKLEEKGF RYSYDMLGEA ALTAADAQAY
MVSYQQAIHA IGKASNGRGI YEGPGISIKL SALHPRYSRA QYDRVMEELY PRLKSLTLLA
RQYDIGINID AEEADRLEIS LDLLEKLCFE PELAGWNGIG FVIQAYQKRC PLVIDYLIDL
ATRSRRRLMI RLVKGAYWDS EIKRAQMDGL EGYPVYTRKV YTDVSYLACA KKLLAVPNLI
YPQFATHNAH TLAAIYQLAG QNYYPGQYEF QCLHGMGEPL YEQVTGKVAD GKLNRPCRIY
APVGTHETLL AYLVRRLLEN GANTSFVNRI ADTSLPLDEL VADPVTAVEK LAQQEGQTGL
PHPKIPLPRD LYGHGRDNSA GLDLANEHRL ASLSSALLNS ALQKWQSLPM LEQPVAAGEM
SPVINPAEPK DIVGYVREAT PREVEQALES AVNNAPIWFA TPPAERAAIL HRAAVLMESQ
MQQLIGILVR EAGKTFSNAI AEVREAVDFL HYYAGQVRDD FANETHRPLG PVVCISPWNF
PLAIFTGQIA AALAAGNSVL AKPAEQTPLI AAQGIAILLE AGVPPGVVQL LPGRGETVGA
QLTGDDRVRG VMFTGSTEVA TLLQRNIASR LDAQGRPIPL IAETGGMNAM IVDSSALTEQ
VVIDVLASAF DSAGQRCSAL RVLCLQDEIA DHTLKMLRGA MAECRMGNPG RLTTDIGPVI
DSEAKANIER HIQTMRSKGR QVFQAVRENS EDTREWQSGT FVAPTLIELD DFAELQKEVF
GPVLHVVRYN RNQLPALIEQ INASGYGLTL GVHTRIDETI AQVTGSAHVG NLYVNRNMVG
AVVGVQPFGG EGLSGTGPKA GGPLYLYRLL ANRPESALAV TLARQDAEYP VDAQLKAALT
QPLNALREWA ANRPELQALC TQYGELAQAG TQRLLPGPTG ERNTWTLLPR ERVLCIADDE
QDALTQLAAV LAVGSQVLWP DDTLHHQLVK ALPSAVSERI QLAKAENITA QPFDAVIFHG
DSDQLRALCE AVAARDGAIV SVQGFARGES NILLERLYIE RSLSVNTAAA GGNASLMTIG