Gene EcSMS35_1672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1672 
Symbol 
ID6147252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1665652 
End bp1667931 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content50% 
IMG OID641616548 
Productputative oxidoreductase 
Protein accessionYP_001743726 
Protein GI170682379 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01701] oxidoreductase alpha (molybdopterin) subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AAATTGAATC CTACCAGGGG GCTGCCGGTG GTTGGGGTGC TGTTAAATCC 
GTAGCGAATG CAGTACGTAA GCAGATGGAT ATACGCCAGG ATGTTATTGC CATGTTTGAC
ATGAATAAGC CAGAGGGCTT TGACTGTCCG GGTTGTGCAT GGCCAGATCC TAAGCACAGT
GCGTCATTCG ACATTTGTGA AAACGGCGCA AAAGCAATCG CCTGGGAAGT CACGGATAAG
CAGGTAAATG CCTCTTTCTT TGCTGAGAAT ACGGTTCAAT CATTACTTAC CTGGGGAGAT
CACGAGCTTG AGGCTGCGGG GCGTCTCACC CAGCCTTTGA AATATGATGC CGTCAGCGAC
TGTTACAAGC CATTAAGCTG GCAACAAGCT TTCGACGAAA TTGGCGAACG CCTTCAAAGC
TATAGTGATC CCAATCAGGT TGAATTCTAT ACTTCGGGCC GCACTTCCAA TGAAGCTGCC
TTTCTTTATC AGCTTTTTGC CCGTGAATAC GGGAGCAATA ACTTCCCCGA CTGCTCCAAC
ATGTGCCATG AACCGACCAG TGTGGGTTTG GCAGCGAGTA TCGGTGTAGG TAAAGGGACC
GTGTTGCTGG AAGACTTTGA GAAATGCGAT TTAGTCATTT GCATTGGGCA TAACCCTGGT
ACAAACCACC CTCGCATGTT GACTTCGTTG CGCGCTTTAG TGAAACGGGG AGCGAAAATG
ATCGCCATCA ATCCTCTACA GGAACGTGGC CTGGAGCGAT TTACCGCACC GCAAAACCCG
TTTGAAATGC TGACGAACTC TGAGACTCAG TTGGCCAGTG CCTACTACAA CGTGCGCATT
GGTGGTGATA TGGCGTTGCT CAAGGGAATG ATGCGCCTGT TAATTGAGCG CGATGATGCT
GCAAGCGCCG CAGGTCGGCC CTCATTGCTG GATGATGAAT TTATTCAAAC GCATACCGTC
GGCTTTGACG AGCTACGCCG TGACGTGCTC AATTCCGAGT GGAAAGATAT CGAACGTATA
TCTGGACTAA GTCAGACACA AATCGCCGAA CTTGCTGATG CATATGCCGC TGCCGAACGA
ACCATTATCT GTTACGGAAT GGGGATCACT CAGCACGAAC ATGGTACCCA GAACGTACAG
CAACTGGTCA ATCTGCTGTT GATGAAAGGT AACATTGGCA AGCCTGGTGC GGGTATCTGC
CCACTACGAG GTCACTCTAA TGTACAGGGC GACCGAACCG TCGGTATCAC CGAGAAACCG
TCTGCAGAGT TTCTGGATCG CCTGGGTGAG CGCTATGGCT TCACCCCACC TCATGCACCT
GGACATGCTG CAATTGCCAG CATGCAAGCA ATATGTACGG GGCAAGCTCG AGCATTGATC
TGCATGGGGG GCAACTTTGC GCTGGCAATG CCAGATCGGG AAGCGAGCGC TGTACCGTTA
ACGCAATTAG ATTTGGCGGT ACACGTAGCC ACTAAGCTTA ACCGCTCTCA TCTGCTGACC
GCACGGCATA GCTATATTCT GCCGGTTCTG GGACGTAGCG AGATTGACAT GCAAAAAAGC
GGTGCGCAGG CGGTAACCGT TGAAGATTCA ATGTCGATGA TTCATGCCTC GCGTGGTGTG
TTAAAACCCG CCGGTGTAAT GCTGAAATCA GAGTGTGCAG TGGTCGCGGG AATCGCGCAG
GCAGCACTAC CCCAGAGCGT GGTAGCCTGG GAGTATCTGG TGGAAGATTA TGATCGCATT
CGCAATGACA TTGAAGCTGT GCTGCCAGAG TTTGCCGACT ATAACCAGCG CATCCGTCAT
CCTGGTGGTT TTCACCTGAT AAACGCAGCT GCTGAAAGGC GCTGGATGAC GCCGTCAGGT
AAGGCTAATT TCATTACCAG CAAAGGGCTG TTAGAAGATC CCTCTTCAGC GTTTAACAGT
AAACTGGTCA TGGCGACAGT ACGCAGCCAC GATCAGTACA ACACGACGAT TTATGGTATG
GATGATCGCT ATCGAGGTGT ATTCGGTCAA CGAGATGTGG TCTTTATGAG TGCTAAACAA
GCTAAAATTT GCCGTGTAAA AAACGGCGAA AGAGTTAATC TTATTGCACT CACGCCAGAC
GGTAAACGTA GCTCACGACG CATGGATAGA TTAAAAGTGG TCATTTACCC TATGGCTGAC
CGCTCACTGG TGACCTATTT TCCAGAATCG AATCACATGC TAACACTTGA TAACCACGAT
CCATTAAGTG GCATTCCTGG CTATAAAAGT ATTCCTGTTG AATTAGAGCC ATCAAATTAA
 
Protein sequence
MKKKIESYQG AAGGWGAVKS VANAVRKQMD IRQDVIAMFD MNKPEGFDCP GCAWPDPKHS 
ASFDICENGA KAIAWEVTDK QVNASFFAEN TVQSLLTWGD HELEAAGRLT QPLKYDAVSD
CYKPLSWQQA FDEIGERLQS YSDPNQVEFY TSGRTSNEAA FLYQLFAREY GSNNFPDCSN
MCHEPTSVGL AASIGVGKGT VLLEDFEKCD LVICIGHNPG TNHPRMLTSL RALVKRGAKM
IAINPLQERG LERFTAPQNP FEMLTNSETQ LASAYYNVRI GGDMALLKGM MRLLIERDDA
ASAAGRPSLL DDEFIQTHTV GFDELRRDVL NSEWKDIERI SGLSQTQIAE LADAYAAAER
TIICYGMGIT QHEHGTQNVQ QLVNLLLMKG NIGKPGAGIC PLRGHSNVQG DRTVGITEKP
SAEFLDRLGE RYGFTPPHAP GHAAIASMQA ICTGQARALI CMGGNFALAM PDREASAVPL
TQLDLAVHVA TKLNRSHLLT ARHSYILPVL GRSEIDMQKS GAQAVTVEDS MSMIHASRGV
LKPAGVMLKS ECAVVAGIAQ AALPQSVVAW EYLVEDYDRI RNDIEAVLPE FADYNQRIRH
PGGFHLINAA AERRWMTPSG KANFITSKGL LEDPSSAFNS KLVMATVRSH DQYNTTIYGM
DDRYRGVFGQ RDVVFMSAKQ AKICRVKNGE RVNLIALTPD GKRSSRRMDR LKVVIYPMAD
RSLVTYFPES NHMLTLDNHD PLSGIPGYKS IPVELEPSN