Gene EcSMS35_1961 details


Gene Information

Locus tag: EcSMS35_1961
Symbol:
ID: 6145482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1984471
End bp: 1986003
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 50%
IMG OID: 641616837
Product: SpoVR family protein
Protein accession: YP_001744013
Protein GI: 170681824
COG category: [S] Function unknown
COG ID: [COG2719] Uncharacterized conserved protein
TIGRFAM ID:
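
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1984471
end_bp = 1986003

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1533 bp) and Protein Length (510 aa).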


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00693869
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA TCGATTCTAT GAATAAGGAC ACCACACGTT TGAGCGATGG ACCCGACTGG 
ACGTTCGACC TGCTGGATGT TTATCTGGCA GAGATAGACC GGGTGGCGAA ACTCTACCGG
CTGGATACCT ACCCGCACCA GATTGAAGTG ATAACCTCAG AACAGATGAT GGATGCCTAC
TCCAGCGTCG GCATGCCAAT TAACTATCCG CACTGGTCAT TCGGTAAAAA GTTTATCGAG
ACTGAACGGC TGTATAAGCA CGGTCAGCAA GGACTGGCCT ATGAAATCGT CATTAACTCT
AACCCGTGTA TCGCTTACCT GATGGAAGAA AACACCATTA CCATGCAAGC GCTGGTGATG
GCACACGCCT GCTATGGGCA TAACTCTTTC TTCAAAAACA ATTACTTATT CCGTAGCTGG
ACCGACGCCA GTTCGATTGT CGATTATCTG ATTTTTGCCC GTAAATATAT TACCGAGTGC
GAAGAGCGCT ATGGTGTGGA TGAAGTAGAA CGGCTGCTGG ACTCGTGCCA TGCGCTGATG
AACTACGGCG TGGACCGGTA TAAACGCCCA CAAAAAATCT CGCTGCAAGA GGAGAAAGCC
CGGCAGAAAA GCCGCGAAGA GTACTTGCAA AGTCAGGTCA ATATGCTCTG GCGTACCCTG
CCGAAGCGCG AGGAAGAGAA AACGGTTGCT GAAGCGCGCC GCTATCCGTC CGAGCCACAA
GAAAACCTGC TCTATTTTAT GGAGAAAAAT GCGCCACTGC TGGAATCATG GCAGCGTGAA
ATTCTGCGTA TTGTGCGTAA GGTGAGCCAG TATTTTTATC CGCAAAAACA GACTCAGGTG
ATGAACGAAG GCTGGGCGAC CTTCTGGCAC TACACCATCC TTAACCATCT GTATGATGAA
GGGAAAGTAA CGGAACGTTT TATGCTGGAG TTTTTGCACA GCCACACCAA TGTGGTCTTC
CAGCCGCCCT ATAACAGCCC GTGGTACAGC GGCATCAACC CGTATGCCCT CGGGTTCGCC
ATGTTCCAGG ATATTAAACG GATTTGTCAG TCGCCAACGG AAGAAGACAA ATACTGGTTC
CCGGATATCG CCGGTTCTGA CTGGCTGGAC ACATTACATT TTGCGATGCG TGATTTCAAA
GATGAGAGCT TTATCAGTCA GTTCCTGTCA CCGAAAGTGA TGCGTGATTT CCGCTTCTTC
ACCGTGCTGG ACGACGATCG GCATAATTAT CTGGAGATTT CCGCTATTCA TAATGAAGAA
GGTTATCGGG AGATCCGTAA CCGGTTATCG TCGCAATATA ATTTAAGTAA TCTGGAGCCG
AATATTCAGA TCTGGAACGT GGATTTGCGC GGCGACCGTT CGCTGACGCT GCGTTACATT
CCACATAATC GCGCACCGCT GGATCGGGGG CGCAAAGAAG TACTGAAGCA TGTGCATCGC
CTGTGGGGAT TTGATGTGAT GCTGGAACAG CAAAACGAAG ACGGCAGCGT CGAGTTGCTG
GAACGTTGCC CGCCGAGAAT GGGAAATCTG TAA
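
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs to be cleaned before any computation such as a GC content check. The following is a minimal sketch; the helper names clean_sequence and gc_percent are hypothetical and do not come from the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# Example on the first two 10-base blocks of the gene sequence.
first_blocks = clean_sequence("ATGGCGACGA TCGATTCTAT")
print(len(first_blocks), gc_percent(first_blocks))
```

Applying gc_percent to the full cleaned sequence should give a value that rounds to the GC content reported in the record, assuming the record rounds to a whole percent.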
 
Protein sequence
MATIDSMNKD TTRLSDGPDW TFDLLDVYLA EIDRVAKLYR LDTYPHQIEV ITSEQMMDAY 
SSVGMPINYP HWSFGKKFIE TERLYKHGQQ GLAYEIVINS NPCIAYLMEE NTITMQALVM
AHACYGHNSF FKNNYLFRSW TDASSIVDYL IFARKYITEC EERYGVDEVE RLLDSCHALM
NYGVDRYKRP QKISLQEEKA RQKSREEYLQ SQVNMLWRTL PKREEEKTVA EARRYPSEPQ
ENLLYFMEKN APLLESWQRE ILRIVRKVSQ YFYPQKQTQV MNEGWATFWH YTILNHLYDE
GKVTERFMLE FLHSHTNVVF QPPYNSPWYS GINPYALGFA MFQDIKRICQ SPTEEDKYWF
PDIAGSDWLD TLHFAMRDFK DESFISQFLS PKVMRDFRFF TVLDDDRHNY LEISAIHNEE
GYREIRNRLS SQYNLSNLEP NIQIWNVDLR GDRSLTLRYI PHNRAPLDRG RKEVLKHVHR
LWGFDVMLEQ QNEDGSVELL ERCPPRMGNL
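
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard NCBI base ordering (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The function name translate is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order ('*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at a stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence give the first 10 residues.
print(translate("ATGGCGACGA TCGATTCTAT GAATAAGGAC"))  # MATIDSMNKD
```

The same function applied to the last blocks of the gene sequence (GAACGTTGCC CGCCGAGAAT GGGAAATCTG TAA) yields ERCPPRMGNL, matching the end of the protein sequence.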