Gene EcSMS35_B0007 details

Gene Information

Locus tag: EcSMS35_B0007
Symbol: cea
ID: 6106605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010485
Strand:
Start bp: 4330
End bp: 5895
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 43%
IMG OID: 641614739
Product: colicin-E1 protein
Protein accession: YP_001739880
Protein GI: 170650746
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG1196] Chromosome segregation ATPases
TIGRFAM ID:
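
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4330, 5895)
print(length)                     # 1566
print(protein_length_aa(length))  # 521
```

Both values agree with the Gene Length and Protein Length fields of this record.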


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.542919
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACAG CTGTAGCGTA CTATAAAGAT GGTGTTCCTT ATGATGATAA GGGGCAGGTA 
ATCATTACTC TTTTGAATGG TAATCCAGAC GGGAGTGGCT CTGGCGGCGG TGGTGGAACT
GGAGGTAGCA AAAGTGAAAG TTCTGCAGCC ATTCATGCCA CAGCTAAATG GTCTACTGCT
CAATTGAAGA AAACGCAGGC AGAACAGGCT GCCCGGGCAA AAGCTGCCGC AGAAGCACAG
GCTAAAGCAA AAGCAAACCG GGATGCGCTG ACTCAACATC TGAAGGATAT TGTGAATGAG
GCGCTTCGTC ATAATTCCAC TCATCCGGAG GTTATTGACC TGGCTCATGC CAATAATGCA
GCGATGCAGG CAGAAGCAGA GCGGTTGCGC CTTGCAAAAG CAGAAGAAAA AGCCCGTAAA
GAAGCGGAAG CTGCGGAAAA GGCTTTTCAG GAGGCAGAAC AACGACGTAA AGAGATAGAG
AAGGAGCAGG CTGAAACAGA ACGCCAGTTG AAACTGGCTG AAGCTGAAGA GAAACGGCTG
GCAGCATTGA ATGAAGAGGC CCGGGCAGTG GAGGTGGCAC AAAAAAATCT TGCTGCTGCA
CAATCTGAGC TGGCGAAAGT GGATGAAGAG ATTAAGACTC TCAATACCCG TTTAAGCTCC
AGTATTCATG CCCGTGATGC AGAAATGAAT ACGCTGTCCG GAAAACGAAA TGAGCTGGCT
CAGGCATCTG CTAAATATAA GGAACTTGAT GAACTGGTCA AAAAATTGTC ACCAAGAGCT
AATGATCCGC TTCAGAGTCG TCCTTTTTTC TATGCCACCA GTCGACGGAC GGGAGCCGGT
AAGATTATGG AGGAAAAACA AAAACAGGTA ACAGCATCAG AAACGCGTAT TAACCAACTT
AATGCTGAGA TAAATGGAAT TCAGGGGGCT ATGTCTCAGG CTAATAATAA TCGTAATACA
GCTGTTTTAC GTGTTCATGA ATCTGAAGAA AATTTGAAAA CAGCGCAGAC TAATCTCCTG
AACTCGCAGA TTAAGGATGC TGTGGATGCA ACAGTTAGCT TTTATCAAAC GCTATCTGAA
AAATATGGTG AAAAATATTC AAAAATGGCA CAGGAACTTG CTGATAAGTC TAAAGGTAAG
AAAATCAGCA ATGTGAATGA AGCTCTAGCT GCTTTTGAAA AATACAAGGA TGTTTTAAAT
AAGAAATTCA GCAAAGCAGA CCGTGATGCG ATTTTCAATG CACTGGAATC GGTTAAGTAT
GAAGACTGGG CTAAGCATTT AGATCAGTTT GCTAAGTACT TGAAGATTAC GGGACATGTT
TCTTTTGGAT ATGATGTGGT ATCTGATATC CTAAAAATTA AGGATACAGG TGACTGGAAG
CCACTATTTC TTACATTAGA GAAGAAAACT GTAGATGCTG GAGTTAGTTA TATTGTTGTT
TTACTTTTTA GTGTGCTTGC TGGAACTACA TTAGGTATCT GGGGGATTGC TATTGTTACA
GGCATTCTAT GTGCCTTTAT TGACAAGAAT AAACTTAATA CTATAAATGA TGTGTTGGGT
ATTTAA
 
Protein sequence
METAVAYYKD GVPYDDKGQV IITLLNGNPD GSGSGGGGGT GGSKSESSAA IHATAKWSTA 
QLKKTQAEQA ARAKAAAEAQ AKAKANRDAL TQHLKDIVNE ALRHNSTHPE VIDLAHANNA
AMQAEAERLR LAKAEEKARK EAEAAEKAFQ EAEQRRKEIE KEQAETERQL KLAEAEEKRL
AALNEEARAV EVAQKNLAAA QSELAKVDEE IKTLNTRLSS SIHARDAEMN TLSGKRNELA
QASAKYKELD ELVKKLSPRA NDPLQSRPFF YATSRRTGAG KIMEEKQKQV TASETRINQL
NAEINGIQGA MSQANNNRNT AVLRVHESEE NLKTAQTNLL NSQIKDAVDA TVSFYQTLSE
KYGEKYSKMA QELADKSKGK KISNVNEALA AFEKYKDVLN KKFSKADRDA IFNALESVKY
EDWAKHLDQF AKYLKITGHV SFGYDVVSDI LKIKDTGDWK PLFLTLEKKT VDAGVSYIVV
LLFSVLAGTT LGIWGIAIVT GILCAFIDKN KLNTINDVLG I
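
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before they can be compared. The following is a minimal sketch of how the two could be reconciled: it removes the whitespace and translates the DNA codon by codon. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs mainly in the start codons it permits). The names clean_sequence, translate and CODON_TABLE are hypothetical helpers, not a database API.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGGAAACAG CTGTAGCGTA CTATAAAGAT")
print(translate(first_blocks))  # METAVAYYKD
```

The output matches the first block of the protein sequence. Passing the full gene sequence through the same two functions would allow the whole 521-residue translation to be compared against the listed protein.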