Gene EcSMS35_3343 details

Gene Information

Locus tag: EcSMS35_3343
Symbol:
ID: 6146458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3418482
End bp: 3420143
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 51%
IMG OID: 641618172
Product: SPFH domain-containing protein/band 7 family protein
Protein accession: YP_001745322
Protein GI: 170681594
COG category: [S] Function unknown
COG ID: [COG2268] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
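
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3418482
end_bp = 3420143
gene_length_bp = 1662
protein_length_aa = 553

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons = span // 3

print(span == gene_length_bp)           # coordinates agree with the stated gene length
print(span % 3 == 0)                    # length is a multiple of three
print(codons - 1 == protein_length_aa)  # codons minus the stop give the protein length
```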


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.201295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGATA TTGTTAATTC TGTGCCCTCC TGGATGTTTA CCGCGATTAT TGCCGTATGC 
ATTCTGTTTA TTATTGGAAT TATTTTCGCC AGGCTCTATC GTCGCGCTTC GGCAGAGCAA
GCTTTTGTTC GTACTGGTTT AGGTGGGCAA AAAGTGGTAA TGAGCGGTGG CGCAATCGTG
ATGCCGATCT TCCATGAAAT AATCCCCATC AATATGAATA CTCTGAAGCT GGAAGTCAGC
CGCTCAACCA TTGATAGCCT GATTACGAAA GATCGTATGC GCGTCGATGT CGTTGTCGCT
TTCTTTGTGC GGGTAAAACC TTCAGTAGAA GGGATCGCCA CCGCTGCCCA GACGCTGGGG
CAACGCACCC TGTCGCCTGA AGACTTACGT ATGTTGGTTG AAGATAAATT TGTCGATGCC
CTCCGTGCAA CAGCTGCGCA AATGACCATG CATGAGTTAC AGGATACCCG CGAGAACTTC
GTGCAGGGCG TACAAAATAC CGTTGCTGAA GATCTGTCGA AAAACGGCCT GGAACTGGAA
AGCGTTTCAC TTACCAACTT TAACCAGACG TCGAAAGAAC ATTTCAATCC TAACAACGCC
TTTGACGCCG AAGGTTTAAC CAAGCTGACG CAGGAGACGG AGCGCCGTCG TCGCGAACGT
AACGAAGTTG AACAGGATGT AGAAGTTGCG GTGCGTGAGA AAAACCGTGA TGCACTTTCG
CGCAAGTTGG AGATTGAACA ACAAGAAGCG TTTATGACGC TTGAGCAGGA GCAGCAGGTT
AAAACCCGTA CCGCTGAGCA GAATGCGAAA ATTGCGGCTT TTGAAGCTGA ACGTCGTCGT
GAAGCAGAGC AGACGCGAAT TCTGGCTGAA CGACAGATTC AGGAAACAGA AATCGACCGC
GAACAGGCCG TCCGCTCAAG AAAAGTTGAA GCTGAACGTG AAGTTCGCAT TAAAGAGATC
GAACAACAGC AGGTCACCGA AATCGCCAAC CAGACGAAAT CGATCGCTAT TGCCGCCAAA
TCGGAACAGC AGTCACAAGC GGAAGCGCGT GCCAACCTTG CACTTGCAGA AGCGGTAAGC
GCCCAGCAAA ACGTAGAAAC CACTCGCCAG ACCGCAGAAG CCGATCGTGC TAAACAAGTT
GCCCTAATCG CTGCCGCGCA GGATGCAGAA ACCAAAGCGG TTGAACTGAC CGTGCGGGCG
AAAGCAGAGA AAGAAGCCGC AGAGATGCAG GCGGCGGCGA TCGTTGAGTT AGCCGAAGCA
ACACGCAAAA AAGGCCTGGC GGAAGCAGAA GCGCAACGTG CGCTGAACGA CGCTATCAAC
GTACTTTCTG ACGAGCAAAC CAGCCTTAAA TTCAAACTGG CGCTATTACA GTCGTTACCT
GCAGTAATAG AGAAATCCGT TGAGCCGATG AAGTCCATTG ACGGCATTAA GATTATTCAG
GTCGATGGAT TAAACCGAGG TGGTGCTGCG GGGGATGCGG CATCAGGCAG CGTTAGTGGC
GGAAACCTGG CAGAACAGGC ATTGTCTGCC GCCCTTTCTT ACCGCACACA GGCACCGCTG
ATTGACTCCT TGCTCAATGA AATTGGCGTT TCAGGCGGCT CACTGGCTGC ATTGACTTCA
CCCTTAACCT CAACAACTCC CGTCGCCGAA AACGTAGAAT AA
 
Protein sequence
MDDIVNSVPS WMFTAIIAVC ILFIIGIIFA RLYRRASAEQ AFVRTGLGGQ KVVMSGGAIV 
MPIFHEIIPI NMNTLKLEVS RSTIDSLITK DRMRVDVVVA FFVRVKPSVE GIATAAQTLG
QRTLSPEDLR MLVEDKFVDA LRATAAQMTM HELQDTRENF VQGVQNTVAE DLSKNGLELE
SVSLTNFNQT SKEHFNPNNA FDAEGLTKLT QETERRRRER NEVEQDVEVA VREKNRDALS
RKLEIEQQEA FMTLEQEQQV KTRTAEQNAK IAAFEAERRR EAEQTRILAE RQIQETEIDR
EQAVRSRKVE AEREVRIKEI EQQQVTEIAN QTKSIAIAAK SEQQSQAEAR ANLALAEAVS
AQQNVETTRQ TAEADRAKQV ALIAAAQDAE TKAVELTVRA KAEKEAAEMQ AAAIVELAEA
TRKKGLAEAE AQRALNDAIN VLSDEQTSLK FKLALLQSLP AVIEKSVEPM KSIDGIKIIQ
VDGLNRGGAA GDAASGSVSG GNLAEQALSA ALSYRTQAPL IDSLLNEIGV SGGSLAALTS
PLTSTTPVAE NVE
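
The relationship between the two sequences above can be spot-checked by translating the gene sequence codon by codon. This is a minimal sketch, not the tool that produced the record: `translate` is a hypothetical helper, and it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: only the amino acid assignments matter here; alternative start
# codons allowed by translation table 11 are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from this record.
first_row = "ATGGATGATA TTGTTAATTC TGTGCCCTCC TGGATGTTTA CCGCGATTAT TGCCGTATGC"
print(translate(first_row))  # MDDIVNSVPSWMFTAIIAVC
```

The printed residues correspond to the first two 10-residue blocks of the protein sequence. The same function can be applied to the full gene sequence once its rows are joined.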