Gene Information |
Locus tag | EcSMS35_3343 |
Symbol | |
ID | 6146458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3418482 |
End bp | 3420143 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618172 |
Product | SPFH domain-containing protein/band 7 family protein |
Protein accession | YP_001745322 |
Protein GI | 170681594 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
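The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3418482
END_BP = 3420143
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("coding codons (aa):", coding_codons(GENE_LENGTH_BP))
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.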
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.201295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGATA TTGTTAATTC TGTGCCCTCC TGGATGTTTA CCGCGATTAT TGCCGTATGC ATTCTGTTTA TTATTGGAAT TATTTTCGCC AGGCTCTATC GTCGCGCTTC GGCAGAGCAA GCTTTTGTTC GTACTGGTTT AGGTGGGCAA AAAGTGGTAA TGAGCGGTGG CGCAATCGTG ATGCCGATCT TCCATGAAAT AATCCCCATC AATATGAATA CTCTGAAGCT GGAAGTCAGC CGCTCAACCA TTGATAGCCT GATTACGAAA GATCGTATGC GCGTCGATGT CGTTGTCGCT TTCTTTGTGC GGGTAAAACC TTCAGTAGAA GGGATCGCCA CCGCTGCCCA GACGCTGGGG CAACGCACCC TGTCGCCTGA AGACTTACGT ATGTTGGTTG AAGATAAATT TGTCGATGCC CTCCGTGCAA CAGCTGCGCA AATGACCATG CATGAGTTAC AGGATACCCG CGAGAACTTC GTGCAGGGCG TACAAAATAC CGTTGCTGAA GATCTGTCGA AAAACGGCCT GGAACTGGAA AGCGTTTCAC TTACCAACTT TAACCAGACG TCGAAAGAAC ATTTCAATCC TAACAACGCC TTTGACGCCG AAGGTTTAAC CAAGCTGACG CAGGAGACGG AGCGCCGTCG TCGCGAACGT AACGAAGTTG AACAGGATGT AGAAGTTGCG GTGCGTGAGA AAAACCGTGA TGCACTTTCG CGCAAGTTGG AGATTGAACA ACAAGAAGCG TTTATGACGC TTGAGCAGGA GCAGCAGGTT AAAACCCGTA CCGCTGAGCA GAATGCGAAA ATTGCGGCTT TTGAAGCTGA ACGTCGTCGT GAAGCAGAGC AGACGCGAAT TCTGGCTGAA CGACAGATTC AGGAAACAGA AATCGACCGC GAACAGGCCG TCCGCTCAAG AAAAGTTGAA GCTGAACGTG AAGTTCGCAT TAAAGAGATC GAACAACAGC AGGTCACCGA AATCGCCAAC CAGACGAAAT CGATCGCTAT TGCCGCCAAA TCGGAACAGC AGTCACAAGC GGAAGCGCGT GCCAACCTTG CACTTGCAGA AGCGGTAAGC GCCCAGCAAA ACGTAGAAAC CACTCGCCAG ACCGCAGAAG CCGATCGTGC TAAACAAGTT GCCCTAATCG CTGCCGCGCA GGATGCAGAA ACCAAAGCGG TTGAACTGAC CGTGCGGGCG AAAGCAGAGA AAGAAGCCGC AGAGATGCAG GCGGCGGCGA TCGTTGAGTT AGCCGAAGCA ACACGCAAAA AAGGCCTGGC GGAAGCAGAA GCGCAACGTG CGCTGAACGA CGCTATCAAC GTACTTTCTG ACGAGCAAAC CAGCCTTAAA TTCAAACTGG CGCTATTACA GTCGTTACCT GCAGTAATAG AGAAATCCGT TGAGCCGATG AAGTCCATTG ACGGCATTAA GATTATTCAG GTCGATGGAT TAAACCGAGG TGGTGCTGCG GGGGATGCGG CATCAGGCAG CGTTAGTGGC GGAAACCTGG CAGAACAGGC ATTGTCTGCC GCCCTTTCTT ACCGCACACA GGCACCGCTG ATTGACTCCT TGCTCAATGA AATTGGCGTT TCAGGCGGCT CACTGGCTGC ATTGACTTCA CCCTTAACCT CAACAACTCC CGTCGCCGAA AACGTAGAAT AA
|
Protein sequence | MDDIVNSVPS WMFTAIIAVC ILFIIGIIFA RLYRRASAEQ AFVRTGLGGQ KVVMSGGAIV MPIFHEIIPI NMNTLKLEVS RSTIDSLITK DRMRVDVVVA FFVRVKPSVE GIATAAQTLG QRTLSPEDLR MLVEDKFVDA LRATAAQMTM HELQDTRENF VQGVQNTVAE DLSKNGLELE SVSLTNFNQT SKEHFNPNNA FDAEGLTKLT QETERRRRER NEVEQDVEVA VREKNRDALS RKLEIEQQEA FMTLEQEQQV KTRTAEQNAK IAAFEAERRR EAEQTRILAE RQIQETEIDR EQAVRSRKVE AEREVRIKEI EQQQVTEIAN QTKSIAIAAK SEQQSQAEAR ANLALAEAVS AQQNVETTRQ TAEADRAKQV ALIAAAQDAE TKAVELTVRA KAEKEAAEMQ AAAIVELAEA TRKKGLAEAE AQRALNDAIN VLSDEQTSLK FKLALLQSLP AVIEKSVEPM KSIDGIKIIQ VDGLNRGGAA GDAASGSVSG GNLAEQALSA ALSYRTQAPL IDSLLNEIGV SGGSLAALTS PLTSTTPVAE NVE
|
| |
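The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned, summarised, and translated with the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in the start codons it permits). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the sketch uses only the Python standard library.

```python
# Codon assignments shared by the standard code and translation table 11.
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate complete codons from the first base up to the first stop codon.

    Assumption: the sequence starts with ATG, so the alternative start
    codons allowed by table 11 need no special handling.
    """
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the Gene sequence field, in its block formatting.
    print(translate("ATGGATGATA TTGTTAATTC T"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is one way to check the two fields against each other. `gc_fraction` can be compared with the GC content field in the same way.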