Gene Information |
Locus tag | EcSMS35_1407 |
Symbol | |
ID | 6144521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1390072 |
End bp | 1391355 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616285 |
Product | hypothetical protein |
Protein accession | YP_001743465 |
Protein GI | 170679771 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
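The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length counts the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    ASSUMPTION: the CDS length includes the terminal stop codon,
    which does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record: Start bp 1390072, End bp 1391355.
length = gene_length(1390072, 1391355)
print(length)                  # 1284
print(protein_length(length))  # 427
```

Under these assumptions the result agrees with the listed Gene Length of 1284 bp and Protein Length of 427 aa.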
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.955409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGT TTATTGACCG GCGTCTGAAC GGCAAAAACA AAAGCATGGT GAATCGCCAG CGTTTTTTAC GCCGTTATAA AGCGCAAATT AAACAGTCGA TCTCCGAGGC CATTAATAAG CGTTCGGTGA CTGACGTCGA CAGCGGCGAG TCCGTATCCA TTCCCACGGA AGATATTAGC GAACCGATGT TTCATCAGGG GCGTGGCGGT CTGCGCCACC GCGTGCATCC GGGCAATGAC CATTTCGTCC AGAACGACCG AATTGAACGT CCCCAGGGTG GCGGCGGAGG TTCCGGCAGT GGTCAGGGCC AGGCCAGCCA GGATGGTGAA GGTCAGGATG AATTTGTCTT TCAGATTTCG AAAGATGAGT ATCTTGATCT GCTCTTTGAA GATCTGGCCT TACCGAATCT GAAACAAAAC CAACAACGCC AGCTGACCGA ATATAAAACG CATCGTGCGG GTTATACCGC TAACGGCGTT CCGGCCAATA TCAGCGTTGT GCGTTCATTG CAGAACTCAC TGGCACGACG CACAGCCATG ACGGCGGGCA AGCGGCGGGA ACTTCATGCG CTGGAAGAGC ATTTGGCCAT CATCAGCAAC AGTGAACCTG CGCAACTGCT GGAAGAGGAA CGTCTGCGCA AAGAAATTGC AGAATTACGT GCCAAAATTG AACGCGTCCC TTTCATTGAC ACCTTCGATT TACGTTACAA GAACTACGAA AAACGGCCCG ATCCCTCCAG CCAGGCAGTG ATGTTTTGCC TGATGGACGT TTCGGGTTCA ATGGATCAGT CCACTAAAGA TATGGCTAAG CGTTTTTATA TTCTGCTGTA TCTGTTCCTT AGCAGAACGT ATAAGAACGT GGAAGTCGTT TACATCCGCC ACCATACTCA GGCGAAAGAA GTCGATGAAC ATGAGTTTTT CTACTCGCAG GAAACTGGCG GCACCATTGT TTCCAGCGCC CTGAAACTGA TGGATGAGGT AGTGAAAGAG CGTTATAACC CGGCACAGTG GAATATTTAC GCAGCCCAAG CATCAGACGG CGATAACTGG GCCGATGACT CTCCGCTTTG CCATGAAATT CTGGCGAAAA AAATATTACC CGTCGTTCGT TATTACAGCT ATATCGAAAT TACACGTCGC GCGCATCAGA CACTGTGGCG AGAATATGAG CATCTGCAAT CTACTTTCGA CAACTTTGCG ATGCAGCATA TCCGCGACCA GGATGATATT TATCCGGTGT TCCGTGAACT GTTTCATAAA CAAAATGCAA CAGCTAAAGA CTAA
|
Protein sequence | MTWFIDRRLN GKNKSMVNRQ RFLRRYKAQI KQSISEAINK RSVTDVDSGE SVSIPTEDIS EPMFHQGRGG LRHRVHPGND HFVQNDRIER PQGGGGGSGS GQGQASQDGE GQDEFVFQIS KDEYLDLLFE DLALPNLKQN QQRQLTEYKT HRAGYTANGV PANISVVRSL QNSLARRTAM TAGKRRELHA LEEHLAIISN SEPAQLLEEE RLRKEIAELR AKIERVPFID TFDLRYKNYE KRPDPSSQAV MFCLMDVSGS MDQSTKDMAK RFYILLYLFL SRTYKNVEVV YIRHHTQAKE VDEHEFFYSQ ETGGTIVSSA LKLMDEVVKE RYNPAQWNIY AAQASDGDNW ADDSPLCHEI LAKKILPVVR YYSYIEITRR AHQTLWREYE HLQSTFDNFA MQHIRDQDDI YPVFRELFHK QNATAKD
|
| |
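The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the code below strips that spacing, computes the GC fraction, and translates a fragment of the gene sequence. It assumes the Gene sequence field is given 5' to 3' on the coding strand (its leading ATG and trailing TAA suggest so, even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so a standard codon table is used here. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = "ATGACCTGGT TTATTGACCG GCGTCTGAAC"
print(translate(fragment))    # MTWFIDRRLN
print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first block of the Protein sequence field. Running the same functions on the full gene sequence would allow a comparison against the listed protein sequence and the 49% GC content.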