Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1801 |
Symbol | |
ID | 6145443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1820406 |
End bp | 1821803 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616677 |
Product | hypothetical protein |
Protein accession | YP_001743855 |
Protein GI | 170682550 |
COG category | [R] General function prediction only |
COG ID | [COG3106] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.873662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG CGCCTCGCCG TAACCGGACT TAGCCGCAGC GGCAAAACGG CGTTTATCAC TGCGATGGTC AATCAGTTGC TCAATATTCA CGCCGGAGCA AGGTTGCCGC TGTTAAGCGC GGTGCGTGAA GAACGGCTGC TGGGCGTAAA ACGCATTCCC CAGCGTGACT TTGGCATTCC GCGCTTCACC TATGATGAAG GGCTGGCGCA GTTATATGGC GATCCACCCG CCTGGCCAAC GCCAACGCGC GGCGTCAGCG AAATTCGCCT GGCGCTACGT TTTAAATCGA ACGATTCGCT GCTACGCCAC TTTAAGGATA CCTCCACGCT GTATCTGGAG ATTGTGGATT ATCCCGGCGA ATGGTTGCTC GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCGC GCCAGATGAC GGGCTTACTT AATGGTCAGC GCGGCGAATG GTCGGCGAAA TGGCGAATGA TGTGCGAAGG GCTGGACCCG CTAGCGCCTG CCGACGAAAA CCGGCTGGCA GACATTGCCG CCGCCTGGAC CGAGTATCTC CACCACTGCA AACAGCAGGG GTTGCACTTT ATTCAGCCTG GGCGCTTTGT CTTGCCGGGA GATATGGCAG GAGCACCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TGCCTGGGGC GAGTCCAAAC TGGCGCAGGC CGATAAGCAT ACCAATGCCG GAATGCTGCG CGAGCGGTTT AATTATTACT GCGAGAAAGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC CGCCAGATTG TGCTGGTGGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT GATATGCGTC TGGCGCTGAC GCAGCTGATG CAAAGTTTCC ACTACGGGCA GCGAACCCTG TTCCGGCGTT TGTTTTCACC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCAGAC CATGTGACCA TCGATCAGCA CGCCAATATG GTTTCATTAC TGCAACAACT GATTCAGGAT GCCTGGCAAA ATGCGGCGTT CGAAGGGATC AGCATGGACT GCCTGGGGCT GGCGTCAGTT CAGGCGACCA CCAGCGGCAT TATTGATGTT AACGGTGAGA AAATCCCGGC GCTGCGCGGT AATCGACTCA GCGATGGCGC ACCGCTCACT GTTTATCCTG GCGAAGTTCC CGCTCGTTTG CCCGGTCAGG CGTTCTGGGA TAAGCAAGGC TTCCAGTTTG AGGCGTTTCG TCCGCAGGTG ATGGATGTTG ACAAACCGCT GCCGCATATT CGCCTTGATG CTGCGCTGGA ATTTTTAATA GGAGATAAAT TGCGATGA
|
Protein sequence | MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR FKSNDSLLRH FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSAK WRMMCEGLDP LAPADENRLA DIAAAWTEYL HHCKQQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDAWG ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR
|
| |