Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2621 |
Symbol | |
ID | 6143209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2678124 |
End bp | 2680139 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617492 |
Product | hypothetical protein |
Protein accession | YP_001744657 |
Protein GI | 170683380 |
COG category | [R] General function prediction only |
COG ID | [COG1444] Predicted P-loop ATPase fused to an acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC CGCTTGCTGG TGTTGAGCGG GGAAGAGGGT TGGTGTTTTG ATCATGCGCT TAAGTTGCGT GATGCCTTAC CTGGCGACTG GCTGTGGATT TCGCCGCAGC CAGATGCTGA AAACCACTGT TCTCCCTCGG CACTACAAAC TTTACTTGGG CGCGAGTTCC GGCATGCGGT ATTCGACGCC CGCCACGGCT TTGATGCCGC TGCCTTTGCG GCACTTAGCG GAACGTTGAA AGCGGGAAGC TGGCTAGTGT TGTTACTCCC TGTATGGGAT GAGTGGGAAA ACCAACCTGA TGCCGACTCG CTGCGCTGGA GTGATTGCCC TGACCCTATT GCGACGCCGC ATTTTGTCCA GCATTTCAAA CGCGTACTTA CGGCGAATAA CGACGCTATC CTCTGGCGGC AAAACCAGCC GTTCTCGTTG GCGCATTTTA CTCCCCGTAC TGACTGGCAC CCCGCGACCG GCGCACCACA GCCAGAACAA CAGCAACTCT TACAGCAGCT ACTGACCATG CCATTGGGCG TGGCGGTGGT AACGGCTGCG CGTGGGCGCG GTAAATCGGC GCTGGCAGGG CAACTCATTT CTCGTATTGC GGGTAGTGCG ATTGTCACCG CGCCCGCAAA AGCGGCAACG TATGTACTGG CACAATTTGC GGGCGAGAAG TTTCGCTTTA TTGCACCGGA TGCCTTGTTA GCCAGCGATG AGCAAGCCGA CTGGCTGGTG GTCGATGAAG CCGCAGCCAT ACCTGCGCCG TTGTTGCATC AACTGGTATC GCGTTTTCCT CGAACGTTGT TAACCACTAC GGTGCAGGGC TACGAAGGCA CCGGACGTGG TTTTTTGCTG AAATTTTGCG CTCGCTTTCC GCATTTACAC CGTTTTGAAC TGCAACAACC GATCCGCTGG GCGCAGGGAT GCCCGCTGGA GAAAATGGTC AGCGAGGCAC TGGTGTTTGA CGATGAAAAC TTCACTCACG AACCACAAGG TGACATCGTC ATTTCTGCTT TTGAACAGAC GTTATGGCGA AGCGAACCAG AAACGCCGTT AAAGGTATAT CAGCTATTGT CTGGCGCGCA CTACCGGACC TCGCCGCTGG ATTTACGCCG GATGATGGAT GCACCAGGGC AACATTTTTT ACAGGCGGCT GGCGAAAACG AGATTGCCGG AGCGCTGTGG CTGGTGGATG AGGGGGGATT ATCTCAACAA CTCAGTCAGG CGGTATGGGC AGGTTTTCGT CGCCCTCGGG GTAATCTGGT GGCCCAGTCG CTGGCGGCGC ACGGCAGCAA TCCACTGGCA GCGACATTGC GTGGACGGCG GGTAAGCCGG ATAGCAGTTC ATCCGGCGCG TCAGCGGGAA GGCACAGGGC GGCAACTTAT TGCTGGTGCT TTGCAATATA CGCATGACCT CGACTATCTT TCGGTGAGTT TTGGTTACAC CGGGGAGTTA TGGCGTTTCT GGCAACGCTG CGGTTTTGTG CTGGTGCGAA TGGGTAATCA TCGTGAAGCC AGCAGCGGTT GCTATACGGC GATGGCACTG TTACCGATGA GTGATGCGGG TAAACAGCTG GCTGAACGTG AGCATTACCG TTTACGTCGC GATGCGCAAG CTCTCGCGCA GTGGAATGGC GAAATGCTTC CCGTTGATCC ACTAAACGAT GCCGTCCTTT CTGACGACGA CTGGCTTGAA CTGGCCGGTT TTGCTTTCGC TCATCGTCCG CTATTAACGT CGTTAGGTTG CTTAATGCGT CTGTTACAAA CCAGCGAAAT GGCATTACCG GCGCTGCGTG GGCGTTTACA GAAAAACGCC AGTGACGCGC AGTTATGTAC CACACTTAAA CTTTCAGGCC GTAAGCTGTT ACTGGTCCGT CAGCGGGAAG AGGCCGCGCA GGCGCTATAC GCACTTGATG ATGTTCGCAC TGAGCGTTTG CGCGATCGCA TAACGCAATG GCAATTTTTT CACTGA
|
Protein sequence | MAELTALHTL TAQMKREGIR RLLVLSGEEG WCFDHALKLR DALPGDWLWI SPQPDAENHC SPSALQTLLG REFRHAVFDA RHGFDAAAFA ALSGTLKAGS WLVLLLPVWD EWENQPDADS LRWSDCPDPI ATPHFVQHFK RVLTANNDAI LWRQNQPFSL AHFTPRTDWH PATGAPQPEQ QQLLQQLLTM PLGVAVVTAA RGRGKSALAG QLISRIAGSA IVTAPAKAAT YVLAQFAGEK FRFIAPDALL ASDEQADWLV VDEAAAIPAP LLHQLVSRFP RTLLTTTVQG YEGTGRGFLL KFCARFPHLH RFELQQPIRW AQGCPLEKMV SEALVFDDEN FTHEPQGDIV ISAFEQTLWR SEPETPLKVY QLLSGAHYRT SPLDLRRMMD APGQHFLQAA GENEIAGALW LVDEGGLSQQ LSQAVWAGFR RPRGNLVAQS LAAHGSNPLA ATLRGRRVSR IAVHPARQRE GTGRQLIAGA LQYTHDLDYL SVSFGYTGEL WRFWQRCGFV LVRMGNHREA SSGCYTAMAL LPMSDAGKQL AEREHYRLRR DAQALAQWNG EMLPVDPLND AVLSDDDWLE LAGFAFAHRP LLTSLGCLMR LLQTSEMALP ALRGRLQKNA SDAQLCTTLK LSGRKLLLVR QREEAAQALY ALDDVRTERL RDRITQWQFF H
|
| |