Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_A0102 |
Symbol | |
ID | 6106472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | + |
Start bp | 77170 |
End bp | 78057 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641614847 |
Product | IS629 transposase orfB |
Protein accession | YP_001739988 |
Protein GI | 170650767 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0471323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.000157883 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCACTGC TGGATAAGCT GCGTGAGCAG TACGGGGTCG GACCGCTATG CAGCGAACTG CATATTGCCC CGTCAACGTA TTACCACTGT CAGCAACAGC GACATCATCC GGATAAACGC AGTGCCCGTG CGCAGCGCGA TGACTGGCTG AAGAAAGAGA TACAGCGCGT ATACGATGAA AATCACAAGG TATACGGTGT GCGTAAAGTC TGGCGTCAGT TGTTACGGGA AGGTATCAGA GTGGCCAGAT GCACTGTGGC ACGTCTCATG GCGGTTATGG GACTTGCCGG TGTTCTCCGT GGTAAAAAGG TCCGTACGAC CATCAGCCGG AAAGCCGTTG TCGCAGGCGA CCGCGTAAAC CGTCAGTTCG TGGCAGAACG TCCTGACCAG TTGTGGGTGG CTGATTCTAC TTACGTCAGC ACATGGCAGG GGGTCGTCTA TGTGGCGTTC ATCATTGATG TGTTTGCCGG ATACATCGTG GGGTGGCGGG TCTCATCGTC TATGGAAACG ACATTTGTGC TGGATGCTCT GGAGCAGGCG TTATGGGCCC GTCGACCGTC CGGCACAGTC CATCACAGTG ATAAAGGTTC TCAGTATGTA TCGCTGGCCT ACACACAGCG GCTTAAGGAA GCCGGATTAC TGGCATCAAC AGGAAGTACA GGCGACTCGT ATGACAACGC GATGGCGGAG AGCATCAATG GCCTTTACAA AGCGGAGGTA ATACACCGTA AGAGCTGGAA AAACCGTGCA GAAGTGGAAC TGGCCACACT CACGTGGGTG GACTGGTATA ACAATCGACG ATTGCTGGAA AGGCTGGGCC ATACTCCTCC GGCAGAAGCA GAAAAAGCTT ATTATGCTTC CATCGGAAAC GATGATCTGG CAGCCTGA
|
Protein sequence | MPLLDKLREQ YGVGPLCSEL HIAPSTYYHC QQQRHHPDKR SARAQRDDWL KKEIQRVYDE NHKVYGVRKV WRQLLREGIR VARCTVARLM AVMGLAGVLR GKKVRTTISR KAVVAGDRVN RQFVAERPDQ LWVADSTYVS TWQGVVYVAF IIDVFAGYIV GWRVSSSMET TFVLDALEQA LWARRPSGTV HHSDKGSQYV SLAYTQRLKE AGLLASTGST GDSYDNAMAE SINGLYKAEV IHRKSWKNRA EVELATLTWV DWYNNRRLLE RLGHTPPAEA EKAYYASIGN DDLAA
|
| |