Gene Information |
Locus tag | EcSMS35_4791 |
Symbol | |
ID | 6144051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4882937 |
End bp | 4883827 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619598 |
Product | IS1203 transposase orfB |
Protein accession | YP_001746705 |
Protein GI | 170679822 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
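The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes 1-based, inclusive coordinates and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for EcSMS35_4791.
length = gene_length_bp(4882937, 4883827)
print(length)                     # 891
print(protein_length_aa(length))  # 296
```

Both results agree with the Gene Length (891 bp) and Protein Length (296 aa) fields of the record.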
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.2388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCCAC TGCTGGATAA GCTGCGTGAG CAGTACGGGG TCGGACCGCT ATGCAGCGAA CTGCATATTG CCCCGTCAAC GTATTACCAC TGTCAGCAAC AGCGACATCA TCCGGATAAA CGCAGTGCCC GTGCGCAGCG CGATGACTGG CTGAAGAAAG AGATACAGCG CGTATACGAT GAAAATCATC AGGTGTACGG TGTGCGTAAA GTCTGGCGTC AGTTGTTACG GGAAGGTATC AGAGTGGCCA GATGCACTGT GGCACGTCTC ATGGCGGTTA TGGGACTTGC CGGTGTTCTC CGGGGTAAAA AGGTCCGTAC GACCATCAGC CGGAAAGCCG TTGCCGCAGG CGACCGCGTA AACCGTCAGT TCGTGGCAGA ACGACCTGAC CAGCTGTGGG TGGCTGATTT TACTTACGTC AGCACATGGC GGGGCTTCGT CTATGTGGCG TTCATCATTG ATGTGTTTGC CGGATACATC GTGGGGTGGC GGGTCTCATC GTCTATGGAA ACGACATTCG TGCTGGATGC TCTGGAGCAG GCGTTATGGG CCCGTCAACC GTCCGGCACA GTCCATCACA GTGATAAAGG TTCTCAGTAT GTATCGCTGG CCTACACACA GCGGCTTAAG GAAGCCGGAT TACTGGCATC AACAGGAAGT ACTGGTGACT CGTATGACAA CGCGATGGCG GAGAGCATCA ATGGCCTTTA CAAAGCGGAG GTAATACACC GTAAGAGCTG GAAAAACCGG ACAGAAGTGG AGCTGGCCAC ACTCACGTGG GTGGACTGGT ATAACAATCG ACGATTGCTG GAAAGGCTGG GCCATATCCC ACCGGCAGAA GCAGAAAAAG CTTATTATGC TTCCATCGGA AACGATGATC TGGCAGCCTG A
|
Protein sequence | MMPLLDKLRE QYGVGPLCSE LHIAPSTYYH CQQQRHHPDK RSARAQRDDW LKKEIQRVYD ENHQVYGVRK VWRQLLREGI RVARCTVARL MAVMGLAGVL RGKKVRTTIS RKAVAAGDRV NRQFVAERPD QLWVADFTYV STWRGFVYVA FIIDVFAGYI VGWRVSSSME TTFVLDALEQ ALWARQPSGT VHHSDKGSQY VSLAYTQRLK EAGLLASTGS TGDSYDNAMA ESINGLYKAE VIHRKSWKNR TEVELATLTW VDWYNNRRLL ERLGHIPPAE AEKAYYASIG NDDLAA
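The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up and summarised is shown below; the helper names are hypothetical, and the example deliberately uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence from the record.
first_blocks = "ATGATGCCAC TGCTGGATAA"
print(clean_sequence(first_blocks))  # ATGATGCCACTGCTGGATAA
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))  # 0.45
```

Applied to the complete Gene sequence field, the same functions could be used to compare the sequence length and GC fraction against the Gene Length and GC content fields of the record.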
|
| |