Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0142 |
Symbol | |
ID | 6143098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 157021 |
End bp | 157941 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615043 |
Product | ISNCY family transposase |
Protein accession | YP_001742259 |
Protein GI | 170682409 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCAC CGAGTACCAC ACCGCATGAT GCGGTGTTTA AACAATTTTT AATGCATGCG GAAACGGCTC GTGACTTTCT GGATATCCAT TTGCCAGCGG AACTACGCGA ACTGTGTGAC CTCGACACGC TGCATCTTGA GTCGGGGAGT TTTATTGAAG AAAGCCTGAA AGGGCACAGC ACTGACGTGC TCTATTCCGT GCAAATGCAG GGTAATACGG GCTATCTACA TGTTGTAATT GAACACCAAA GCAAGCCGGA CAAAAAAATG GCCTTTCGCA TGATGCGTTA TTCTATTGCT GCCATGCACC GGCATCTGGA GGCAGATCAC GATAAGCTGC CGCTGGTGGT GCCGATTTTG TTTTATCAGG GCGAGGCCAC GCCTTATCCA CTCTCAATGT GCTGGTTTGA TATGTTTTAC TCGCCGGAGC TGGCGCGACG CGTCTATAAC AGTCCTTTCC CGCTGGTGGA TATCACTATC ACACCGGATG ACGAAATCAT GCAACATCGG CGGATTGCGA TTCTCGAACT GCTGCAAAAA CATATTCGCC AACGCGACTT AATGTTATTG CTGGAGCAAC TGGTCACGCT GATAGACGAA GGGTACACTA GCGGAAGTCA GTTAGTTGCC ATGCAAAACT ATATGCTGCA ACGCGGTCAT ACTGAACAAG CGGATTTGTT TTATGGTGTG CTGAGAGACA GGGAAACGGG AGGGGAGTCT ATGATGACGC TGGCGCAGTG GTTTGAAGAG AAGGGAAGAC AGGAGGAAAG GCAGGAGGTA AGACAGGAGG TAATACAAGA GGTTAGACAG GAAGTAAGAC AGGAATTCGC CCTGCGTTTT CTGAGTAAAG GGATGTCTCG GGAAGACGTT GCAGAGATGG CAAATTTACC TCTTGCTGAG GTTGATAAGC TGATTAGCTA A
|
Protein sequence | MDAPSTTPHD AVFKQFLMHA ETARDFLDIH LPAELRELCD LDTLHLESGS FIEESLKGHS TDVLYSVQMQ GNTGYLHVVI EHQSKPDKKM AFRMMRYSIA AMHRHLEADH DKLPLVVPIL FYQGEATPYP LSMCWFDMFY SPELARRVYN SPFPLVDITI TPDDEIMQHR RIAILELLQK HIRQRDLMLL LEQLVTLIDE GYTSGSQLVA MQNYMLQRGH TEQADLFYGV LRDRETGGES MMTLAQWFEE KGRQEERQEV RQEVIQEVRQ EVRQEFALRF LSKGMSREDV AEMANLPLAE VDKLIS
|
| |