Gene Information |
Locus tag | EcSMS35_1888 |
Symbol | |
ID | 6145253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1908150 |
End bp | 1909055 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616764 |
Product | IS2 transposase orfB |
Protein accession | YP_001743942 |
Protein GI | 170680938 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
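The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1908150
end_bp = 1909055

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons;
# the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 906 301
```

Both values agree with the Gene Length (906 bp) and Protein Length (301 aa) rows.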
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00494374 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATAGCG CACGCGCCCT TATTGCCCGG GGATGGGGAG TAAGCTTAGT CAGCCGTTGT CTCCGGGTGT CGCGTGCGCA GTTGCACGTA ATTCTCAGAC GAACCGATGA CTGGAAAGAT GGTCGCCGCA GCCGTCACTC AGATGATACG GATGTGCTTC TCCGTATACA CCATGTTATC GGAGAGCTGC CCACGTATGG TTATCGTCGG GTATGGGCGC TGCTTCGCAG ACAGGCCGAA CTTGATGGTA TGCCTGCGAT CAATGCCAAA CGTGTTTACC GGATCATGCG CCAGAATGCG CTGTTGCTTG AGCGAAAAAC CGCTGTACCG CCATCGAAAC GGGCACATAC TGGCAAAGTG GCCGTGAAAG AAAGCAATCA ACGATGGTGC TCTGACGGGT TCGAGTTCCG CTGTGATAAC GGAGAAAAAC TGCGAGTCAC GTTCGCGCTG GACTGCTGTG ACCGTGAGGC ACTGCACTGG GCAGTCACTA CGGGCGGCTT CGACAGTGAA ACAGTACAGG ACGTCATGCT GGGAGCGGTG GAACGCCGCT TCGGCAACGA GCTTCCGGCG TCTCCAGTAG AGTGGCTGAC GGATAATGGT TCATGCTACC GGGCTAATGA AACACGGCAG TTTGCCCGGA TGTTGGGGCT TGAACCGAAG AACACGGCGG TGCGGAGTCC GGAGAGTAAC GGCATAGCAG AGAGCTTCGT GAAAACGATA AAGCGTGACT ACATCAGTAT CATGCCTAAA CCAGACGGGT TAACGGCAGC AAAGAACCTT GCAGAGGCGT TCGAGCATTA TAACGAATGG CATCCGCATA GTGCACTGGG TTATCGCTCG CCACGGGAAT ATCTGCGGCA GCAGGCCAGT AATGGGTTAA GTGATAACAG GTGTCTGGAA ATATAG
|
Protein sequence | MDSARALIAR GWGVSLVSRC LRVSRAQLHV ILRRTDDWKD GRRSRHSDDT DVLLRIHHVI GELPTYGYRR VWALLRRQAE LDGMPAINAK RVYRIMRQNA LLLERKTAVP PSKRAHTGKV AVKESNQRWC SDGFEFRCDN GEKLRVTFAL DCCDREALHW AVTTGGFDSE TVQDVMLGAV ERRFGNELPA SPVEWLTDNG SCYRANETRQ FARMLGLEPK NTAVRSPESN GIAESFVKTI KRDYISIMPK PDGLTAAKNL AEAFEHYNEW HPHSALGYRS PREYLRQQAS NGLSDNRCLE I
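The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (which match the standard code) and that the input begins at the initiator codon, which is read as M even when it is GTG, as it is here. The function name `translate_cds` and the constants are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment that starts at its initiator codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break  # stop codon ends the protein
        # Assumption: the first codon is a valid start codon, read as M.
        protein.append("M" if i == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_30 = "GTGGATAGCG CACGCGCCCT TATTGCCCGG"
print(translate_cds(first_30))  # MDSARALIAR
```

The result matches the first ten residues of the protein sequence. The sketch does no validation: it raises `ValueError` on any character other than A, C, G, or T.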