Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3126 |
Symbol | |
ID | 6145603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3211704 |
End bp | 3213278 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617990 |
Product | IS66 family transposase orfB |
Protein accession | YP_001745140 |
Protein GI | 170683550 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCT CACTTGCTCA TGAGAACGCC CGCCTGCGGG CACTGTTGCA GACGCAACAG GACACCATCC GCCAGATGGC TGAATACAAC CGCCTGCTCT CACAGAGGGT GGCGGCTTAT GCTTCTGAAA TCAACCGGCT GAAGGCGCTG GTTGCGAAAC TGCAACGTAT GCAGTTCGGT AAAAGCTCAG AAAAACTTCG TGCAAAAACC GAACGGCAGA TACAGGAAGC ACAGGAGCGA ATCAGCGCAC TTCAGGAAGA AATGGCGGAA ACGCTGGGTG AGCAATATGA CCCGGTACTG CCATCCGCCG CCCTGCGTCA GTCTTCAGCC TGTAAACAGT TACCGGCCTC ACTTCCCCGT GAAACCCGGG TTATCCGGCC GGAAGAGGAA TGCTGTCCTG CCTGTGGTGG TGAACTCAGT TCTCTGGGAT GTGATGTGTC AGAGCAACTG GAGCTTATCA GCAGCGCCTT TAAGGTTATC GAAACACAAC GTCCGAAACA GGCCTGTTGC CGGTGCGACC ATATCGTGCA GGCACCAGTA CCTTCAAAAC CCATTGCACG CAGTTATGCC GGAGCGGGGC TTCTGGCCCA TGTTGTCACC GGGAAATATG CAGACCATCT GCCGTTATAC CGCCAGTCAG AAATATACCG TCGTCAGGGA GTGGAGCTGA GCCGTGCCAC ACTGGGGCGC TGGACAGGTG CTGTTGCTGA ACTGCTGGAG CCGCTGTATG ACGTCCTGCG CCAGTATGTG CTGATGCCCG GTAAAGTCCA TGCTGATGAT ATCCCCGTCC CGGTCCAGGA GCCGGGCAGC GGTAAAACCC GGACAGCCCG GCTGTGGGTC TACGTCCGTG ATGACCGTAA CGCCGGTTCA CAGATGCCCC CGGCGGTCTG GTTCGCGTAC AGTCCGGACC GGAAAGGTAT CCATCCACAA AATCACCTGG CCGGTTACAG CGGTGTGCTT CAGGCCGATG CTTACGGTGG TTACCGGGCG TTATACGAAT CCGGCAGAAT AACGGAAGCC GCGTGTATGG CTCATGCCCG GAGAAAAATC CACGATGTGC ATGCAAGAGC GCCCACCTAC ATCACCACGG AAGCCCTGCA GCGTATCGGT GAACTGTATG CCATCGAGGC AGAGGTCCGG GGCTGTTCAG CAGAACAGCG TCTGGCGGCA AGAAAAGCCA GAGCCGCGCC ACTGATGCAG TCACTGTATG ACTGGATACA GCAACAGATG AAAACACTGT CGCGTCACTC AGATACGGCA AAAGCGTTCG CATACCTGCT GAAACAGTGG GATGCACTGA ACGTGTACTG CAGTAATGGC TGGGTGGAAA TCGACAACAA CATCGCAGAG AACGCCTTAC GGGGAGTGGC CGTAGGCCGG AAAAACTGGA TGTTCGCGGG TTCCGACAGC GGTGGTGAAC ATGCGGCGGT GTTGTACTCG CTGATCGGCA CATGCCGTCT GAACAATGTG GAGCCAGAAA AGTGGCTGCG TTACGTCATT GAACATATCC AGGACTGGCC GGCAAACCGG GTACGCGATC TGTTGCCCTG GAAAGTTGAT CTGAGCTCTC AGTAA
|
Protein sequence | MDTSLAHENA RLRALLQTQQ DTIRQMAEYN RLLSQRVAAY ASEINRLKAL VAKLQRMQFG KSSEKLRAKT ERQIQEAQER ISALQEEMAE TLGEQYDPVL PSAALRQSSA CKQLPASLPR ETRVIRPEEE CCPACGGELS SLGCDVSEQL ELISSAFKVI ETQRPKQACC RCDHIVQAPV PSKPIARSYA GAGLLAHVVT GKYADHLPLY RQSEIYRRQG VELSRATLGR WTGAVAELLE PLYDVLRQYV LMPGKVHADD IPVPVQEPGS GKTRTARLWV YVRDDRNAGS QMPPAVWFAY SPDRKGIHPQ NHLAGYSGVL QADAYGGYRA LYESGRITEA ACMAHARRKI HDVHARAPTY ITTEALQRIG ELYAIEAEVR GCSAEQRLAA RKARAAPLMQ SLYDWIQQQM KTLSRHSDTA KAFAYLLKQW DALNVYCSNG WVEIDNNIAE NALRGVAVGR KNWMFAGSDS GGEHAAVLYS LIGTCRLNNV EPEKWLRYVI EHIQDWPANR VRDLLPWKVD LSSQ
|
| |