Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4753 |
Symbol | |
ID | 6143639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4851586 |
End bp | 4852851 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641619567 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001746674 |
Protein GI | 170682520 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTAA CAGATATCAA AGTCAGAGCA GCCAAGCCAA CGGATAAGCA ATATAAGCTG ACTGATGGTG GCGGTATGCA TCTGCTTGTC CATCCAAATG GTTCTAAGTA CTGGCGTTTG CAGTACCGTT ATGAGGGAAA GCAAAAAATG CTGGCACTTG GGGTTTATCC TGAAATCACA CTAGCGGATG CCAGAGTACG TCGTGACGAG GCGCGTAAGC TGCTTGCGAA TGGCGTCGAT CCGGGAGACA AAAAGAAAAA TGATAAGGTT GAACAGAGTA AAGCACGAAC CTTTAAAGAA GTCGCGATTG AGTGGCATGG CACCAATAAA AAGTGGTCTG AAGATCACGC CCATCGTGTG CTAAAAAGTC TGGAAGATAA TCTTTTTGCA GCGCTTGGTG AACGTAATAT CGCTGAGTTA AAAACTCGAG ATTTATTAGC ACCCATTAAG GCCGTAGAAA TGTCTGGACG TCTTGAAGTG GCCGCTCGTC TTCAGCAGCG CACTACAGCC ATCATGCGCT ATGCAGTGCA AAGTGGGTTA ATTGATTATA ACCCGGCACA AGAGATGGCT GGGGCGGTTG CTTCCTGTAA TCGACAACAT CGTCCCGCGC TTGAATTAAA GCGCATCCCT GAGTTGCTTA CAAAAATAGA TAGCTATACT GGTAGGCCGC TAACCCGATG GGCGACAGAA CTCTCTTTGC TGATCTTTAT TCGGTCCAGT GAGCTGCGTT TTGCTCGTTG GTCAGAGATC GATTTCGAAG CGTCTATATG GACTATCCCA CCGGAGCGGG AGCCTATTCC TGGAGTGAAA CATTCCCATA GAGGCTCAAA AATGCGTACA ACGCATCTAG TGCCTCTTTC AACGCAAGCT CTTGCAATTT TAAAGCAGAT AAAACAGTTT TGTGGGGCCC ATGACTTGAT ATTTATTGGT GATCACGATT CGCACAAACC CATGAGTGAG AATACGGTAA ATAGTGCGTT ACGGGTCATG GGGTATGATA CAAAAGTAGA GGTTTGTGGT CATGGCTTTC GAACAATGGC CTGTAGTTCA TTGGTCGAAT CAGGTTTGTG GTCTCGTGAT GCTGTTGAAC GTCAGATGAG CCACATGGAG CGAAATTCAG TGAGGGCCGC GTATATCCAT AAAGCAGAGC ATCTGGAAGA ACGCCGCTTG ATGCTACAAT GGTGGGCCGA TTTTCTGGAT GCAAACAGAG AAAAATTTAT CAGTCCATTT GAATATGCAA AGATTAATAA TCCATTAAAA CAGTAA
|
Protein sequence | MALTDIKVRA AKPTDKQYKL TDGGGMHLLV HPNGSKYWRL QYRYEGKQKM LALGVYPEIT LADARVRRDE ARKLLANGVD PGDKKKNDKV EQSKARTFKE VAIEWHGTNK KWSEDHAHRV LKSLEDNLFA ALGERNIAEL KTRDLLAPIK AVEMSGRLEV AARLQQRTTA IMRYAVQSGL IDYNPAQEMA GAVASCNRQH RPALELKRIP ELLTKIDSYT GRPLTRWATE LSLLIFIRSS ELRFARWSEI DFEASIWTIP PEREPIPGVK HSHRGSKMRT THLVPLSTQA LAILKQIKQF CGAHDLIFIG DHDSHKPMSE NTVNSALRVM GYDTKVEVCG HGFRTMACSS LVESGLWSRD AVERQMSHME RNSVRAAYIH KAEHLEERRL MLQWWADFLD ANREKFISPF EYAKINNPLK Q
|
| |