Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2052 |
Symbol | |
ID | 6067742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2263784 |
End bp | 2265079 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641601464 |
Product | integrase family protein |
Protein accession | YP_001725023 |
Protein GI | 170020069 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00969136 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAAG TTGAAATGAA ATATCCGACA GGCGTGGAAA ACCATGGAGG GAAATTACGT ATCTGGTTTG TTTATAAAGA CGTAAGAGTC AGGGAAAATC TGGGGGTTCC TGACACAGCA AAAAACAGGC GCGTTGCAGG TGAACTACGC TCCTCTGTTT GTTACGCAAT AAAAACTGGT GTTTTCGACT ATGCAAAACA GTTTCCCTCC TCACGCAATC TGGAAAAATT TGGTGAGGCC CGACAAGATT TAACCATAAA AGAACTGGCT GAAAAATTTC TGGCACTGAA AGAAACTGAA GTCGCCAAAA CATCACTCAA CACATACCGT GCCGTCATCA AAAATATCCT GAGCATAATC GGTGAAAAAA ATCTTGCCTC ATCGATTAAT AAAGAAAAAT TACTGGAGGT TCGTAAAGAG TTACTGACTG GATACCAGAT CCCCAAAAGT AACTATATTG TTACACAACC AGGGAGATCG GCTGTAACTG TAAATAATTA CATGACAAAT CTTAACGCCG TGTTCCAGTT TGGTGTTGAT AACGGTTACC TGGCAGATAA TCCGTTTAAG GGGATCTCGC CATTAAAGGA ATCAAGAACC ATTCCGGATC CTCTTTCGCG GGAAGAATTT ATCCGTCTTA TCGATGCGTG CAGAAATCAG CAAGCAAAAA ATTTATGGTG TGTTTCTGTT TATACTGGAG TTCGCCCTGG TGAGCTGTGT GCACTTGGAT GGGAGGACAT AGATCTGAAA AATGGAACAA TGATGATCAG GAGAAATTTA GCAAAAGACC GTTTCACGGT ACCAAAAACA CAGGCGGGAA CCAATCGGGT CATTCATCTT ATTAAGCCAG CAATCGACGC TCTCCGGAGT CAGATGACAT TAACGAGACT GAGCAAAGAG CATATCATTG ATGTTCACCT CAGAGAGTAT GGCAGAACAG AAAAACAAAA ATGCACCTTT GTTTTTCAAC CTGAAGTGTC AGCGAGAGTA AAAAATTATG GTGACCATTT TACCGTTGAC TCAATAAGGC AGATGTGGGA CGCAGCGATA AAACGTGCCG GACTCCGCCA TCGAAAATCA TATCAGTCGA GACATACTTA TGCCTGCTGG TCGCTGACAG CTGGTGCTAA CCCGGCATTT ATAGCAAACC AGATGGGCCA TGCAGATGCG CAAATGGTAT TTCAGGTATA CGGAAAATGG ATGTCTGAAA ACAATAATGC ACAGGTAGCT TTGTTAAATA CACAGTTAAG CGAGTATGCC CCAACCATGC CCCATAACGA AGCAATGAAA AATTAA
|
Protein sequence | MREVEMKYPT GVENHGGKLR IWFVYKDVRV RENLGVPDTA KNRRVAGELR SSVCYAIKTG VFDYAKQFPS SRNLEKFGEA RQDLTIKELA EKFLALKETE VAKTSLNTYR AVIKNILSII GEKNLASSIN KEKLLEVRKE LLTGYQIPKS NYIVTQPGRS AVTVNNYMTN LNAVFQFGVD NGYLADNPFK GISPLKESRT IPDPLSREEF IRLIDACRNQ QAKNLWCVSV YTGVRPGELC ALGWEDIDLK NGTMMIRRNL AKDRFTVPKT QAGTNRVIHL IKPAIDALRS QMTLTRLSKE HIIDVHLREY GRTEKQKCTF VFQPEVSARV KNYGDHFTVD SIRQMWDAAI KRAGLRHRKS YQSRHTYACW SLTAGANPAF IANQMGHADA QMVFQVYGKW MSENNNAQVA LLNTQLSEYA PTMPHNEAMK N
|
| |