Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1150 |
Symbol | |
ID | 6143687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1173551 |
End bp | 1174576 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641616028 |
Product | phage integrase family site specific recombinase |
Protein accession | YP_001743215 |
Protein GI | 170682438 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000198353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000000000000625111 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAAGAC GAAGGAAAAA TCCTGAACAC GAAAAATTAC CTCCAAATGT ATACCCAAAT AAATATAGTT ATGTATGGAA ACCAACATCC AGAGAATCTG TCACACTAAC CGCCATCAAG GATGGTTTAG CTGCTTTATG GAAAAAGTAT GAGGAAACTG TAAATAATCG CGATCGTGCA ATGACATTCG GTCGCTTGTG GGAAAAATTC CTCGCCAGCG CCTATTACAG TGACCTTAGT CCAAGAACAC AAAAAGATTA TCTGCAACAT CAAAAAAAGT TGCTTGCCGT ATTCGGTAAG GTACCAGCGG ATTCCATAAA ACCAGAACAC ATCCGTCGAT ACATGGACAA AAGAGGGGAG CAGAGTAAAA CGCAAGCCAA CCATGAAAAA AGCAGTATGT CCCGTGTTTA CAGTTGGGGG TATGAGCGAG GGTACGTGAA GGCTAACCCA TGTGCAGGTG TAAGTAAATT CAAGGCCAAA AACCGCGAAC GATATGTAAC CGACAAAGAA TACCAGGCAG TATTAAGCGT TGCACCTCTT CCTGTTTTTA TCGCAATGGA AATTGCCTAT CTGTGTGCAG CGAGGGTTTC CGATGTGTTA TCGCTGAAAT GGGAACAGAT TGGAAACGAC GGGATATTCA TCCAGCAAGG GAAAACCGGA AAAAAACAGA TAAAAGCATG GAGTCCACGA TTACAGGCAG CGATCGAAAA AGCAAAACAG TTACCAAAAT CTGCCTATGT GATCAGCAAT CAATACGGCA ACCGATATAT GTACAAAGGC TTTAACGAAA TGTGGGTAGA TGCAAGAAAT CGTGCTGGAA AAATTTCAGG TATTTTAACC GACTTCACCT TTCATGATCT GAAGGCGAAA GGAATTTCAG ACTATGAAGG AAGCAGCCGG GATAAGCAAC TTTTCTCTGG TCACAAAACC GAAGGGCAAG TGCTAATCTA TGACAGGAAG GTTAAAGTTT CACCAACACT TGATGTCCCG TTACCTGAAA ATATTCCAAG AAAATATTCC AAGTAA
|
Protein sequence | MGRRRKNPEH EKLPPNVYPN KYSYVWKPTS RESVTLTAIK DGLAALWKKY EETVNNRDRA MTFGRLWEKF LASAYYSDLS PRTQKDYLQH QKKLLAVFGK VPADSIKPEH IRRYMDKRGE QSKTQANHEK SSMSRVYSWG YERGYVKANP CAGVSKFKAK NRERYVTDKE YQAVLSVAPL PVFIAMEIAY LCAARVSDVL SLKWEQIGND GIFIQQGKTG KKQIKAWSPR LQAAIEKAKQ LPKSAYVISN QYGNRYMYKG FNEMWVDARN RAGKISGILT DFTFHDLKAK GISDYEGSSR DKQLFSGHKT EGQVLIYDRK VKVSPTLDVP LPENIPRKYS K
|
| |