Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4315 |
Symbol | |
ID | 6144994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4412447 |
End bp | 4414732 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619136 |
Product | replication gene A protein |
Protein accession | YP_001746260 |
Protein GI | 170681873 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.908807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.000153162 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGTTA AAGCCTCCGG GCGTTTTGTC CCTCCGTCAG CATTTGCCGC AGGCACCGGT AAGATGTTTA CCGGTGCTTA TGCATGGAAC GCGCCACGCG AGGCCGTCGG GCGCGAAAGA CCCCTTACAC GTGATGAGAT GCGTCAGGTG CAAGGTGTTT TATCCACGAT TAACCGCCTG CCTTACTTTT TGCGCTCGCT GTTTACTTCA CGCTATGACT ACATCCGGCG CAATAAAAGC CCGGTGCACG GGTTTTATTT CCTCACATCC ACTTTTCAGC GTCGTTTATG GCCGCGCATT GAGCGTGTGA ATCAGCGCCA TGAAATGAAC ACCGACGCGT CGTTGCTGTT TCTGGCAGAG CGTGACCACT ATGCGCGCCT GCCGGGAATG AATGACAAGG AGCTGAAAAA GTTTGCCGCC CGTATCTCAT CGCAGCTTTT CATGATGTAT GAGGAACTCT GCGATGCCTG GGTGGATGCC CATGGCGAAA AAGAATCGCT GTTTACGGAT GAGGCGCAGG CTCACCTGTA TGGTCATGTT GCTGGCGCTG CACGTGCTTT CAATATTTCC CCGCTGTACT GGAGAAAATA CCGTAAAGGG CAGATGACCA CGAGGCAGGC ATATTCTGCC ATTGCCCGCC TGTTTAACGA TGAGTGGTGG ATTAGTCAGC TTAAAGGCCA GCGTATGCGC TGGCATGAGG CGTTACTGAT TGCTGTCGGG GAGGTCAATA AAGACCGTTC ACCTTATGCC AGTAAACATG CCATTCGTGA TGTGCGTGCG CGCCGCCAGG CAAATCTGGA ATTTCTTAAA TCGTGTGACC TTGAAAACAG GGAAACCGGC GAGCGCATCG ACCTTATCAG TAAGGTGATG GGCAGTATTT CTAATCCTGA AATTCGCCGG ATGGAGCTGA TGAACACCAT TGCCGGTATT GAGCGTTACG CCGCTGCAGA GGGTGATGTG GGGATGTTTA TCACGCTGAC CGCGCCGTCA AAGTATCACC CGACACGTCA GGTAGGAAAA GGCGAAAGTA AAACCGTGCA GCTTAATCAC GGCTGGAACG ATGAGGCATT TAATCCAAAG GATGCGCAGC GTTATCTCTG CCGTATCTGG AGCCTGATGC GCACGGCATT CAAGGATAAT GATTTACAGG TCTACGGTTT GCGTGTCGTC GAGCCACACC ACGACGGAAC GCCGCACTGG CATATGATGC TTTTTTGTAA TTCGCGCCAG CGTAACCAGA TTATCGAAAT CATGCGTCGC TATGCGCTCA AAGAGGATGG CGACGAAAGA GGAGCCGCGC GAAACCGTTT TCAGGCAAAA CACCTTAATC GGGGCGGTGC TGCGGGGTAT ATCGCGAAAT ACATTTCAAA AAACATCGAC GGCTATGCAC TGGATGGTCA GCTCGATAAC GATACCGGCA GGCCGCTGAA AGACACTGCC GCGGCTGTTA CCGCATGGGC GTCAACGTGG CGCATTCCGC AATTTAAAAC GGTTGGCCTA CCGACAATGG GGGCTTACCG TGAACTACGC AAATTACCTC GCGGCGTCAG TATTGCTGAT GAGTTTGACG AACGCGTCGA GGCTGCTCGC GCTGCCGCAG ACAGTGGTGA TTTTGCGTTG TATATCAGCG CGCAGGGTGG GGCAAATGTC CCGCGCGATT GTCAGACTGT CAGGGTTGCC CGTAGCCCGT CGAGTGACGT TAACGAGTAC GAGGAAGAAG TCGAGAGAGT GGTCGGTATT TACGCGCCGC ATCTCGGCGC GCGTCATATT CATATCACCA GAACGACGGA CTGGCGCATT GTGCCGAAAG TTCCGGTCGT TGAGCCTTTG ACTTTAAAAA GCGGCATCGC CGCGCCTCGG AGTCCTGTCA ATAACTGTGG AAAGCTCACC GGCAGTGATA CTTCGTTACC GGCTCCCACA CCTTATGAAC ATGCCGCAGC CGTGCTTAAT CTGGTTGATG ACGGTGTTAT CGAATGGAAT GAGCCAGAGG TCGTGAGGGC GCTCAGAGGT GCATTAAAAC ACGAACTGAG AACACCAAAT CGTCAGCAGA GAAACGGAAG CCCGTTAAAA CCACATGAAA TAGCGCCATC GGCCAGACTG ACCCGGTCGG AACGAACGCA AATTACCCGT ATCCGCGTTG ACCTTGCTCA GAACGGTATC AGGCCGCAGC GATGGGAGCT TGAGGCGCTG GCGCGTGGCG CGACCGTAAA TTATGACGGG AAAAAATTCA CGTATCCGGT CGCTGATGAG TGGCCGGGAT TCTCAACAGT AATGGAGTGG ACATAA
|
Protein sequence | MAVKASGRFV PPSAFAAGTG KMFTGAYAWN APREAVGRER PLTRDEMRQV QGVLSTINRL PYFLRSLFTS RYDYIRRNKS PVHGFYFLTS TFQRRLWPRI ERVNQRHEMN TDASLLFLAE RDHYARLPGM NDKELKKFAA RISSQLFMMY EELCDAWVDA HGEKESLFTD EAQAHLYGHV AGAARAFNIS PLYWRKYRKG QMTTRQAYSA IARLFNDEWW ISQLKGQRMR WHEALLIAVG EVNKDRSPYA SKHAIRDVRA RRQANLEFLK SCDLENRETG ERIDLISKVM GSISNPEIRR MELMNTIAGI ERYAAAEGDV GMFITLTAPS KYHPTRQVGK GESKTVQLNH GWNDEAFNPK DAQRYLCRIW SLMRTAFKDN DLQVYGLRVV EPHHDGTPHW HMMLFCNSRQ RNQIIEIMRR YALKEDGDER GAARNRFQAK HLNRGGAAGY IAKYISKNID GYALDGQLDN DTGRPLKDTA AAVTAWASTW RIPQFKTVGL PTMGAYRELR KLPRGVSIAD EFDERVEAAR AAADSGDFAL YISAQGGANV PRDCQTVRVA RSPSSDVNEY EEEVERVVGI YAPHLGARHI HITRTTDWRI VPKVPVVEPL TLKSGIAAPR SPVNNCGKLT GSDTSLPAPT PYEHAAAVLN LVDDGVIEWN EPEVVRALRG ALKHELRTPN RQQRNGSPLK PHEIAPSARL TRSERTQITR IRVDLAQNGI RPQRWELEAL ARGATVNYDG KKFTYPVADE WPGFSTVMEW T
|
| |