Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1532 |
Symbol | |
ID | 6065895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1694729 |
End bp | 1698523 |
Gene Length | 3795 bp |
Protein Length | 1264 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641600949 |
Product | WGR domain-containing protein |
Protein accession | YP_001724519 |
Protein GI | 170019565 |
COG category | [S] Function unknown |
COG ID | [COG3831] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACT TTATCTATCA GGACGAAAAA TCACATAAAT TCTGGGCGGT GGAGCAACAG GGAAACGAGT TGCATATCAG TTGGGGAAAA GTTGGCACCA AAGGGCAAAG TCAGATAAAA AGTTTTTCAG ATGCTGCGGC AGCGGAAAAA GCGGAACTTA AGCTGATTGC GGAGAAGGTG AAGAAGGGGT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG GGCTCTCTCA AGGTAGCGGA CTTATCCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA GAAACCCGTG CGCCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC GAAAGATATT GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAGCCTG TAGTGTGTCG CAACGGGATA ATAAAACAGC CACATTTGAC TTCAGCGCCT GTTCTTTAGA ATGGCAAAAC ACCGTCGCCC AGGCGATCAG TCAGATCGAC GGCCTGAAAA CAACACAGTT ACCATCACCA GTAATGGCTG TACTCACGGC ACTTGAAATG AAATGCACAA GATATAAAGT GCGTGAGGAT GTTATGGATC AGATCGTCCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAC CTTCAACAGA TTGATATTGA ATGGGATTAT GCGAATAATG TCATTATTAT TCTGCCGTCT GGCATTGCAC CTAGCTACTT GGAGCAATAT TCCAGATTTG AATTACGCCT ACGTAAACAT TTATCACTGA CGGAAGAGTC TCTCTGGCAA AAATGTGCAC AAAAACTTAT TGCCGCAATT CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTACCCGA AAAACCAGAA ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAAA AATTACCCTC GCTTGAGTGG TTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAACCA TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG CAAGGTGTTG CAGCCCTGCC CCGATTTGCT CCCTATGCCG CAAGTGACTA CTGCGCCGAT GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTGACACTGC TTATACGTGT AGCCGGGCAA ACTAAACGCT GTCACGATCG GATGACGAAA GCCATTGCTG CGTTCCCACA TGCAGCAATG GCGGCACTGA CGGAACTTCT TGGGCAAAAA GAAGAGAACA GTTGGCGCAT TATGCTAATG ACAATGCTTA TCTCACAACC AGCACTGGCA GAACAGGTCA TTCCCTGGCT CTCGACACCC GCAGTTGCCG TACTGAAATC ATGCCAGCAA CAACTGACAC AGCCCTCAAA CCATGCCAGC GCCGATCTAC TGCCAGCCGT AGTAGTCTCC CCTCCCTGGC TTTCGAAAAA GAAAAAATCG CCGATTCCGG TGCTGGATTT AGCGCCATTA GGCATTGAGC CAATCTGTTA TCTGACAGAA GAAATCAGTA ATCAACTTTT GGCGAAATAT ATCTGGTATT CAAAACACAT CACGGTTAGC CATGAAGAAA GTACTACCAA CCTGTTGGCA AGGATGGGTT TTCAACGACG GATCGCTGGT ACATATATTA AAGCTCCCGA AGCGGTAGTT GAGGCATGGC TAAATGAAGA TTATTCAACC TTACTAAGTG AATTTAAGGT GTTTCATTCA CCTACCGGGC ATTATTGGCA GTTGGGGATT TTGACAACAT TGCCGCTGGA GAAAGCAGTA AAAGCATGGA ATGCCCTTAC CCTATCTCCA CATACCGATA CCGAATACTC CATGTTACAT TTTGGACTCA AAGGGTTACC TGGGTTAGTA AACTCACTTG CACGCTATCC ACAAGAAGCC TTGCCCATCA CGAATTACTT CGCAGCGAGT GAGCTGGCTC CTGCCGTCGC CCGTGCCTTC AACAAACTGA AAACGCTACG CGAAAACGCC CGTAGCTGGC TGTTGAAATA CCCGGAACAT GCCCTTACCG GCCTGCTGCC TGCGGCGCTC GGCAAAGCCG GTGAAGCACA GGATAACGCC CGCGCTGCCT TGCGTATGCT TACCGAAAAC GGTCATCAGC CATTACTGCA AGAAATCGCC CGACGTTATA ACCAGCCGGA AGTAACCGAT GCGGTGAACG CTCTGCTTGC GCTCGATCCC TTAGATAATC ACCCGACAAA AATCCCCACT CTTCCGGCCT TTTATCAGCC ATCGCTCTGG ACGCGCCCGG TATTAAAAGC AAATGCCCAA TCACTGCCAG ATAGCGCCCT CCTCCACCTC GGTGAAATGC TCCGCTTCCC TCAGGAAGAG GCTCTGTATC CGGGATTATT GCAGGTGAAA GACGTCTGTT CCGCCGACTC ACTGGCGGGA TTTGCCTGGG ATCTGTTTAC CGCCTGGCAG ACCGCTGGCG CGCCGTCGAA AGAGAGTTGG GCGTTCACTG CGTTAGGCGT TCTCGGTAAC GATGACACCG CCCGCAAACT GACGCCATTA ATACGCGCCT GGCCTGGTGA ATCCCAGCAT AAACGCGCCA CCGTTGGGTT GGATATTCTC GCTGCTATCG GTAGTGATAT CGCCCTTATG CAGCTTAACG GCATCGCCCA GAAACTGAAA TTCAAAGCAT TACAGGAGCG GGCAAAAGAA AAAATTGCCG ACATTGCCGA GAGCCGCGAA CTCACGGTGG CGGAGCTTGA AGATCGGTTA GCACCGGATC TCGGTCTGGA TGATAACGGT TCGCTGCTGC TGGATTTTGG CCCACGGCAG TTCACCGTCA GCTTTGATGA AACCTTAAAA CCGTTTGTGC GTGATGTTTC CGGCAGCCGC CTGAAAGACT TGCCCAAACC GAACAAAAGC GATGATGAAA CGCGGGCGAA CGATGCGGTT AACCGCTACA AATTGCTGAA AAAAGATGCG CGTACCATCG CCGCCCAGCA GGTAGCAAGG CTGGAATCCG CCATGTGCCT GCGCCGCCGC TGGTCGCTGG AAAACTTCCA GCTCTTCCTG GTTGAGCATC CGCTGGTTCG TCACTTAACC CGCCGTCTGA TTTGGGGCGT TTATAGCGCC GAAAACCAGC TACTGGCTTG CTTTCGCGTA GCAGAAGATA ACAGCTCCAG CACCGCTGAC GATGATCTTT TCACCCTGCC GGAAGGCGAT ATCTCTATCG GCACTCCTCA CGTTCTGGAA ATATCACCAA CGGATGCTGC CGCCTTTGGT CAGCTTTTTG CCGACTACGA ACTGCTACCA CCGTTCCGCC AGCTCGACCG TAACAGCTAC GCCCTGACAG AAGCCGAGCG CAATGCCAGT GAACTGACCC GCTGGGCAGG CAGAAAATGC CCAAGTGGTC GGGTCATGGG GCTGGCGAAT AAAGGCTGGA TAAAGGGCGA ACCACAGGAT GGAGGCTGGA TCGGATGGAT GATCAAACCT TTGGGTCGCT GGTCGTTAAT CATGGAAATC GATGAAGGCT TTGCGGTAGG CATGTCGCCA GCCGAACTCA GCGCTGAGCA GCTCTTAAGC AAGCTGTGGC TATGGGAAGG CAAAGCAGAA AGATATGGCT GGGGGAGTAA TTCAACACAG GAAGCGCAGT TCTCCGTAAT CGATGCCATC ACCGCCAGCG AGCTAATTAA CGATATTGAA GCCCTGTTTG AATAA
|
Protein sequence | MRHFIYQDEK SHKFWAVEQQ GNELHISWGK VGTKGQSQIK SFSDAAAAEK AELKLIAEKV KKGYVEQAKD NSLQPSQTVT GSLKVADLST IIQEQPSFVA ETRAPDKNTD AVLPWLAKDI AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLACSVS QRDNKTATFD FSACSLEWQN TVAQAISQID GLKTTQLPSP VMAVLTALEM KCTRYKVRED VMDQIVQEGG LEYATDVIIH LQQIDIEWDY ANNVIIILPS GIAPSYLEQY SRFELRLRKH LSLTEESLWQ KCAQKLIAAI PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYAASDYCAD VLRHINHPFA LTLLIRVAGQ TKRCHDRMTK AIAAFPHAAM AALTELLGQK EENSWRIMLM TMLISQPALA EQVIPWLSTP AVAVLKSCQQ QLTQPSNHAS ADLLPAVVVS PPWLSKKKKS PIPVLDLAPL GIEPICYLTE EISNQLLAKY IWYSKHITVS HEESTTNLLA RMGFQRRIAG TYIKAPEAVV EAWLNEDYST LLSEFKVFHS PTGHYWQLGI LTTLPLEKAV KAWNALTLSP HTDTEYSMLH FGLKGLPGLV NSLARYPQEA LPITNYFAAS ELAPAVARAF NKLKTLRENA RSWLLKYPEH ALTGLLPAAL GKAGEAQDNA RAALRMLTEN GHQPLLQEIA RRYNQPEVTD AVNALLALDP LDNHPTKIPT LPAFYQPSLW TRPVLKANAQ SLPDSALLHL GEMLRFPQEE ALYPGLLQVK DVCSADSLAG FAWDLFTAWQ TAGAPSKESW AFTALGVLGN DDTARKLTPL IRAWPGESQH KRATVGLDIL AAIGSDIALM QLNGIAQKLK FKALQERAKE KIADIAESRE LTVAELEDRL APDLGLDDNG SLLLDFGPRQ FTVSFDETLK PFVRDVSGSR LKDLPKPNKS DDETRANDAV NRYKLLKKDA RTIAAQQVAR LESAMCLRRR WSLENFQLFL VEHPLVRHLT RRLIWGVYSA ENQLLACFRV AEDNSSSTAD DDLFTLPEGD ISIGTPHVLE ISPTDAAAFG QLFADYELLP PFRQLDRNSY ALTEAERNAS ELTRWAGRKC PSGRVMGLAN KGWIKGEPQD GGWIGWMIKP LGRWSLIMEI DEGFAVGMSP AELSAEQLLS KLWLWEGKAE RYGWGSNSTQ EAQFSVIDAI TASELINDIE ALFE
|
| |