Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3235 |
Symbol | |
ID | 6066781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3542694 |
End bp | 3543896 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641602650 |
Product | exonuclease subunit SbcD |
Protein accession | YP_001726184 |
Protein GI | 170021230 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00619] exonuclease SbcD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.294623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000199407 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAGA CAGCACAAAC CCATCAGGTG GATGCGATTA TTGTTGCCGG TGATGTTTTC GATACCGGCT CGCCGCCCAG TTACGCCCGC ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC AATACTACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG ACGCCAGGCG CAGTGCTGTG CCCCATTCCG TTTTTACGTC CGCGTGACAT TATTACCAGC CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTAC TGGCAGCGAT TACCGATTAT TACCAACAAC ACTATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC GCCACGGGAC ATTTAACGAC CGTGGGGGCC AGTAAAAGTG ACGCCGTGCG TGACATTTAT ATTGGCACGC TGGACGCGTT TCCGGCACAA AACTTTCCAC CAGCCGACTA CATCGCGCTC GGGCATATTC ACCGCGCACA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGCTCC CCCATTCCGC TGAGTTTTGA TGAATGCGGT AAGAGTAAAT ATGTCCATCT GGTGACATTT TCAAACGGCA AATTAGAGAG CGTAGAAAAC CTGAACGTAC CGGTAACGCA ACCCATGGCA GTGCTGAAAG GCGATCTGGC GTCGATTACC GCACAGCTGG AACAGTGGCG CGATGTATCG CAGGAGCCAC CTGTCTGGCT GGATATCGAA ATCACTACTG ATGAGTATCT GCATGATATT CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCTGTCG AAGTATTGCT GGTACGTCGG AGTCGTGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCGCAGCAG CAACGTCTGC AGCATCTTTT CACCACGACG TTGCATACCC TCGCCGGAGA ACACGAAGCA TGA
|
Protein sequence | MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG TPGAVLCPIP FLRPRDIITS QAGLNGIEKQ QHLLAAITDY YQQHYADACK LRGDQPLPII ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT AQLEQWRDVS QEPPVWLDIE ITTDEYLHDI QRKIQALTES LPVEVLLVRR SREQRERVLA SQQRETLSEL SVEEVFNRRL ALEELDESQQ QRLQHLFTTT LHTLAGEHEA
|
| |