Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3884 |
Symbol | |
ID | 6064348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4248009 |
End bp | 4252937 |
Gene Length | 4929 bp |
Protein Length | 1642 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603298 |
Product | hypothetical protein |
Protein accession | YP_001726813 |
Protein GI | 170021859 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG GTATTCACCA GTGATATCCG CGATGTGCTG GAACCCTGGA TAGCCCAGGA AAATGCAGCG CGTGAAGCGG AGCGTGCCGC TCGTGAACAG GGCAAAGACG TAGAGCCGGG ATACCGCGCT CCGTGGAACC AGTTTAACAG TCTGGCGACT GAGTTTTTCC GCAAACTTGC CGAGCACGAA AAACAGCGTC AGATCCCGCA GCGTCTGGCC GATCAACGTA ATCGCTGGCA GCCATTGTTA AAGGCGCTGG GCTACGAAAT TACGCCGCAG ATCCAAATGC TGGATGACGA TACACCACTG CCGGTACTGG CGCGTTACAA CAGCACTGAC GGTAGCCCGT GGCTGTGGAT TGTTGAAGCA CACGATCAGG AAGAAGGAAC GCTGGATCCG CTGGCGCTCT CTTTACTGAC CGCGCAATTC CCGGCGGATA CCGACAAACA TAAGCGCGAC AGCCTGCGCA AAAAAGCCAA CGGTGAATAT CGCAGCTGGC AGGATCTGCT CTCTACGGCG GTCTTCACTC AAAATGAACC GCCGCGTTTT GTGCTGCTGC TCGGTAACCG TCAGCTATTG TTGTTGGACC GTACTAAGTG GGCGCAAAAC CGTCTGCTAC GTTTTGATTT TGAAGAGATT TTAAGTCGTC GTGAAACGGA TACGCTGAAA GCGACGGCAG TGTTGCTACA TAAAGATTCG CTGCTGCCGG GCAGTGGGGC ACCTTATCTT GACTCGCTGG ATGACAATTC GCACAAACAT GCGTTTGGTG TTTCGGAAGA TCTGAAATAT GCCCTGCGCG AAAGTATTGA GCTGCTGGGC AACGAAGCGA TGCATTATCT GATCGACCGT GGCCTGGCAA ACTATACCGG TAACCGTGCG GTGGACCCGG ATGAACTGAG CCGCGAATGT CTGCGTTACA TGTACCGCCT GCTGTTCCTG TTCTACATTG AAGCGCGCCC GGAGCTGGGT TATGCGCCAA TGACCGCCAA AACCTATCTG CAAGGTTACA GCCTGGAAAC GTTGCGCGAT CTGGAGATGA TCCCGCTGAC CAGCGAAGAA GATCGCAACG GGCGCTACTT CCACGACAGC CTGAATATGC TGTTTAAACT GGTGCGCGAA GGCTACAACG GCGGCGTGAA AATACAGAGT GACCTGGAGA GCGGCGACCG GATCACCATC CATAGTCATC AGTTCAGCGT CCCGCGTCTG GAAAGTCATC TTTTCGATGC CAACAACACC CGCATTCTTA ACCGCGTGGT ATTCCGTAAC GAAACCCTGC AACAGATTAT CCAGGCGATG TCGTTAAGCC GCCCGGGCAA AGGGCGCTTT AACCGCCGCG GACGTATTTC TTATCGCCAG TTGGGTATCA ACCAGTTAGG TGCGGTGTAT GAGGCGCTGC TTTCCTATCG CGGATTCTTC GCCAGCGAAG ATCTCTACGA GGTGAAGAAA GCCGGGGAAG AGTTTAACGA GCTGGAGACG GGTTACTTCG TCAGTAAGGA TGAGATTGGC AAATACCACG AAGACGAGAA GGTCTACGAG AAAGACGGCA GTCTGCGCAT TCACCGCAAA GGCAGTTTTA TCTACCGTAT GGCCGGGCGC GACCGTGAGA AATCTGCCTC TTATTACACC CCGGAAGTGC TGACCCGCTC ACTGGTTAAA TATGCCCTGA AAGAACTGTT TAAGGAGCAG ATTGATCCCA TTAGCGATCC GCACGCCAAA GCTGATGCCA TCTTAAACCT CACCGTGTGC GAACCGGCGA TGGGCAGCGC GGCGTTCCTT AACGAAGCCA TCAACCAGCT GGCGGAAGCG TATCTGTTCC ACAAGCAGCA GGCGGAAGGT CGCCGTATTC CGCAGGATCG TTACACCCAG GAGTTACAGC GGGTGAAAAT GTACATTGCC GACAACAACG TTTTCGGCGT GGACTTAAAC CCGGTGGCGG TGGAACTGGC GGAAGTGTCG CTGTGGTTGA ACGCCATCAG TGGCGATGCC TTTGTACCGT GGTTTGGTTA CCAGCTGCAC TGCGGTAACT CACTGGTGGG CGCGCGCCGT CAGGTGTTCA ACAAGAGTGA ACTGACCTAC AAAAAAGCCA AAGATCCGAG CTGGCTTAAC AGCGAGCCGG TCGAACTGGC GATGAACACG CCGCGTGAAG AGACGCAGAT TTTTCACTTC CTGCTGCCCG ACGGCGGTAT GGCTAACTAC AGCGATAAAA CTGTTAAGCA GCGTTATCCG GATGACTTCA AAGCCCTGGA CAGCTGGCGC AAAGAGTTTA TTAAAAGCTT TGCCGGGCAT GAGATTGCTG ATGTGCAGCG TATCAGCGAA AAGGTGGAAG CTCTGTGGAA CACCTATCGC CAGCAACTTA AAGCAGAACG TCTGAAAACC GCCGACAGCT ATCCGGTGTG GCCGGCAGAA AACAGCGAGC AGACGCGTTC TTCGCTGAGC AGTAAAGATG AAACCTTCAG CGGTCGTCTT GAAGATAACA GCGCCTACCA GAAGCTGCGT TGGGTGATGG ACTACTGGTG CGCGCTGTGG TTCTGGCCGA TCGACAAAGC CGATGAGTTA CCGGATCGCG GTACCTGGTT GTTTGAGATT GAAACCCTGC TCGACGGGAT TGTGATTACG GAAAAAGTCA CTGAAGTTGC GGAGCACACC ACTGGCGATC TGTTTGCCGA AGACGGCCTG CTGCGAGAAG AGTCTTCACT GTTTTCTGTT GCTGGTCGTC TGAAAACCGA GGTGTTGTTC CGCCATTTGC CGCGTCTGGC GATTGTCGAT GCCCTGAGAA AGCAGCACCG TTTCTTCCAC TGGGATCTGG AGTTCTGCGA CCTGTTTGCC GAGCGCGGCG GTTTTGACCT GATGCTCGGA AACCCGCCGT GGCTAAAAGT GGAATGGCAG GAAGCTGGCG TGCTGGGTGA TTACGAGCCG GAATTTGTGC TGCGTAAGCT GAGTGCATCG AAGCTGGCGA TGTTGCGTAT TGATACCTTT AACCAGATCC CGGCGCTGGA AGCGGCCTGG CGCAGCGAGT ATGAAGGCTG TGAAGGGATG CAAAACTTCC TCAATGCGCA GCAGAACTAC CCGGTACTGC GCGGGGTACA GACCAACTTG TATAAATGCT TCCTGCCGCA GGCATGGCGC TTAGGGGCAG AGAAAGGCGT GGCAGGTTTC CTGCACCCGG AAGGGATTTA TGATGACCCG AAAGGCGGGC AATTACGTGC GGCGGTATAT CCGAGGCTGA GGGCGCATTT TCAATTTCAT AATGAATTGA GTCTTTTTGC CGAGGTTCAT CATGCAACGA TGTTCAGTAT CAATGTCTAT GGACCGCAAA ATACAACGCC GTCTTTTATT AACATGTCAA ACGTTTACGC AGTAAGCGCT ATTGATGCCT CGTTTGAACA CAGCAACGCT GGTCCTGTTC CTGGTATCAA AGATGAGCAG GAAATTGAAG GCAAAATCAA AGTCTCCTGG AATACGTCGG GCCACCGTTC ACGTCTAATT CATATTGGTT TGAAGGAGCT CTCTTTATTT GCTCGTCTGT ATGACAGCGA AGGAACGCCA GCCTGGCAGG CACGTTTGCC AGCCTTACAT GCTAAACAAC TTGTTGCTGT ACTGGAAAAG TTTGCTAATC AGCCGAAGAG ATTGGGTGAT TTGCAAGGGC AGTATTTTTC AACGGTTATG TTCGATGAAA CATATGCGCA GAGGGATGGG ACAATCTTGC GGCAGACTCA ATTCCCTCAA GATTCATCAC AATGGGTTTT ATCTGGCCCG CACTTCTTTG TTGGGACACC GTTCTACAAG ACTCCACGCC AAAACTGTAC ATTAAACAGC GATTATGACT GTCTGGACTT GCTAACTCTG CCTGACGACT ATCTGCCGCG CACTAACTAC ATTCCAGCAT GTGATGCACA GGAGTATGCA AAGCGCACCC CACGGGTTTC GTGGAAAGAG CAGGATGAAG ACGAGCCGAG GAAGGTGACG GACTATTATC GTTTTGTTGC TCGCTCAATG CTCAGTCAAT CGGGTGAACG AACCTTGATT CCTGCTATAT TCCCGGCCGG TGTTGCCCAT ATCGATCCAT GTTTCAGCAT CGCATTTAAT GAAGTAGAAA CTTTATTATC TTTTACTGGT CAATCAATGT CGATTTGCCA CGACTTTTTG ATAAAAAGCA CAGGAAACCC TCGTTTCAGG GAAAATTTGG CAAGATATTT ACCTATTGTT GAAAAATATA AAACCAAAAT TCAAATACGA GCACTCTGTT TGGTTTCACT CACTAATAAT TACAAGCAGC TGTGGGATAG TGTTGATTTA CAACTGGCAA TGCACCAACG CTGGAGCCGT AACCTCCCAC AATTACCTCA AGATTTCTTC GCCAAACTGA CCCCAGAGTG GCAGCGTAAC TGCGCTTTAC GCTCCGACTA CAGTCGCCGT CAGGCGCTGG TGGAAATCGA CGTACTGGTG GCGCAGGCGC TAGGGTTAAC CCTCGAAGAG CTGCTTACCA TTTACCGTGT ACAGTTCCCG GTTATGCGCC AGTACGAAGC GGACACCTGG TACGATCAAA ACGGTCGTAT TATCTTTACC CCAAGCAAAG GACTGGTGGG CGTTGGCTTG CCGCGTACCG CGCGTAAAGC TGACCTGAAA AACGGCTTTG TCTTTAACGT CGACAGCCCG GACTGGACCG GCGGTGACTG CACCGATCAA GCCATCGGCT GGGATGATGT CAAACATCTT AAAACCGGTA CCGTCAGCGT CACCTTTGAT GACTATACCC GCAGCGACGA AGGCGAGCGC CGTACCGTCA CCTGGCAGGC TCCGTTTATC AAGCCAGATC GCGAAGATGA CTACAAAGTG GCCTGGGCGT TCTTTGCACA AGATAAGGAG AGCGCCTGA
|
Protein sequence | MALVGINNEN EFYSNHYLGE VFTSDIRDVL EPWIAQENAA REAERAAREQ GKDVEPGYRA PWNQFNSLAT EFFRKLAEHE KQRQIPQRLA DQRNRWQPLL KALGYEITPQ IQMLDDDTPL PVLARYNSTD GSPWLWIVEA HDQEEGTLDP LALSLLTAQF PADTDKHKRD SLRKKANGEY RSWQDLLSTA VFTQNEPPRF VLLLGNRQLL LLDRTKWAQN RLLRFDFEEI LSRRETDTLK ATAVLLHKDS LLPGSGAPYL DSLDDNSHKH AFGVSEDLKY ALRESIELLG NEAMHYLIDR GLANYTGNRA VDPDELSREC LRYMYRLLFL FYIEARPELG YAPMTAKTYL QGYSLETLRD LEMIPLTSEE DRNGRYFHDS LNMLFKLVRE GYNGGVKIQS DLESGDRITI HSHQFSVPRL ESHLFDANNT RILNRVVFRN ETLQQIIQAM SLSRPGKGRF NRRGRISYRQ LGINQLGAVY EALLSYRGFF ASEDLYEVKK AGEEFNELET GYFVSKDEIG KYHEDEKVYE KDGSLRIHRK GSFIYRMAGR DREKSASYYT PEVLTRSLVK YALKELFKEQ IDPISDPHAK ADAILNLTVC EPAMGSAAFL NEAINQLAEA YLFHKQQAEG RRIPQDRYTQ ELQRVKMYIA DNNVFGVDLN PVAVELAEVS LWLNAISGDA FVPWFGYQLH CGNSLVGARR QVFNKSELTY KKAKDPSWLN SEPVELAMNT PREETQIFHF LLPDGGMANY SDKTVKQRYP DDFKALDSWR KEFIKSFAGH EIADVQRISE KVEALWNTYR QQLKAERLKT ADSYPVWPAE NSEQTRSSLS SKDETFSGRL EDNSAYQKLR WVMDYWCALW FWPIDKADEL PDRGTWLFEI ETLLDGIVIT EKVTEVAEHT TGDLFAEDGL LREESSLFSV AGRLKTEVLF RHLPRLAIVD ALRKQHRFFH WDLEFCDLFA ERGGFDLMLG NPPWLKVEWQ EAGVLGDYEP EFVLRKLSAS KLAMLRIDTF NQIPALEAAW RSEYEGCEGM QNFLNAQQNY PVLRGVQTNL YKCFLPQAWR LGAEKGVAGF LHPEGIYDDP KGGQLRAAVY PRLRAHFQFH NELSLFAEVH HATMFSINVY GPQNTTPSFI NMSNVYAVSA IDASFEHSNA GPVPGIKDEQ EIEGKIKVSW NTSGHRSRLI HIGLKELSLF ARLYDSEGTP AWQARLPALH AKQLVAVLEK FANQPKRLGD LQGQYFSTVM FDETYAQRDG TILRQTQFPQ DSSQWVLSGP HFFVGTPFYK TPRQNCTLNS DYDCLDLLTL PDDYLPRTNY IPACDAQEYA KRTPRVSWKE QDEDEPRKVT DYYRFVARSM LSQSGERTLI PAIFPAGVAH IDPCFSIAFN EVETLLSFTG QSMSICHDFL IKSTGNPRFR ENLARYLPIV EKYKTKIQIR ALCLVSLTNN YKQLWDSVDL QLAMHQRWSR NLPQLPQDFF AKLTPEWQRN CALRSDYSRR QALVEIDVLV AQALGLTLEE LLTIYRVQFP VMRQYEADTW YDQNGRIIFT PSKGLVGVGL PRTARKADLK NGFVFNVDSP DWTGGDCTDQ AIGWDDVKHL KTGTVSVTFD DYTRSDEGER RTVTWQAPFI KPDREDDYKV AWAFFAQDKE SA
|
| |