Gene Information |
Locus tag | EcolC_2109 |
Symbol | |
ID | 6067203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2301748 |
End bp | 2305227 |
Gene Length | 3480 bp |
Protein Length | 1159 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641601517 |
Product | hypothetical protein |
Protein accession | YP_001725076 |
Protein GI | 170020122 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
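The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene span includes the terminal stop codon; both assumptions are consistent with the values listed here, but neither is stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length counts the stop codon unless told otherwise.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information fields of EcolC_2109.
length = gene_length_bp(2301748, 2305227)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the listed 3480 bp and 1159 aa.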
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000420238 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTAAAG GCAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC ACGCAGCTGC TGAGTGTGAT CGATGTCATC AGCGAAGGGC CGATTGAAGG TCCGGTGGAT GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAGGG GAATACCAAT ATCTCCGGCG TCACGGTGGT GTTCCGGGCA GGTGAGCAGG AGCAGACGCC GCCGGAGGGC TTTGAATCCT CCGGTTCCGA GACGGTGCTG GGTACGGAAG TGAAATACGA CACGCCGATT ACCCGGACCA TCACGTCTGC AAACATCGAC CGTCTGCGCT TTACCTTCGG TGTGCAGGCA CTGGTGGAAA CCACCTCAAA GGGGGACCGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAAGG CAAAACCACC TCACAGTATC TGGCATCGGT GGTGGTGGGT AACCTGCCGC CGCGCCCGTT CAATATCCGG ATGCGCAGGA TGACGCCGGA CAGCACCACA GATCAGCTGC AGAACAAAAC GCTCTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTA CAGGTGGATT CGGAGCAGTT CGGCAGCCAG CAGGTGAGCC GTAATTATCA TCTGCGCGGG CGTATTCTGC AGGTGCCGTC GAACTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG GACGGAACGT TAAAACCGGC ATACAGCAAT AACATGGCCT GGTGTCTGTG GGATATGCTG ACCCATCCAC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG CCGCGCATCA CCTGTAATGC GTACCTGACC ACACAACGTA AGGCGTGGGA TGTGCTCAGT GATTTCTGCT CGGTGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG CAGGACCGGC CGTCGGATAA GGCGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT GATGGCGCGC CGTTCCGCTA CAGCTTCAGC GCCCTGAAAG ACCGCCATAA TGCCGTTGAG GTGAACTGGA TTGACCCGAA CAACGGCTGG GAGACGGCGA CAGAGCTTGT TGAAGATACG CAGGCCATTG CCCGTTACGG TCGTAATGTT ACGAAGATGG ATGCCTTTGG CTGTACCAGC CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CGGAGCTGCT GGAGACGCAG ACCGTGGATT TCAGCGTCGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTTATTGAA ATCTGCGATG ATGACTATGC CGGTATCAGC ACCGGTGGTC GTGTGCTGGC GGTGAACAGC CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCACGCTG ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GAGGGTGTTG CTGAATACAG CGTATGGGGG CTGAAGCTGC CGACGCTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC GACGGCACGT ATGCCATCGC CGCCGTGCAG CATGTGCCGG AAAAAGAGGC CATCGTGGAT 
AACGGGGCGC ACTTTGACGG CGAACAGAGT GGCACGGTGA ATGGTGTCAC GCCGCCAGCG GTGCAGCACC TGACCGCAGA AGTCACTGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA TGGGACACAC CGAAGGTGGT GAAGGGCGTG AGCTTTATGC TTCGCCTGAC CGTGGCAGCG GACGACGGCA GTGAGCGGCT GGTCAGCGCG GCCCGGACGA CAGAAACCAC ATACCGCTTC AGGCAGCTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGGCAG CAGGGCGATC CGGCATCGGT ATCGTTCCGG ATTGCCGCAC CTGCAGTGCC GTCGCGGATT GAGCTGACGC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCCGT TTATGACCCG ACGGTACAGT TTGAGTTCTG GTTCTCGGAA AAGCGGATTA CCGATATCAG GCAGGTTGAA ACCACAGCCC GCTATCTTGG CACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA CCGGGCCATG ATTATTATTT TTACGTTCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC GTGGAGGCTG TTGGTCAGCC GAGTGATGAT GCATCCGGCT ATCTGGATTT TTTCAAAGGC GAGATAGGGA AAACCCATCT GGCTCAGGAG CTGTGGACGC AGATTGATAA CGGTCAGCTT GCGCCTGACC TGGCTGAAAT CAGGACGTCC ATTACGGATG TCAGCAATGA AATCACGCAG ACCGTCAATA AGAAACTGGA AGACCAGAGT GCGGCAATTC AGCAGATACA GAAGGTTCAG GTTGATACAA ATAATAACCT GAACAGCATG TGGGCTGTGA AGCTGCAGCA GATGCAGGAC GGACGCCTTT ATATCGCGGG TATTGGTGCC GGTATTGAGA ACACCCCTGA CGGCATGCAG AGTCAGGTGC TGCTGGCGGC GGACAGGATT GCGATGGTTA ATCCTGCGAA TGGCAACACA AAACCGATGT TTGTTGGTCA GGGCGATCAG ATATTCATGA ACGACGTGTT CCTGAAACGC CTGACGGCCC CCACCATTAC CAGCGGTGGA AATCCACCGG CATTTTCCCT GACACCGGAC GGAAAGCTGA CTGCTAAAAA TGCGGATATC AGTGGCAGTG TGAATGCGAA CTCCGGGACG CTCAACAACG TCACGATTAA TGAGAACTGT CAGATTAAGG GGAAACTGTC AGCCAATCAG ATTGAAGGCG ATATTGTCAA AACAGTGGGT AAGGCTTTCC CGCGGGACTC CCGGGCACCG GAGCGGTGGC CATCAGGGAC CATTACCGTC AGGATTTATG ACGATCAGCC TTTTGACAGG CAAATTGTTA TTCCAGCGGT GGCATTCTGC GGCGCTAAAC ATGAGCGGGA GAATAACGAT ATTTATTCGT CATGCCGCCT GATAGTGAAG AAAAATGGTG CTGAAATTTA TAACCGTACC GCGCTGGATA ATACGCTGAT TTACAGTGGT GTTATTGATA TGCCAGCTGG TCACGGCCAC ATGACGCTGG AGTTTTCGGT GTCAGCATGG CTGGTAAATG ACTGGTATCC CACAGCAAGT ATCAGCGATT TGCTGGTTGT GGTGATGAAG AAAGCCACCG CAGGCATCAG TATCAGCTGA
Protein sequence | MGKGSSKGHT PREAKDNLKS TQLLSVIDVI SEGPIEGPVD GLKSVLLNST PVLDSEGNTN ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVG NLPPRPFNIR MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG RILQVPSNYN PQTRQYSGIW DGTLKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSVMRCMP VWNGQTLTFV QDRPSDKAWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWIDPNNGW ETATELVEDT QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE ICDDDYAGIS TGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD GVKVKVSRVP EGVAEYSVWG LKLPTLRQRL FRCVSIREND DGTYAIAAVQ HVPEKEAIVD NGAHFDGEQS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFMLRLTVAA DDGSERLVSA ARTTETTYRF RQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAVPSRI ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE KRITDIRQVE TTARYLGTAL YWIAASINIK PGHDYYFYVR SVNTVGKSAF VEAVGQPSDD ASGYLDFFKG EIGKTHLAQE LWTQIDNGQL APDLAEIRTS ITDVSNEITQ TVNKKLEDQS AAIQQIQKVQ VDTNNNLNSM WAVKLQQMQD GRLYIAGIGA GIENTPDGMQ SQVLLAADRI AMVNPANGNT KPMFVGQGDQ IFMNDVFLKR LTAPTITSGG NPPAFSLTPD GKLTAKNADI SGSVNANSGT LNNVTINENC QIKGKLSANQ IEGDIVKTVG KAFPRDSRAP ERWPSGTITV RIYDDQPFDR QIVIPAVAFC GAKHERENND IYSSCRLIVK KNGAEIYNRT ALDNTLIYSG VIDMPAGHGH MTLEFSVSAW LVNDWYPTAS ISDLLVVVMK KATAGISIS
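As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a space-grouped DNA string and computes its GC percentage. It is a minimal example, not the database's own method: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments). Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence of EcolC_2109.
prefix = "ATGGGTAAAG GCAGCAGTAA GGGGCATACC"
print(translate(prefix))  # MGKGSSKGHT
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.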