Gene EcolC_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2109 
Symbol 
ID6067203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2301748 
End bp2305227 
Gene Length3480 bp 
Protein Length1159 aa 
Translation table11 
GC content56% 
IMG OID641601517 
Producthypothetical protein 
Protein accessionYP_001725076 
Protein GI170020122 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000420238 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAG GCAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC 
ACGCAGCTGC TGAGTGTGAT CGATGTCATC AGCGAAGGGC CGATTGAAGG TCCGGTGGAT
GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAGGG GAATACCAAT
ATCTCCGGCG TCACGGTGGT GTTCCGGGCA GGTGAGCAGG AGCAGACGCC GCCGGAGGGC
TTTGAATCCT CCGGTTCCGA GACGGTGCTG GGTACGGAAG TGAAATACGA CACGCCGATT
ACCCGGACCA TCACGTCTGC AAACATCGAC CGTCTGCGCT TTACCTTCGG TGTGCAGGCA
CTGGTGGAAA CCACCTCAAA GGGGGACCGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG
ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAAGG CAAAACCACC
TCACAGTATC TGGCATCGGT GGTGGTGGGT AACCTGCCGC CGCGCCCGTT CAATATCCGG
ATGCGCAGGA TGACGCCGGA CAGCACCACA GATCAGCTGC AGAACAAAAC GCTCTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTA
CAGGTGGATT CGGAGCAGTT CGGCAGCCAG CAGGTGAGCC GTAATTATCA TCTGCGCGGG
CGTATTCTGC AGGTGCCGTC GAACTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG
GACGGAACGT TAAAACCGGC ATACAGCAAT AACATGGCCT GGTGTCTGTG GGATATGCTG
ACCCATCCAC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG
CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG
CCGCGCATCA CCTGTAATGC GTACCTGACC ACACAACGTA AGGCGTGGGA TGTGCTCAGT
GATTTCTGCT CGGTGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG
CAGGACCGGC CGTCGGATAA GGCGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT
GATGGCGCGC CGTTCCGCTA CAGCTTCAGC GCCCTGAAAG ACCGCCATAA TGCCGTTGAG
GTGAACTGGA TTGACCCGAA CAACGGCTGG GAGACGGCGA CAGAGCTTGT TGAAGATACG
CAGGCCATTG CCCGTTACGG TCGTAATGTT ACGAAGATGG ATGCCTTTGG CTGTACCAGC
CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CGGAGCTGCT GGAGACGCAG
ACCGTGGATT TCAGCGTCGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTTATTGAA
ATCTGCGATG ATGACTATGC CGGTATCAGC ACCGGTGGTC GTGTGCTGGC GGTGAACAGC
CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCACGCTG
ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC
GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GAGGGTGTTG CTGAATACAG CGTATGGGGG
CTGAAGCTGC CGACGCTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC
GACGGCACGT ATGCCATCGC CGCCGTGCAG CATGTGCCGG AAAAAGAGGC CATCGTGGAT
AACGGGGCGC ACTTTGACGG CGAACAGAGT GGCACGGTGA ATGGTGTCAC GCCGCCAGCG
GTGCAGCACC TGACCGCAGA AGTCACTGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA
TGGGACACAC CGAAGGTGGT GAAGGGCGTG AGCTTTATGC TTCGCCTGAC CGTGGCAGCG
GACGACGGCA GTGAGCGGCT GGTCAGCGCG GCCCGGACGA CAGAAACCAC ATACCGCTTC
AGGCAGCTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGGCAG
CAGGGCGATC CGGCATCGGT ATCGTTCCGG ATTGCCGCAC CTGCAGTGCC GTCGCGGATT
GAGCTGACGC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCCGT TTATGACCCG
ACGGTACAGT TTGAGTTCTG GTTCTCGGAA AAGCGGATTA CCGATATCAG GCAGGTTGAA
ACCACAGCCC GCTATCTTGG CACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA
CCGGGCCATG ATTATTATTT TTACGTTCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC
GTGGAGGCTG TTGGTCAGCC GAGTGATGAT GCATCCGGCT ATCTGGATTT TTTCAAAGGC
GAGATAGGGA AAACCCATCT GGCTCAGGAG CTGTGGACGC AGATTGATAA CGGTCAGCTT
GCGCCTGACC TGGCTGAAAT CAGGACGTCC ATTACGGATG TCAGCAATGA AATCACGCAG
ACCGTCAATA AGAAACTGGA AGACCAGAGT GCGGCAATTC AGCAGATACA GAAGGTTCAG
GTTGATACAA ATAATAACCT GAACAGCATG TGGGCTGTGA AGCTGCAGCA GATGCAGGAC
GGACGCCTTT ATATCGCGGG TATTGGTGCC GGTATTGAGA ACACCCCTGA CGGCATGCAG
AGTCAGGTGC TGCTGGCGGC GGACAGGATT GCGATGGTTA ATCCTGCGAA TGGCAACACA
AAACCGATGT TTGTTGGTCA GGGCGATCAG ATATTCATGA ACGACGTGTT CCTGAAACGC
CTGACGGCCC CCACCATTAC CAGCGGTGGA AATCCACCGG CATTTTCCCT GACACCGGAC
GGAAAGCTGA CTGCTAAAAA TGCGGATATC AGTGGCAGTG TGAATGCGAA CTCCGGGACG
CTCAACAACG TCACGATTAA TGAGAACTGT CAGATTAAGG GGAAACTGTC AGCCAATCAG
ATTGAAGGCG ATATTGTCAA AACAGTGGGT AAGGCTTTCC CGCGGGACTC CCGGGCACCG
GAGCGGTGGC CATCAGGGAC CATTACCGTC AGGATTTATG ACGATCAGCC TTTTGACAGG
CAAATTGTTA TTCCAGCGGT GGCATTCTGC GGCGCTAAAC ATGAGCGGGA GAATAACGAT
ATTTATTCGT CATGCCGCCT GATAGTGAAG AAAAATGGTG CTGAAATTTA TAACCGTACC
GCGCTGGATA ATACGCTGAT TTACAGTGGT GTTATTGATA TGCCAGCTGG TCACGGCCAC
ATGACGCTGG AGTTTTCGGT GTCAGCATGG CTGGTAAATG ACTGGTATCC CACAGCAAGT
ATCAGCGATT TGCTGGTTGT GGTGATGAAG AAAGCCACCG CAGGCATCAG TATCAGCTGA
 
Protein sequence
MGKGSSKGHT PREAKDNLKS TQLLSVIDVI SEGPIEGPVD GLKSVLLNST PVLDSEGNTN 
ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA
LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVG NLPPRPFNIR
MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG
RILQVPSNYN PQTRQYSGIW DGTLKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA
LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSVMRCMP VWNGQTLTFV
QDRPSDKAWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWIDPNNGW ETATELVEDT
QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE
ICDDDYAGIS TGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD
GVKVKVSRVP EGVAEYSVWG LKLPTLRQRL FRCVSIREND DGTYAIAAVQ HVPEKEAIVD
NGAHFDGEQS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFMLRLTVAA
DDGSERLVSA ARTTETTYRF RQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAVPSRI
ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE KRITDIRQVE TTARYLGTAL YWIAASINIK
PGHDYYFYVR SVNTVGKSAF VEAVGQPSDD ASGYLDFFKG EIGKTHLAQE LWTQIDNGQL
APDLAEIRTS ITDVSNEITQ TVNKKLEDQS AAIQQIQKVQ VDTNNNLNSM WAVKLQQMQD
GRLYIAGIGA GIENTPDGMQ SQVLLAADRI AMVNPANGNT KPMFVGQGDQ IFMNDVFLKR
LTAPTITSGG NPPAFSLTPD GKLTAKNADI SGSVNANSGT LNNVTINENC QIKGKLSANQ
IEGDIVKTVG KAFPRDSRAP ERWPSGTITV RIYDDQPFDR QIVIPAVAFC GAKHERENND
IYSSCRLIVK KNGAEIYNRT ALDNTLIYSG VIDMPAGHGH MTLEFSVSAW LVNDWYPTAS
ISDLLVVVMK KATAGISIS