Gene EcolC_2753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2753 
Symbol 
ID6066221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3025168 
End bp3028245 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content56% 
IMG OID641602159 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001725708 
Protein GI170020754 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000729903 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGATA ATAACCTGCG CCTGCAGGTC ATTCTTAATG CGGTTGACAA ACTCACCCGC 
CCATTCCGTG CTGCACAGGC CAGTTCGAAA GAGCTGGCTG GCGCAATCAG AAACTCCCGT
GACGCATTAA AGCAACTCAA TCAGGCGGGT AACAGCCTGG AAAAATTTCG CAAGCTGCAG
GCCGATAACA AAAAGTTAGG CGACAGGCTG AACTATGCCA GACAGAAGGC AAATTTGCTT
AGTTCTGAGC TGGAAGCGAT GGAACAACCA TCACAAAGGC ATCTTGTGGC TTTAGGTCGG
CAAACGCTGG CAGTCCAACG CCTGGAAGAA CAACAAAAAT ATTTGCAGAA GCAAACGGCG
CTTGTGCGTG CAGAACTGTA CCGGGCGGGA ATTTCTGCGA AAGATGATGC GGGAGCAACT
GCCCGTTTAG CCCGTGAAAC ATCACGTTAT AACCAGGAAC TTTCGAAACA AGAGGCGCGG
CTGAAGCGAC TGGGGGAAGC TCAGCGCAGG ATGAATGCGG CGCGTGCCAG TTATGCCCGT
TCGCTGGAGG TGCGTGATCG TATTGCAGGT GCCGGAGCCA CCACCACGGC TGCAGGGCTG
GCAATGGGCG CACCAGTGAT GGCGGCAGTA AAAAGCTATA CCAGCATGGA AGATGCCATG
AAAGGTGTGG CAAAGCAGGT CAATGGTCTG CGTGACGATA ATGGCAACCG CACCGCGCGT
TTTTACGAAA TGCAGGATGT CATCAAGGCT GCCAGCGAAC AGTTGCCGAT GGAAAACGGT
GCTGTGGACT TTGCCGCACT GGTTGAAGGT GGTGCGCGCA TGAACGTCGC AAATCCTGAC
GACAGCTGGG AAGACCAGAA ACGTGACCTG CTGGCCTTCG CCAGCACGGC AGCAAAGGCG
GCAACAGCCT TTGAGCTGCC AGCGGATGAA CTGTCAGAAA GTCTGGGGAA AATCGCCCAG
CTCTACAAAA TCCCTACCCG CAATATTGAA CAGCTCGGTG ATGCGCTGAA CTATCTGGAT
GATAACGCCA TGTCGAAAGG GGCAGACATC ATTGATGTCA TGCAACGCCT GGGCGGTGTG
GCTGATCGTC TGGATTATCG TAAAGCGGCG GCGCTGGGTT CCACCTTTCT GACACTGGGC
GCTGCGCCAG AGGTTGCAGC CAGTGCAGCA AACGCGATGG TGCGTGAATT GTCCATTGCC
ACCATGCAAA GCAAGAGTTT CTTTGAAGGG ATGAATCTGC TGAAACTCAA TCCTGAAGTG
ATTGAAAAGC AGATGACGAA GGATGCGATG GGAACTATCC AGCGTGTGCT GGAGAAGGTG
AACGCACTGC CGCAGGACAA GCGTCTGTCT GCCATGACCA TGTTGTTTGG TAAAGAGTTT
GGCGATGACG CGGCGAAACT GGCAAACAAC CTGCCGGAAC TACAGCGCCA GCTAAAACTG
ACAGCGGGCA ATGATGCGCT CGGCTCCATG CAGAAAGAAT CCGACATCAA CAAGGACTCA
CTTTCTGCTC AGTGGTTGCT GGTTAAAACC GGAGCGCAGA ATACCTTCAG CAGCCTGGGC
GAAACGCTGC GCCAGCCGCT GATGGATATT CTGTACACGG TGAAAAGCGT TACGGGAGCG
TTGCGTCGCT GGGTCGAAGC TAACCCGGAA CTGACGGGCA CACTGATGAA AGTAGCCGCT
ATTGTGGCTG CGGTTACCGT AGGCCTCGGC ACCTTGGCTG TGGCGCTGGC TGCAGTGCTG
GGGCCGCTGG CAGTCATCCG TCTGGGATTC TCTGTGCTGG GCATAAAAAC GTTACCTTCC
GTTACGGCAG CAGTAACACG AACCAGCAGC GCGTTGTCCT GGCTGGCTGG CGCTCCACTG
GCAGTGCTGC GACGCGGGCT TGCTTCATCG GGTAACGCAG CGGGTTTACT TACTGCGCCG
TTGTCGTCTT TGCGCCGTAC GGCATCACTG ACGGGAAATG TCCTGAAAAC TGTAGCAGGT
GCGCCGGTTG CACTATTGCG GTCTGGATTA TCCGCTTTAC GTGCTGTTGC TGTGATGTTT
ATGAATCCTC TGGCGGTACT GCGCGGTGGA CTGGCTGCCG CAGGCGCGGT GCTTCGTGTG
CTTGCATCTG GTCCGCTGGC GATGCTGCGC GTTGCCCTGT ATGCCATATC TGGTCTGTTA
GGTGCTCTGC TCAGTCCGAT AGGTCTTGTG GTTACTGCAC TGGCGGGTGT GGCGCTGGTT
GTCTGGAAAT ACTGGCAACC CATCACCGCA TTTCTCGGTG GCGTGGTGGA AGGATTCAAA
GCGGCGGCAG GTCCCATCAG TGCTGCATTC GAACCACTTA AGCCTGTGTT TCAGTGGATT
GGCGACAAAG TACAGGCGCT GTGGGGCTGG TTTACTGATC TGCTGACGCC CGTTAAGTCG
ACCTCTGCCG AACTGCAGAG CGCAGCGGCA ATGGGGCGAC GATTCGGGGA GGCACTGGCG
GAAGGGCTGA ATATGGTCAT GCATCCGCTG GACTCCCTGA AATCCGGTGT TTCCTGGTTG
CTGGAGAAGC TCGGCATTGT CAGTAAAGAG GCTGCAAAGG CGAAACTGCC GGAAAGCGTG
ACGCGTCAGC AACCTGCGAC GGTGAATGCA GACGGTAAAG TGATGATGCC ATCGGGTGGT
TTTCCGTCAT GGGGATATGG CTTTGCGGGG ATGTATGACA GCGGCGGCTA TATCCCGCGC
GGGCAGTTTG GCATCGTCGG TGAAAACGGG CCGGAAATTG TCAACGGCCC GGCAAACGTG
ACCAGCCGGA GAAATACCGC TGCACTGGCT GCGGTTGTTG CCGGAATGAT GGGCGTTGCT
GCCGCGCCTA CAGAGCTTCC ACCGTTACAT CCTTTGGCAC TTCCCGCGAA AGGCGGCGAA
GCGATGGTGA GTCGTGCAGC CACTGTGCCG CCCGTTCAAC GGATTGAGGC ACCGACGCAG
ATCATCATTC AGACGCAGCC AGGACAAAGT GCGCAGGATA TTGCGCGGGA GGTGGCCCGC
CAGCTTGATG AACGTGAACG CAGGCTGAAG GCAAAAGCCA GGAGTAACTA CAGCGATCAG
GGGGGATACG ACGCATGA
 
Protein sequence
MSDNNLRLQV ILNAVDKLTR PFRAAQASSK ELAGAIRNSR DALKQLNQAG NSLEKFRKLQ 
ADNKKLGDRL NYARQKANLL SSELEAMEQP SQRHLVALGR QTLAVQRLEE QQKYLQKQTA
LVRAELYRAG ISAKDDAGAT ARLARETSRY NQELSKQEAR LKRLGEAQRR MNAARASYAR
SLEVRDRIAG AGATTTAAGL AMGAPVMAAV KSYTSMEDAM KGVAKQVNGL RDDNGNRTAR
FYEMQDVIKA ASEQLPMENG AVDFAALVEG GARMNVANPD DSWEDQKRDL LAFASTAAKA
ATAFELPADE LSESLGKIAQ LYKIPTRNIE QLGDALNYLD DNAMSKGADI IDVMQRLGGV
ADRLDYRKAA ALGSTFLTLG AAPEVAASAA NAMVRELSIA TMQSKSFFEG MNLLKLNPEV
IEKQMTKDAM GTIQRVLEKV NALPQDKRLS AMTMLFGKEF GDDAAKLANN LPELQRQLKL
TAGNDALGSM QKESDINKDS LSAQWLLVKT GAQNTFSSLG ETLRQPLMDI LYTVKSVTGA
LRRWVEANPE LTGTLMKVAA IVAAVTVGLG TLAVALAAVL GPLAVIRLGF SVLGIKTLPS
VTAAVTRTSS ALSWLAGAPL AVLRRGLASS GNAAGLLTAP LSSLRRTASL TGNVLKTVAG
APVALLRSGL SALRAVAVMF MNPLAVLRGG LAAAGAVLRV LASGPLAMLR VALYAISGLL
GALLSPIGLV VTALAGVALV VWKYWQPITA FLGGVVEGFK AAAGPISAAF EPLKPVFQWI
GDKVQALWGW FTDLLTPVKS TSAELQSAAA MGRRFGEALA EGLNMVMHPL DSLKSGVSWL
LEKLGIVSKE AAKAKLPESV TRQQPATVNA DGKVMMPSGG FPSWGYGFAG MYDSGGYIPR
GQFGIVGENG PEIVNGPANV TSRRNTAALA AVVAGMMGVA AAPTELPPLH PLALPAKGGE
AMVSRAATVP PVQRIEAPTQ IIIQTQPGQS AQDIAREVAR QLDERERRLK AKARSNYSDQ
GGYDA