Gene EcolC_2104 details


Gene Information

Locus tag: EcolC_2104
Symbol:
ID: 6067235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2296784
End bp: 2299345
Gene Length: 2562 bp
Protein Length: 853 aa
Translation table: 11
GC content: 59%
IMG OID: 641601512
Product: lambda family phage tail tape measure protein
Protein accession: YP_001725071
Protein GI: 170020117
COG category: [S] Function unknown
COG ID: [COG5281] Phage-related minor tail protein
TIGRFAM ID: [TIGR01541] phage tail tape measure protein, lambda family
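
The coordinates and lengths listed above agree with one another if the Start bp and End bp values are read as inclusive positions on the replicon. The record does not state that convention explicitly, so the following is a minimal sketch under that assumption; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2296784
end_bp = 2299345

# Assumption: both coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the final codon is a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2562 0 853
```

Under this reading, the 2562 bp gene holds 854 codons, which matches the listed protein length of 853 aa plus one stop codon.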


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000689811
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTGAGTC TGGATGCGGC CAGATTTGAC 
GAGCAGATGG CCAGAGTCAG GCGTCATTTT TCCGGTACGG AAAGTGATGC GAAAAAAACA
GCGGCAGTCG TTGAACAGTC GATGAACCGG CAGGCGCTGG CTGCACAGAA AGCGGGAATT
TCCGTCGGGC AGTATAAAGC CGCCATGCGT ATGCTGCCTG CGCAGTTCAC TGACGTGGCC
ACGCAGCTTG CAGGCGGGCA AAGTCCGTGG CTGATCCTGC TGCAACAGGG GGGTCAGGTG
AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCCGGGGGC TTGCCGGTGC GATCACCCTG
CCGATGGTGG GGGCTACCTC GCTGGCGGTG GCGACCGGAG CGCTGGCGTA TGCCTGGTAT
CAGGGCAACT CAACCCTGTC CGATTTCAAC AAAACGCTGG TCCTTTCCGG CAATCAGGCG
GGACTGACGG CAGATCGTAT GCTGGTCCTG TCCAGAGCCG GGCAGGCGGC AGGGCTGACG
TTTAACCAGA CCAGCGAGTC ACTCAGCGCA CTGGTTAAGG CGGGAGTAAG CGGTGAGGCT
CAGATTGCGT CCATCAGCCA GAGTGTGGCG CGTTTCTCCT CTGCATCCGG CGTGGAGGTG
GACAAGGTCG CTGAAGCCTT CGGGAAGCTG ACCACAGACC CGACGTCGGG GCTGACGGCG
ATGGCACGCC AGTTCCATAA CGTGACGGCG GAGCAGATTG CGTATGTTGC TCAGTTGCAG
CGTTCCGGCG ATGAAGCCGG GGCATTGCAG GCGGCGAACG AGGCCGCGAC GAAAGGGTTT
GATGACCAGA CCCGCCGCCT GAAAGAGAAC ATGGGCACGC TGGAAACCTG GGCAGACAGG
ACAGCACGGG CATTCAAATC CATGTGGGAT GCGGTGCTGG ATATTGGTCG TCCTGATACC
GCGCAGGAGA TGCTGATTAA GGCAGAGGCC GCGTTTAAGA AAGCGGACGA TATCTGGAAT
CTGCGCAAGG ATGATTATTT TGTTAACGAT GAAGCGCGGG CGCGTTACTG GGATGATCGT
GAAAAGGCCC GTCTTGCGCT TGAAGCCGCC CGAAAGAAGG CTGAGCAGCA GAGTCAACAG
GACAAAAATG CGCAGCAGCA GAGCGATACC GAAGCGTCAC GGCTGAAATA TACCGAAGAG
GCGCAGAAGG CTTACGAACG CCTGCAGACG CCGCTGGAGA AATATACCGC CCGTCAGGAA
GAACTGAACA AGGCACTGAA AGACGGGAAA ATCCTGCAGG CAGATTACAA CACGCTGATG
GCGGCGGCGA AAAAGGATTA TGAAGCGACG CTGAAAAAGC CGAAACAGTC CGGCGTGAAA
GTGTCTGCGG GCGATCGTCA GGAAGACAGT GCTCATGCTG CCCTGCTGAC GCTTCAGGCT
GAACTCCGGA CGCTGGAGAA GCATGCCGGA GCAAATGAGA AAATCAGCCA GCAGCGCCGG
GATTTGTGGA AGGCGGAGAG TCAGTTCGCG GTACTGGAGG AGGCGGCGCA ACGTCGCCAG
CTGTCTGCAC AGGAGAAATC CCTGCTGGCG CATAAAGATG AGACGCTGGA GTACAAACGC
CAGCTGGCTG CACTTGGCGA CAAGGTTACG TATCAGGAGC GCCTGAACGC GCTGGCGCAG
CAGGCGGATA AATTCGCACA GCAGCAACGG GCAAAACGGG CCGCCATTGA TGCGAAAAAC
CGGGGGCTGA CTGACCGGCA GGCAGAACGG GAAGCCACGG AACAGCGCCT GAAGGAACAG
TATGGCGATA ATCCGCTGGC GCTGAATAAC GTCATGTCAG AGCAGAAAAA GACCTGGGCG
GCTGAAGACC AGCTTCGCGG GAGCTGGATG GCAGGCCTGA CGTCCGGCTG GAGTGAGTGG
GAAGAGAGCG CCACGGACAG TATGTCGCAG GTAAAAAGTG CAGCCACGCA GACCTTTGAT
GGTATTGCAC AGAATATGGC GGCGATGCTG ACCGGCAGTG AGCAGAACTG GCGCAGCTTC
ACCCGTTCCG TGCTGTCCAT GATGACAGAA ATTCTGCTTA AGCAGGCAAT GGTGGGGATT
GTCGGGAGTA TCGGCAGCGC CATTGGCGGG GCTGTTGGTG GCGGCGCATC CGCGTCAGGC
GGTACAGCCA TTCAGGCCGC TGCGGCGAAA TTCCATTTTG CAACCGGAGG ATTTACGGGA
ACCGGCGGCA AATATGAGCC AGCGGGGATT GTTCACCGTG GTGAATTTGT CTTCACAAAG
GAGGCAACCA GCCGGATTGG CGTGGGGAAT CTCTACCGGC TGATGCGCGG CTATGCCACC
GGTGGTTATG TCGGTACACC GGGCAGTCTG GCTGACAGCC GGTCGCAGGC GTCCGGGACG
TTTGAGCAGA ATAACCATGT GGTGATTAAC AACGACGGCA CGAACGGGCA GATAGGTCCG
GCTGCTCTGA AGGCGGTGTA TGACATGGCC CGCAAGGGTG CCCGTGATGA AATTCAGACA
CAGATGCGTG ATGGTGGACT GTTCTCCGGA GGTGGACGAT GA
 
Protein sequence
MAEPVGDLVV DLSLDAARFD EQMARVRRHF SGTESDAKKT AAVVEQSMNR QALAAQKAGI 
SVGQYKAAMR MLPAQFTDVA TQLAGGQSPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL
PMVGATSLAV ATGALAYAWY QGNSTLSDFN KTLVLSGNQA GLTADRMLVL SRAGQAAGLT
FNQTSESLSA LVKAGVSGEA QIASISQSVA RFSSASGVEV DKVAEAFGKL TTDPTSGLTA
MARQFHNVTA EQIAYVAQLQ RSGDEAGALQ AANEAATKGF DDQTRRLKEN MGTLETWADR
TARAFKSMWD AVLDIGRPDT AQEMLIKAEA AFKKADDIWN LRKDDYFVND EARARYWDDR
EKARLALEAA RKKAEQQSQQ DKNAQQQSDT EASRLKYTEE AQKAYERLQT PLEKYTARQE
ELNKALKDGK ILQADYNTLM AAAKKDYEAT LKKPKQSGVK VSAGDRQEDS AHAALLTLQA
ELRTLEKHAG ANEKISQQRR DLWKAESQFA VLEEAAQRRQ LSAQEKSLLA HKDETLEYKR
QLAALGDKVT YQERLNALAQ QADKFAQQQR AKRAAIDAKN RGLTDRQAER EATEQRLKEQ
YGDNPLALNN VMSEQKKTWA AEDQLRGSWM AGLTSGWSEW EESATDSMSQ VKSAATQTFD
GIAQNMAAML TGSEQNWRSF TRSVLSMMTE ILLKQAMVGI VGSIGSAIGG AVGGGASASG
GTAIQAAAAK FHFATGGFTG TGGKYEPAGI VHRGEFVFTK EATSRIGVGN LYRLMRGYAT
GGYVGTPGSL ADSRSQASGT FEQNNHVVIN NDGTNGQIGP AALKAVYDMA RKGARDEIQT
QMRDGGLFSG GGR
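
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to check the gene sequence against the protein sequence. It assumes the listed gene sequence is the coding strand in frame from its first base. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. Only the first and last lines of the gene sequence are used here; translation table 11 assigns the same amino acids to codons as the standard genetic code.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked):
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()

def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTGAGTC TGGATGCGGC CAGATTTGAC"
last_line = "CAGATGCGTG ATGGTGGACT GTTCTCCGGA GGTGGACGAT GA"

print(translate(clean(first_line)))  # MAEPVGDLVVDLSLDAARFD
print(translate(clean(last_line)))   # QMRDGGLFSGGGR
```

The two translations match the start and end of the listed protein sequence. The last line can be translated on its own because the 2520 bases before it form a whole number of codons. Applying `gc_fraction` to the complete cleaned gene sequence would give a value to compare with the GC content listed in the record.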