Gene EcolC_2751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2751 
Symbol 
ID6065632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3023589 
End bp3024689 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content55% 
IMG OID641602157 
Productlate control D family protein 
Protein accessionYP_001725706 
Protein GI170020752 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000516519 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATTTCA GCTCTGAACT GCTTAACAAA GGCAACAAAA CTCCGGCATT CAGCATCAGT 
ATTGAAGGTA GGGATATCAC CACTGTGCTG GACAACCGCC TGATGGGGCT GACGCTGACG
GATAACCGGG GCTTTGACGC AGACCAGCTT GATCTGGAGC TGGACGACGC CGACGGAAAA
ATCGTGCTGC CGCGCCGTGG TGCGGTCATT ACGCTGGCGC TGGGCTGGAA GGGGCAGCCG
CTTTTCCCGA AAGGGGCATT CACGGTGGAC GAGATTGAAC ACACTGGCGC ACCGGACCGC
CTGACTATCC GGGCGCGAAG TGCTGATTTT CGGGAAACGC TGAATACCCG TCGTGAAAAG
TCGTGGCACA AGACCACCGT CGGGGAAGTG GTGAAGGAAA TAGCCGCGCG GCACAAGCTG
AAGATGGCAC TGGGTAAAGA CCTGTCGGAT AAGCCCGTGG AGCATATAGA CCAGACTAAT
GAGAGTGACG GCAGTTTTCT GATGCGGCTG GCGCGACAGT ACGGTGCCAT CGCGTCGGTG
AAAAATGGCA ATCTGTTATT CATCCGGCAG GGGCAGGGCA AAAGCGCCAC TGGTAAACCA
CTGCCAGTGA TCACTATCAC ACGCAAGGAC GGCGACAGTC ACCGCTTTAC CCTGGCAGAT
CGCGGAGCCT ACACGGGCGT AATTGCCAGC TGGTTGCATA CCCGCGAACC TGCGAAGAAA
GAAAGTACCA CGGTGAAGCG TAAGCGCAGA ACTAAGAAGC AGAAGAATGA GCCGGAAGCG
AAGCAGGGCG ATTACCTGGT GGGTACGGAT GAAAACGTGC TGGTACTTAA TCGCACTTAT
GCCAACCGGA GCAACGCTGA ACGGGCAGCG AAAATGCAGT GGGAACGCCT GCAACGCGGC
GTTGCGTCAT TCTCGCTACA ACTGGCGGAA GGTCGGGCAG ATCTCTACAC GGAAATGCCT
GTGAAGGTCA GCGGCTTTAA ACCGCCGATA GATGATGCGG AATGGACCAT TACGACTCTG
ACACATACCG TCAGCCCGGA TAACGGTTTT ACGACCAGTC TGGAGCTTGA AGTGAGGATT
GATGATTTCG AAATGGAATG A
 
Protein sequence
MNFSSELLNK GNKTPAFSIS IEGRDITTVL DNRLMGLTLT DNRGFDADQL DLELDDADGK 
IVLPRRGAVI TLALGWKGQP LFPKGAFTVD EIEHTGAPDR LTIRARSADF RETLNTRREK
SWHKTTVGEV VKEIAARHKL KMALGKDLSD KPVEHIDQTN ESDGSFLMRL ARQYGAIASV
KNGNLLFIRQ GQGKSATGKP LPVITITRKD GDSHRFTLAD RGAYTGVIAS WLHTREPAKK
ESTTVKRKRR TKKQKNEPEA KQGDYLVGTD ENVLVLNRTY ANRSNAERAA KMQWERLQRG
VASFSLQLAE GRADLYTEMP VKVSGFKPPI DDAEWTITTL THTVSPDNGF TTSLELEVRI
DDFEME