Gene EcolC_2504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2504 
Symbol 
ID6066341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2754773 
End bp2755795 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content51% 
IMG OID641601910 
Producthypothetical protein 
Protein accessionYP_001725462 
Protein GI170020508 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000670285 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000117832 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT 
GTGGGCGTCT GGAAGGTTCG CCATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG
ATATTTACCC TGAAGCCAGG GACCGGACGT CTGGCGCTCG GTGAACAGCT TTATGCTGAT
AAGATCATCA ATCGCCCACG GGTTTTTCAA TGGCTGCTGC GTATCGAACC GGATCTTTCT
CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA
TTGTTGGAAA GCGGTAAAGA AGCACAGTTC CCGCTGCGAC TGGTAGAAGG GATGCGTCTG
AGCGACTACC TCAAGCAATT GCGTGAGGCT CCGTATATCA AGCACACGCT GAGTGACGAT
AAGTACGCCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CGGAGTGGAT TGAAGGTTGG
TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA
GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG
CCTTATAAAG ATAAAAATCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC
GTTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGTTTACG CATTGGTATG
CGCTTGCAGA CCGACCCAAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT
TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG
CCGGGTGCGA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA
ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT
GCCAGTCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG
TAA
 
Protein sequence
MKKVLLIILL LLVVLGIAAG VGVWKVRHLA DSKLLIKEET IFTLKPGTGR LALGEQLYAD 
KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL
SDYLKQLREA PYIKHTLSDD KYATVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR
AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ