Gene EcHS_A1219 details


Gene Information

Locus tag: EcHS_A1219
Symbol:
ID: 5593685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1217792
End bp: 1218814
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 51%
IMG OID: 640920379
Product: hypothetical protein
Protein accession: YP_001457941
Protein GI: 157160623
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
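
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    when has_stop is True, whose final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length_bp(1217792, 1218814)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

Under these assumptions the values reproduce the 1023 bp and 340 aa reported in the record.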


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000544785
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT 
GTGGGCGTCT GGAAGGTTCG CCATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG
ATATTTACCC TGAAGCCAGG GACCGGACGT CTGGCGCTCG GTGAACAGCT TTATGCTGAT
AAGATCATCA ATCGCCCACG GGTTTTTCAA TGGCTGCTGC GTATCGAACC GGATCTTTCT
CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA
TTGTTGGAAA GCGGTAAAGA AGCACAGTTC CCGCTGCGAC TGGTAGAAGG GATGCGTCTG
AGCGACTACC TCAAGCAATT GCGTGAGGCT CCGTATATCA AGCACACGCT GAGTGACGAT
AAGTACGCCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CGGAGTGGAT TGAAGGTTGG
TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA
GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG
CCTTATAAAG ATAAAAATCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC
GTTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGTTTACG CATTGGTATG
CGCTTGCAGA CCGACCCAAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT
TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG
CCGGGTGCGA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA
ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT
GCCAGTCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG
TAA
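
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, a GC percentage comparable to the GC content field could be computed as follows. The helper names are hypothetical, and how the record rounds its reported value is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, pasted as displayed.
first_line = "ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the value listed under Gene Information.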
 
Protein sequence
MKKVLLIILL LLVVLGIAAG VGVWKVRHLA DSKLLIKEET IFTLKPGTGR LALGEQLYAD 
KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL
SDYLKQLREA PYIKHTLSDD KYATVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR
AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ
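
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a plain-Python illustration rather than the method used to generate this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation. The names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, pasted as displayed.
head = "ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT"
print(translate(head))  # MKKVLLIILLLLVVLGIAAG
```

The output matches the first two blocks of the protein sequence. Running the full gene sequence through `translate` allows the same comparison over the whole record.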