Gene EcHS_A4528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4528 
Symbol 
ID5592200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4534995 
End bp4536170 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content37% 
IMG OID640923624 
Producthypothetical protein 
Protein accessionYP_001461064 
Protein GI157163746 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.179172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAA ACCCAGAGAA GTGTATTTTT TGTGATATTC CCTTTGATAA AGGTAGCCTA 
GAGCATGTAT TCCCTAGTGC GTTAGGTGGA AGAATTACCA CAACACATGC GACATGCAAA
AGTTGCAACA ATTTATTTTC AGAAGCGAGT TCAGATGCGG TTGAGATCGC TCTGGCTGAT
AACTTTATTT ATATCAGGAA TGCTTTGAAT GTTTGGTCTG GAAGAGGAAA CCCACCACCA
ACAATAAAGG AAGCAGGTCA GTTTGATGAT GGCATCAAGT ATGATCTCGC ACCAAACTTG
ACACCTATTG TGTCTAAGTC AAGAATACCC TCAAAAGATG AAACTGATAG CAATACAGTC
TTTGATTTCG TTGCTCAAGA TGTAGGTGAT GCTAATAGGA TTGTAGGAAT TCTTAAGAAG
CGCGGCCTTA ATATAGGGGA TATTAATGCA AAATATGTTA CAACTAAGGC TCCTGTTATA
AGAGCTAGCA TTAAGTTTGA AGGAAATAAG ATTTTTCGTG CTATAGCGAA AATTGCTGTA
GTCTCCTATG TTGTTTTGTA TGGCAATTCG CGAGCAAGAA CCGATATCTA TCAGAGCCTT
AGGGGATCCA TACGAAGTGG AGAACCTGAC ATTACAAAAT ATTGCGGATG GGACTATACA
AATGATTTTC CTGTCATTAC AAATTTACAC CCACACGAAA AAACCCCAGA CGCCATTCAA
TGTGGTTTTG AACACACTGT ATTTATAACT AATGTAAATC ACCAATTGGT TGCTTATATA
AAGCTTTTTG GTGCATTTAA TTTCTCTATT ATTTTAGGTA ATCATTCGAG TATATCACCT
AAATGCTTGT GCTTAAATCC TACAGCAGGA AAATCCTCAA GGTTTAACGT TTTATTTAAC
CCGCCATTAA GTTACATACC TAAAAATATT GACTCATTCA AAATTGAACA TGAATCCGTT
AGGAAACATG TTCAGTTAGC AATGAGTTCT ATAGTAGAGC ACTGTCAAAG TTTATCGACC
GAAGAATATA TTAGAAGCCT CAGCCAAGAG TTAATGATTT CTGTTCAAAC TGCATCCGTT
GACTCTGACA TATCTGAAAT AATCAGATCA TTTTCAGAGA AGCTTGCTCA TATAGAAAAT
GGATTGGCGT GGGAAGAAGA AATTAATATT GAATAA
 
Protein sequence
MQTNPEKCIF CDIPFDKGSL EHVFPSALGG RITTTHATCK SCNNLFSEAS SDAVEIALAD 
NFIYIRNALN VWSGRGNPPP TIKEAGQFDD GIKYDLAPNL TPIVSKSRIP SKDETDSNTV
FDFVAQDVGD ANRIVGILKK RGLNIGDINA KYVTTKAPVI RASIKFEGNK IFRAIAKIAV
VSYVVLYGNS RARTDIYQSL RGSIRSGEPD ITKYCGWDYT NDFPVITNLH PHEKTPDAIQ
CGFEHTVFIT NVNHQLVAYI KLFGAFNFSI ILGNHSSISP KCLCLNPTAG KSSRFNVLFN
PPLSYIPKNI DSFKIEHESV RKHVQLAMSS IVEHCQSLST EEYIRSLSQE LMISVQTASV
DSDISEIIRS FSEKLAHIEN GLAWEEEINI E