Gene EcHS_A4564 details


Gene Information

Locus tag: EcHS_A4564
Symbol:
ID: 5595312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4570974
End bp: 4572254
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 52%
IMG OID: 640923660
Product: hypothetical protein
Protein accession: YP_001461100
Protein GI: 157163782
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
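
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4570974
END_BP = 4572254
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp, computed_aa)  # prints: 1281 426
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.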


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC 
GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG
AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG
CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT
AAAGACCGGA TTGGCGAAAA TCTCGGCCAG TTCGTGCAGG AAAAATTTCT CGATACCCAG
TCGCTGGTGG CATTGATTCG ACGCCACGAA CCGGTGTTGT TGATTGGCAA CTGGTTTAGT
CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTT
GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA TCGGGCGATT
GATAAGGTCG ATCTTTCCGG CACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT
CGTCATCAGG TGCTACTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT
AAATCGCGCA AGTTTATTGC CCAGCAAATT GTTCGCTGGC TGGAGAGTGA GCATCCACTG
AAAGCCAAAA TTCTCCCCAC CGAATGGTTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC
GCGGTGAATT CTTTGCTTGA TGATATCAGC CGTGATCGTG CGCATCAGAT CCGTCATGCG
TTTGATCGCG CCACTTTTGC CCTGATCGAC AAGTTGAAAA ACGATCCGGA AATGGCAGCG
CGAGCCGATG CCGTAAAAAG TTATCTGAAA GAAGATGAAG CTTTTAACCG CTATCTCAGT
GAATTGTGGG GGGATTTACG GGAGTGGCTG AAAGCGGATA TCAACAGTGA AGATTCTCGT
GTGAAAGAAC GTATCGCGCG GGCGGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT
GCCTTGCGGG CGTCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG
TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCGCGAGAT
ATGTCGCGGC AAATCGAGTT AAATATCGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT
ACGCTGGTTG GCGGTTGTAT TGGGCTAATT TTATATTTGT TGTCGCAGCT CCCGGCCTTG
TTCCCCCTCA GCAATTTTTA G
 
Protein sequence
MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA 
LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PVLLIGNWFS
QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND
RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD
AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS
ELWGDLREWL KADINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE
FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL
FPLSNF
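
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before any length or composition check. The following is a minimal sketch of that cleanup, using only the first and last rows of the gene sequence as sample input. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the record's own GC content figure was rounded is not stated in the source.

```python
# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC"
LAST_ROW = "TTCCCCCTCA GCAATTTTTA G"


def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first = clean_sequence(FIRST_ROW)
last = clean_sequence(LAST_ROW)

# Each full row holds six blocks of ten bases.
print(len(first))             # prints: 60
# The gene begins with an ATG start codon and ends with a TAG stop codon.
print(first[:3], last[-3:])   # prints: ATG TAG
```

Applying `clean_sequence` to the whole gene sequence and passing the result to `gc_percent` gives a value that can be compared with the GC content field.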