Gene EcHS_A1689 details

Gene Information

Locus tag: EcHS_A1689
Symbol:
ID: 5592050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1711986
End bp: 1713494
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 49%
IMG OID: 640920837
Product: hypothetical protein
Protein accession: YP_001458393
Protein GI: 157161075
COG category: [S] Function unknown
COG ID: [COG5339] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
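
The start, end, gene length, and protein length fields above are consistent with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1711986, 1713494)
print(length, protein_length_aa(length))  # 1509 502
```

Both results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.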


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC 
GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG
AACGCGCAAC TCAAACTGAC AGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT
CATCGCGGCG TATTCAGCAG CCAGTTGCAA CTGTTGGTGA AACCCATTGC CGGGAAAGAA
AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC
TTCCCGCTTG CCCAGCTTAA AAAACTGAAC CTGATCCCGT CGATGGCATC AATTCAAACC
ACGCTGGTTA ATAACGAAGT AAGCAAACCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT
TTTGAGATTA ACTCGCGCAT TGGTTACAGC GGTGATTCCA GTTCCGATAT TTCGCTCAAG
CCACTGAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA
AATGCTGACA GAGACGGCAA AGCCATCTCC CTTTCCGGGG AGGCGCAAAG TGGTCGGATA
GACGCAGTTA ACGAATACAA CCAGAAAGTG CAGTTGACCT TTAATAATCT GAAAACCGAC
GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGAA
AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC
GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA
AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGCGGCA AGCTGACTTT AAAAGTCGGC
CAGATTGATG GTGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG
CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA
GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG
CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT
CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGGCGCAGG AAGTAGATCG TTCGGTTAAA
TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTTAT GACTCAGGTA
GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA
GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC
ACCAGCCTGC AATATACTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA
GATTTCGTTG GTATGTTTGC AATGCCGGCA TTAAATGTTC CGGTCGTACC CGCTATTCCG
CAGCAGTAA
 
Protein sequence
MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY 
HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT
TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE
KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP
LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATEFMTQV
AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMPLE
DFVGMFAMPA LNVPVVPAIP QQ
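
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the gene sequence can be cleaned, translated, and checked against the protein sequence with the code below. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons, and it does not handle the alternative start codons that table 11 permits. The GC helper is a plain G+C fraction; how the record's GC content value was actually computed is not stated. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard elongation code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence from this record.
first_line = "ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC"
dna = clean_sequence(first_line)
print(translate(dna))
print(round(gc_fraction(dna), 2))
```

Translating that first line gives MNKSLVAVGVIVALGVVWTG, which matches the first 20 residues listed under Protein sequence. Passing the full gene sequence through the same functions would allow a comparison with the complete protein sequence and with the GC content field.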