Gene EcHS_A1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1749 
Symbol 
ID5591385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1769155 
End bp1770759 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content48% 
IMG OID640920897 
Producthypothetical protein 
Protein accessionYP_001458451 
Protein GI157161133 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0000314791 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT 
CTACAGCAAC AAACTCCACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC
GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT
GAAATACCGC CGATTAATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGCCAGCCAT
CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT
CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGATTAG TAGACCAGGC ACGAAAGCAA
AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT
GGCGTCATGC TCGCTACAAA TCAGGGTTTA CCCAGAGAGA CGTTTGATTT AGCGGTGATT
GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCCAGCCCG
TGGTCAGGCC TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC
TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT
GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATCACGTTG
ATGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC
TTACACATTG TCACCGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA AGAGGGCCTT
TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGC
CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG
CGCAAACAAC GCGACCCATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT
AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA
CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC
CGGGTTTTTA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG GCTGCTGGCC
TTACGTGAAG CGGGGATCAT TCATATTCTC GCCCTCGGTG AAGACTACGA AATGGAAATT
AATGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT
GATGCCCGCG GACAGCGTCC GCTTAAAGTG AAAGATATTC CTTTCCCTGG GCTACGCGAG
CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGCG AAGATTATAC GTTACAGCAA
CCCGAAGATA TTCGTGGACG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG
CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC
GTAAAACCTG CATCCCGTGC ACGTCGGCGT CTTTCGTTTG ATTAA
 
Protein sequence
MKKIAIVGAG PTGIYTLFSL LQQQTPLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI 
EIPPINCTYL EWLQKQEASH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARKQ
KFAVAVYESC QVTDLQITNA GVMLATNQGL PRETFDLAVI ATGHVWPDEE EATRTYFPSP
WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL
MSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEEGL LDRVFRLIVE EIKFADPDWS
QRIALESLNV DSFAQAWFAE RKQRDPFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV
QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYEMEI
NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ
PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD