Gene EcHS_A1590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1590 
Symbol 
ID5591497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1607677 
End bp1608999 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content48% 
IMG OID640920743 
Productprotein HipA 
Protein accessionYP_001458299 
Protein GI157160981 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.943886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAC TTGTCACTTG GATGAACAAC CAGCGGGTAG GCGAGTTAAC GAAGTTAGCC 
AACGGCGCGC ACACCTTTAA GTATGCACCG GAGTGGTTAG CAAGCCGTTA TGCCAGACCG
TTGTCACTTT CGCTGCCATT GCAGAGGGGG AATATCACCT CTGATGCCGT ATTTAACTTC
TTCGATAACC TGTTACCCGA TAGCCCGATT GTACGTGACC GGATCGTTAA ACGTTATCAT
GCCAAATCCA GACAACCGTT TGATTTATTG TCAGAAATAG GGCGAGACAG CGTTGGTGCC
GTGACGTTAA TACCCGAAGA CGAAACCGTA ACGCATCCGA TAATGGCATG GGAAAAGCTT
ACTGAAGCCA GACTTGAAGA AGTATTAACG GCTTATAAAG CAGATATCCC GCTAGGCATG
ATTAGAGAAG AAAATGACTT TCGCATCTCG GTTGCTGGCG CACAGGAGAA GACAGCACTG
CTCAGAATAG GCAATGACTG GTGCATTCCG AAAGGAATAA CGCCGACGAC GCACATCATT
AAATTACCGA TTGGCGAAAT CAGGCAGCCC AATGCGACGC TCGATCTCAG CCAAAGCGTT
GATAATGAGT ATTACTGTCT GCTGCTGGCG AAAGAACTTG GGTTGAATGT TCCGGACGCA
GAAATCATTA AAGCGGGAAA TGTGCGCGCG TTAGCGGTCG AACGTTTTGA CAGGCGTTGG
AATGCTGAGC GAACGGTTTT ACTTCGCTTG CCACAGGAGG ATATGTGTCA GACATTCGGT
TTACCTTCAT CGGTGAAATA TGAATCAGAT GGAGGCCCAG GCATCGCGCG GATCATGGCT
TTTTTGATGG GGTCCAGCGA GGCGCTGAAA GATCGCTATG ATTTTATGAA ATTCCAGGTC
TTCCAGTGGT TGATTGGCGC AACGGACGGT CATGCAAAAA ACTTCTCCGT ATTTATTCAG
GCTGGCGGCA GTTATCGACT CACGCCATTT TACGACATCA TTTCAGCATT TCCGGTCCTT
GGCGGTACGG GAATACACAT CAGCGATCTC AAACTGGCAA TGGGGCTTAA CGCATCCAAA
GGCAAAAAAA CGGCAATCGA TAAAATTTAT CCGCGACATT TTTTGGCGAC AGCAAAGGTG
CTGAGATTCC CGGAAGTGCA GATGCATGAA ATCCTGAGTG ACTTTGCCAG AATGATTCCA
GCAGCACTGG ATAACGTGAA GACTTCATTA CCGACAGATT TTCCGGAGAA CGTGGTGACG
GCAGTTGAAA GCAATGTGTT GAGGTTGCAT GGACGGTTAA GCCGAGAATA CGGTAGTAAG
TGA
 
Protein sequence
MPKLVTWMNN QRVGELTKLA NGAHTFKYAP EWLASRYARP LSLSLPLQRG NITSDAVFNF 
FDNLLPDSPI VRDRIVKRYH AKSRQPFDLL SEIGRDSVGA VTLIPEDETV THPIMAWEKL
TEARLEEVLT AYKADIPLGM IREENDFRIS VAGAQEKTAL LRIGNDWCIP KGITPTTHII
KLPIGEIRQP NATLDLSQSV DNEYYCLLLA KELGLNVPDA EIIKAGNVRA LAVERFDRRW
NAERTVLLRL PQEDMCQTFG LPSSVKYESD GGPGIARIMA FLMGSSEALK DRYDFMKFQV
FQWLIGATDG HAKNFSVFIQ AGGSYRLTPF YDIISAFPVL GGTGIHISDL KLAMGLNASK
GKKTAIDKIY PRHFLATAKV LRFPEVQMHE ILSDFARMIP AALDNVKTSL PTDFPENVVT
AVESNVLRLH GRLSREYGSK