Gene EcHS_A4519 details


Gene Information

Locus tag: EcHS_A4519
Symbol:
ID: 5593982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4524687
End bp: 4526189
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 54%
IMG OID: 640923615
Product: hypothetical protein
Protein accession: YP_001461056
Protein GI: 157163738
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
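
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4524687
end_bp = 4526189
gene_length_bp = 1503
protein_length_aa = 500

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends with one
# stop codon that does not contribute an amino acid.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1503 bp, which is 501 codons, leaving 500 residues once the stop codon is excluded.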


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00000000859541
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA 
ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC CGTTACGCTG
CAAAAACTGG CGGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA
GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CGCAAGGCTT
AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC
TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCGG ATCTGGGGCC GCTGTTGCTG
GCGCGGCTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT CTTCCGTATT
GCTGACGATC AGGGATTGTT GCTGCTCGAC TTTAAAGATC TGCGGGCGAT TACCCAGTAC
ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGTA ATATCAGTAG CGCATCGGTT
GGTGCCATCC AGCGCGGACT GTTGTCGCTG GAACAGCAAG GCGCAGCACA CTTCTTTGGC
GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATA CCAACGGTAA AGGCGTTATC
AATATCCTCA GCGCCGAGAA GCTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG
TGGATGCTTT CAGAGTTGTA TGAACAATTG CCGGAAGCAG GCGATCTGGA GAAACCAAAA
CTGGTGTTTT TCTTCGACGA GGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG
GATAAGATTG AGCAGGTGAT ACGGCTTATT CGCTCAAAAG GCGTAGGCGT CTGGTTCGTT
TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGGC AGCTCGGTAA TCGCGTTCAA
CACGCTTTGC GTGCTTTTAC GCCCAAAGAT CAGAAAGCGG TAAAAGCTGC GGCGCAAACC
ATGCGGGCCA ATCCGGCATT TGATACCGAA AAGGCGATTC AGGAACTGGG CACCGGCGAG
GCGTTGATCT CTTTTCTGGA TGCGAAAGGA AGCCCTTCTG TGGTGGAGCG TGCGATGGTG
ATCGCGCCTT GTTCGCGGAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CTTGATTAAT
CACTCTCCGG TGTATGGCAA ATATGAGGAT GAGGTGGACC GGGAATCCGC CTATGAGATG
TTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCCCCCGC GAAAGGGAAA
GAGGTAGCGG TGGATGACGG CATTCTTGGT GGATTGAAGG ATATTTTGTT CGGCACTACC
GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCCAAAAG CGCCGCTCGC
CAGGTGACGA ATCAGATTGT GCGTGGAATG TTGGGGAGTT TGCTGGGGGG GAGAAGAAGG
TAA
 
Protein sequence
MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 
GDLTGIAQAG TASEKLLARL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL
ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV
GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDTNGKGVI NILSAEKLYQ MPKLYAASLL
WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV
SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE
ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM
LQKGFQASTE QQNNPPAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR
QVTNQIVRGM LGSLLGGRRR
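
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any per-base calculation such as the GC content reported in the Gene Information fields. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Illustrative input: the first two 10-base blocks of the gene sequence only.
example_blocks = "ATGAGCGAAC CCCTGTTAAT"
seq = clean_sequence(example_blocks)
print(len(seq), gc_percent(seq))
```

Pasting the complete gene sequence into `clean_sequence` in place of the two-block example would give a value that can be compared with the GC content stated in the record.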