Gene EcHS_A3142 details


Gene Information

Locus tag: EcHS_A3142
Symbol:
ID: 5591928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3151413
End bp: 3155924
Gene Length: 4512 bp
Protein Length: 1503 aa
Translation table: 11
GC content: 52%
IMG OID: 640922261
Product: hypothetical protein
Protein accession: YP_001459760
Protein GI: 157162442
COG category:
COG ID:
TIGRFAM ID:
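
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3151413
end_bp = 3155924

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: every codon encodes one residue except the final stop codon.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the result is 4512 bp with no leftover bases and 1503 aa, matching the Gene Length and Protein Length fields.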


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGCAA CCCTGTTAGC CGGTTGTGAT GGCGGTGGTT CCGGATCTTC CTCCGATACG 
CCGCCTGTAG ATTCTGGAAC AGGATCTTTG CCGGAAGTGA AACCTGATCC AACACCAAAC
CCGGAGCCGA CGCCTGAGCC AACGCCGGAC CCAGAGCCTA CGCCAGAACC GATACCTGAT
CCTGAACCAA CACCAGAACC GGAGCCAGAA CCTGTTCCTA CGAAAACGGG TTATCTGACC
CTGGGCGGAA GCCAGCGGGT AACTGGTGCT ACCTGTAATG GTGAATCCAG CGATGGCTTT
ACATTTAAAC CTGGCGAGGA CGTTACTTGC GTGGCGGGTA ACACGACAAT TGCCACCTTC
AACACTCAGT CAGAAGCTGC GCGTAGCTTG CGTGCGGTTG AAAAAGTGTC GTTTAGCCTT
GAGGACGCGC AAGAACTGGC GGGCTCCGAT GACAAGAAAA GCAATGCGGT TTCGCTGGTA
ACGTCCAGTA ACAGCTGTCC GGCGAATACA GAACAGGTTT GTCTGACGTT CTCCTCGGTG
ATCGAGAGTA AACGCTTCGA CTCGCTGTAT AAGCAAATCG ATCTGGCACC GGAAGAGTTC
AAAAAGCTGG TCAATGAAGA GGTGGAAAAC AATGCTGCGA CCGATAAAGC GCCATCCACT
CATACTTCAC CGGTCGTGCC CGTCACCACG CCGGGAACAA AACCGGATCT GAACGCTTCC
TTCGTGTCGG CTAACGCGGA ACAGTTTTAT CAGTATCAAC CCACTGAAAT CATTCTCTCT
GAAGGTCGAC TGGTCGATAG CCAGGGATAT GGTGTTGCTG GCGTCAACTA CTACACCAAT
TCAGGCCGTG GCGTGACAGG GGAAAATGGT GAATTTTCCT TTAGCTGGGG CGAAACCATC
TCCTTTGGTA TCGATACCTT TGAACTGGGT TCAGTGCGCG GCAATAAGTC GACCATTGCG
CTGACTGAAC TGGGTGATGA AGTTCGCGGG GCGAATATTG ATCAGCTTAT TCATCGCTAT
TCGACGACCG GGCAAAATAA TACCCGTGTT GTTCCGGACG ATGTACGCAA GGTCTTTGCC
GAATATCCCA ACGTGATCAA CGAGATTATC AATCTCTCGT TATCCAACGG TGCGACGCTG
GGGGAAGGTG AGCAAGTCGT TAATCTGCCT AACGAATTTA TTGAGCAGTT TAATACGGGT
CAGGCCAAAG AGATCGATAC CGCGATTTGT GCGAAAACCG ATGGTTGTAA CGAGGCTCGC
TGGTTCTCGC TGACGACGCG CAATGTTAAT GACGGCCAGA TTCAGGGCGT TATCAACAAG
CTGTGGGGCG TGGATACGAA CTACAAATCT GTCAGCAAGT TCCATGTATT CCATGACTCC
ACCAACTTCT ATGGCAGCAC GGGTAATGCG CGCGGTCAGG CGGTGGTGAA TATCTCCAAC
GCGGCCTTCC CGATTCTGAT GGCGCGTAAT GATAAAAACT ACTGGCTGGC CTTCGGCGAG
AAACGGGCCT GGGATAAAAA TGAGCTGGCG TACATTACTG AAGCGCCTTC CATTGTGCGA
CCAGAGAACG TGACACGCGA AACCGCCACC TTCAACCTGC CGTTTATTTC GCTGGGGCAA
GTGGGCGATG GCAAGCTGAT GGTTATCGGT AACCCACACT ACAACAGCAT CCTGCGTTGC
CCGAACGGTT ACAGCTGGAA CGGGGGCGTT AATAAAGATG GGCAGTGTAC GCTCAACAGC
GACCCGGATG ACATGAAGAA CTTCATGGAG AACGTGCTGC GCTATCTGTC AAATGATCGC
TGGTTGCCGG ATGCAAAATC CAATATGACC GTGGGTACTA ACCTGGACAC GGTGTATTTC
AAAAAACACG GGCAGGTTAC AGGAAATAGT GCTGCGTTCG GCTTTCATCC GGATTTTGCG
GGTATCTCTG TTGAGCATTT AAGTAGCTAT GGCGATCTCG ACCCGCAGGA AATGCCGCTG
CTGATCCTCA ACGGCTTTGA GTATGTGACT CAGGTTGGTA ACGATCCTTA TGCAATCCCG
CTGCGTGCAG ATACCAGCAA ACCGAAGCTG ACCCAGCAGG ATGTGACCGA TTTGATCGCC
TATATGAACA AAGGTGGATC GGTGCTGATC ATGGAAAACG TGATGAGCAA TCTTAAGGAA
GAGAGCGCAT CTGGCTTTGT ACGTCTGCTT GATGCCGCAG GTTTGTCGAT GGCGCTTAAC
AAGTCGGTAG TAAATAACGA TCCGCAAGGC TACCCGGACC GCGTTCGTCA ACGACGTTCA
ACGCCAATTT GGGTCTATGA GCGTTATCCG GCTGTCGATG GTAAACCACC GTATACCATT
GATGACACCA CGAAAGAAGT TATCTGGAAA TATCAGCAAG AAAACAAACC TGATGACAAA
CCGAAGCTGG AAGTTGCCAG CTGGCAGGAA GAAGTTGAGG GTAAACAGGT AACTCAATTC
GCCTTTATCG ATGAAGCCGA CCACAAAACG CCTGAGTCAC TGGCTGCGGC GAAGAAGAGA
ATTCTGGACG CGTTCCCAGG GCTGGAAGAG TGTAAGGATT CTGACTACCA CTATGAGGTC
AACTGTCTGG AATATCGTCC TGGCACGGGG GTTCCGGTTA CTGGTGGCAT GTATGTTCCA
CAGTATACGC AACTAAGCCT TAACGCCGAC ACTGCGAAAG CGATGGTGCA GGCTGCGGAT
TTAGGCACCA ACATTCAGCG TCTGTATCAG CATGAGCTTT ACTTCCGTAC CAATGGTCGC
AAAGGTGAGC GTCTGAGCAG CGTCGATCTG GAACGTCTGT ACCAGAACAT GTCGGTCTGG
CTGTGGAATA AAATTGAATA TCGCTATGAA AACGACAAGG ATGACGAGCT GGGCTTTAAA
ACGTTCACCG AGTTCCTGAA CTGTTACGCC AACAATGCTT ATGATGGTGG CACGCAGTGC
TCCGCAGAGC TGAAACAATC GCTGATCGAT AACAAGATGA TCTACGGTGA AGGCAGCAAA
GCGGGCATGA TGAACCCGAG CTACCCGCTT AACTATATGG AAAAACCGCT GACGCGCCTG
ATGCTGGGGC GTTCCTGGTG GGATCTGAAC ATCAAGGTTG ATGTCGAGAA GTATCCGGGG
GCGGTATCGG CTGAAGGTGA GGAGGTTACT GAAACCATCA ACCTGTACTC GAATCCGACC
AAATGGTTTG CGGGTAACAT GCAGTCTACT GGCCTGTGGG CTCCGGCTCA GCAGGAAGTC
AGCATTAAGT CCAATGCGAA AGTCCCTGTG ACTGTTACCG TGGCGCTGGC TGACGACCTG
ACCGGGCGTG AGAAGCATGA GGTTGCGCTG AACCGTCCGC CAAGAGTGAC TAAAACATAC
TCTCTGGATG CTAGCGGCAC GGTGAAGTTC AAGGTTCCTT ACGGTGGTCT GATTTATATC
AAGAGCGACA GTAAAGAGGA GAAATCAGCC AACTTCACCT TTACTGGCGT GGTAAAAGCG
CCGTTCTATA AAGACGGTAA ATGGAAAAAC GACCTGAAAT CCCCTGCGCC GTTGGGTGAG
CTGGAGTCTG CGTCGTTCGT CTATACCACG CCGAAGAAGA ACCTTGAGGC CAGCAATTAC
AAGGGCGGTC TGAAACAATT CGCTGAGGAT CTGGATACCT TTGCCAGCTC GATGAATGAC
TTCTACGGTC GTGATGGCGA AAGCGGTAAG CACCGGATGT TTACCTATGA AGCATTGACG
GGGCACAAAC ATCGTTTCAC CAACGATGTG CAGATCTCCA TCGGTGATGC GCACTCTGGT
TATCCGGTGA TGAACAGCAG CTTCTCGCCG AACAGCACCA CGCTGCCGAC GACGCCGCTG
AACGACTGGC TGATCTGGCA CGAAGTAGGG CACAACGCTG CAGAAACGCC GCTGACTGTA
CCGGGCGCAA CTGAAGTGGC GAACAACGTG CTGGCGCTGT ACATGCAGGA TCGTTATCTC
GGCAAGATGA ACCGTGTCGC TGACGATATT ACCGTTGCGC CGGAATATCT GGAGGAGAGC
AACGGTCAGG CATGGGCGCG TGGCGGTGCG GGTGACCGTC TGCTGATGTA CGCGCAGCTG
AAGGAATGGG CAGAGAAAAA CTTTGATATC AAACAGTGGT ATCCAGAAGG CTCTCTGCCA
GCGTTCTACA GCGAGCGTGA AGGGATGAAA GGCTGGAACC TGTTCCAGTT GATGCACCGT
AAAGCACGCG GCGATGATGT TGGCAATGAC AAATTTGGCA ACAGAAACTA CTGTGCCGAA
TCCAACGGTA ACGCTGCCGA CACGCTGATG CTGTGTGCAT CCTGGGTCGC TCAGACGGAC
CTTTCCGCAT TCTTTAAGAA ATGGAATCCG GGCGCGAATG CTTACCAGTT GCCGGGAGCG
ACAGAGATGA GCTTCGAGGG CGGTGTGAGC CAGTCGGCTT ACAACACGCT CGCGTCACTC
GATCTGCCGA AACCGAAGCA AGGGCCGGAA ACCATTAACA AGGTTACCGA GTATTCGATG
CCTGCTGAAT AA
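
A GC content figure such as the one in the Gene Information table can be recomputed from the gene sequence. The sketch below is a minimal, hypothetical helper (`gc_fraction` is not part of any database tooling) that ignores the spaces and line breaks used in the listing; how the source database rounds its percentage is not stated, so the rounding here is an assumption.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks of the listing)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, pasted exactly as listed.
first_row = "TTGAGCGCAA CCCTGTTAGC CGGTTGTGAT GGCGGTGGTT CCGGATCTTC CTCCGATACG"
print(f"{gc_fraction(first_row):.0%}")  # assumed rounding to a whole percent
```

Passing the full gene sequence instead of the first row gives the whole-gene value for comparison with the table.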
 
Protein sequence
MSATLLAGCD GGGSGSSSDT PPVDSGTGSL PEVKPDPTPN PEPTPEPTPD PEPTPEPIPD 
PEPTPEPEPE PVPTKTGYLT LGGSQRVTGA TCNGESSDGF TFKPGEDVTC VAGNTTIATF
NTQSEAARSL RAVEKVSFSL EDAQELAGSD DKKSNAVSLV TSSNSCPANT EQVCLTFSSV
IESKRFDSLY KQIDLAPEEF KKLVNEEVEN NAATDKAPST HTSPVVPVTT PGTKPDLNAS
FVSANAEQFY QYQPTEIILS EGRLVDSQGY GVAGVNYYTN SGRGVTGENG EFSFSWGETI
SFGIDTFELG SVRGNKSTIA LTELGDEVRG ANIDQLIHRY STTGQNNTRV VPDDVRKVFA
EYPNVINEII NLSLSNGATL GEGEQVVNLP NEFIEQFNTG QAKEIDTAIC AKTDGCNEAR
WFSLTTRNVN DGQIQGVINK LWGVDTNYKS VSKFHVFHDS TNFYGSTGNA RGQAVVNISN
AAFPILMARN DKNYWLAFGE KRAWDKNELA YITEAPSIVR PENVTRETAT FNLPFISLGQ
VGDGKLMVIG NPHYNSILRC PNGYSWNGGV NKDGQCTLNS DPDDMKNFME NVLRYLSNDR
WLPDAKSNMT VGTNLDTVYF KKHGQVTGNS AAFGFHPDFA GISVEHLSSY GDLDPQEMPL
LILNGFEYVT QVGNDPYAIP LRADTSKPKL TQQDVTDLIA YMNKGGSVLI MENVMSNLKE
ESASGFVRLL DAAGLSMALN KSVVNNDPQG YPDRVRQRRS TPIWVYERYP AVDGKPPYTI
DDTTKEVIWK YQQENKPDDK PKLEVASWQE EVEGKQVTQF AFIDEADHKT PESLAAAKKR
ILDAFPGLEE CKDSDYHYEV NCLEYRPGTG VPVTGGMYVP QYTQLSLNAD TAKAMVQAAD
LGTNIQRLYQ HELYFRTNGR KGERLSSVDL ERLYQNMSVW LWNKIEYRYE NDKDDELGFK
TFTEFLNCYA NNAYDGGTQC SAELKQSLID NKMIYGEGSK AGMMNPSYPL NYMEKPLTRL
MLGRSWWDLN IKVDVEKYPG AVSAEGEEVT ETINLYSNPT KWFAGNMQST GLWAPAQQEV
SIKSNAKVPV TVTVALADDL TGREKHEVAL NRPPRVTKTY SLDASGTVKF KVPYGGLIYI
KSDSKEEKSA NFTFTGVVKA PFYKDGKWKN DLKSPAPLGE LESASFVYTT PKKNLEASNY
KGGLKQFAED LDTFASSMND FYGRDGESGK HRMFTYEALT GHKHRFTNDV QISIGDAHSG
YPVMNSSFSP NSTTLPTTPL NDWLIWHEVG HNAAETPLTV PGATEVANNV LALYMQDRYL
GKMNRVADDI TVAPEYLEES NGQAWARGGA GDRLLMYAQL KEWAEKNFDI KQWYPEGSLP
AFYSEREGMK GWNLFQLMHR KARGDDVGND KFGNRNYCAE SNGNAADTLM LCASWVAQTD
LSAFFKKWNP GANAYQLPGA TEMSFEGGVS QSAYNTLASL DLPKPKQGPE TINKVTEYSM
PAE
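
The protein sequence follows from the gene sequence under translation table 11, where the gene's TTG start codon is read as methionine. The sketch below is a simplified illustration, not the database's own method: the codon table is built from the standard NCBI amino-acid ordering over the bases T, C, A, G, and `START_CODONS` is the set of table 11 initiation codons as I understand it. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon-to-residue mapping (shared by table 11), with codons
# enumerated in NCBI order: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "TTGAGCGCAA CCCTGTTAGC CGGTTGTGAT GGCGGTGGTT CCGGATCTTC CTCCGATACG"
print(translate(first_row))       # MSATLLAGCDGGGSGSSSDT
print(translate("CCTGCTGAAT AA"))  # PAE
```

The two outputs correspond to the first 20 residues and the last 3 residues of the protein sequence listed above.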