Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3142 |
Symbol | |
ID | 5591928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3151413 |
End bp | 3155924 |
Gene Length | 4512 bp |
Protein Length | 1503 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640922261 |
Product | hypothetical protein |
Protein accession | YP_001459760 |
Protein GI | 157162442 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGCAA CCCTGTTAGC CGGTTGTGAT GGCGGTGGTT CCGGATCTTC CTCCGATACG CCGCCTGTAG ATTCTGGAAC AGGATCTTTG CCGGAAGTGA AACCTGATCC AACACCAAAC CCGGAGCCGA CGCCTGAGCC AACGCCGGAC CCAGAGCCTA CGCCAGAACC GATACCTGAT CCTGAACCAA CACCAGAACC GGAGCCAGAA CCTGTTCCTA CGAAAACGGG TTATCTGACC CTGGGCGGAA GCCAGCGGGT AACTGGTGCT ACCTGTAATG GTGAATCCAG CGATGGCTTT ACATTTAAAC CTGGCGAGGA CGTTACTTGC GTGGCGGGTA ACACGACAAT TGCCACCTTC AACACTCAGT CAGAAGCTGC GCGTAGCTTG CGTGCGGTTG AAAAAGTGTC GTTTAGCCTT GAGGACGCGC AAGAACTGGC GGGCTCCGAT GACAAGAAAA GCAATGCGGT TTCGCTGGTA ACGTCCAGTA ACAGCTGTCC GGCGAATACA GAACAGGTTT GTCTGACGTT CTCCTCGGTG ATCGAGAGTA AACGCTTCGA CTCGCTGTAT AAGCAAATCG ATCTGGCACC GGAAGAGTTC AAAAAGCTGG TCAATGAAGA GGTGGAAAAC AATGCTGCGA CCGATAAAGC GCCATCCACT CATACTTCAC CGGTCGTGCC CGTCACCACG CCGGGAACAA AACCGGATCT GAACGCTTCC TTCGTGTCGG CTAACGCGGA ACAGTTTTAT CAGTATCAAC CCACTGAAAT CATTCTCTCT GAAGGTCGAC TGGTCGATAG CCAGGGATAT GGTGTTGCTG GCGTCAACTA CTACACCAAT TCAGGCCGTG GCGTGACAGG GGAAAATGGT GAATTTTCCT TTAGCTGGGG CGAAACCATC TCCTTTGGTA TCGATACCTT TGAACTGGGT TCAGTGCGCG GCAATAAGTC GACCATTGCG CTGACTGAAC TGGGTGATGA AGTTCGCGGG GCGAATATTG ATCAGCTTAT TCATCGCTAT TCGACGACCG GGCAAAATAA TACCCGTGTT GTTCCGGACG ATGTACGCAA GGTCTTTGCC GAATATCCCA ACGTGATCAA CGAGATTATC AATCTCTCGT TATCCAACGG TGCGACGCTG GGGGAAGGTG AGCAAGTCGT TAATCTGCCT AACGAATTTA TTGAGCAGTT TAATACGGGT CAGGCCAAAG AGATCGATAC CGCGATTTGT GCGAAAACCG ATGGTTGTAA CGAGGCTCGC TGGTTCTCGC TGACGACGCG CAATGTTAAT GACGGCCAGA TTCAGGGCGT TATCAACAAG CTGTGGGGCG TGGATACGAA CTACAAATCT GTCAGCAAGT TCCATGTATT CCATGACTCC ACCAACTTCT ATGGCAGCAC GGGTAATGCG CGCGGTCAGG CGGTGGTGAA TATCTCCAAC GCGGCCTTCC CGATTCTGAT GGCGCGTAAT GATAAAAACT ACTGGCTGGC CTTCGGCGAG AAACGGGCCT GGGATAAAAA TGAGCTGGCG TACATTACTG AAGCGCCTTC CATTGTGCGA CCAGAGAACG TGACACGCGA AACCGCCACC TTCAACCTGC CGTTTATTTC GCTGGGGCAA GTGGGCGATG GCAAGCTGAT GGTTATCGGT AACCCACACT ACAACAGCAT CCTGCGTTGC CCGAACGGTT ACAGCTGGAA CGGGGGCGTT AATAAAGATG GGCAGTGTAC GCTCAACAGC GACCCGGATG ACATGAAGAA CTTCATGGAG AACGTGCTGC GCTATCTGTC AAATGATCGC TGGTTGCCGG ATGCAAAATC CAATATGACC GTGGGTACTA ACCTGGACAC GGTGTATTTC AAAAAACACG GGCAGGTTAC AGGAAATAGT GCTGCGTTCG GCTTTCATCC GGATTTTGCG GGTATCTCTG TTGAGCATTT AAGTAGCTAT GGCGATCTCG ACCCGCAGGA AATGCCGCTG CTGATCCTCA ACGGCTTTGA GTATGTGACT CAGGTTGGTA ACGATCCTTA TGCAATCCCG CTGCGTGCAG ATACCAGCAA ACCGAAGCTG ACCCAGCAGG ATGTGACCGA TTTGATCGCC TATATGAACA AAGGTGGATC GGTGCTGATC ATGGAAAACG TGATGAGCAA TCTTAAGGAA GAGAGCGCAT CTGGCTTTGT ACGTCTGCTT GATGCCGCAG GTTTGTCGAT GGCGCTTAAC AAGTCGGTAG TAAATAACGA TCCGCAAGGC TACCCGGACC GCGTTCGTCA ACGACGTTCA ACGCCAATTT GGGTCTATGA GCGTTATCCG GCTGTCGATG GTAAACCACC GTATACCATT GATGACACCA CGAAAGAAGT TATCTGGAAA TATCAGCAAG AAAACAAACC TGATGACAAA CCGAAGCTGG AAGTTGCCAG CTGGCAGGAA GAAGTTGAGG GTAAACAGGT AACTCAATTC GCCTTTATCG ATGAAGCCGA CCACAAAACG CCTGAGTCAC TGGCTGCGGC GAAGAAGAGA ATTCTGGACG CGTTCCCAGG GCTGGAAGAG TGTAAGGATT CTGACTACCA CTATGAGGTC AACTGTCTGG AATATCGTCC TGGCACGGGG GTTCCGGTTA CTGGTGGCAT GTATGTTCCA CAGTATACGC AACTAAGCCT TAACGCCGAC ACTGCGAAAG CGATGGTGCA GGCTGCGGAT TTAGGCACCA ACATTCAGCG TCTGTATCAG CATGAGCTTT ACTTCCGTAC CAATGGTCGC AAAGGTGAGC GTCTGAGCAG CGTCGATCTG GAACGTCTGT ACCAGAACAT GTCGGTCTGG CTGTGGAATA AAATTGAATA TCGCTATGAA AACGACAAGG ATGACGAGCT GGGCTTTAAA ACGTTCACCG AGTTCCTGAA CTGTTACGCC AACAATGCTT ATGATGGTGG CACGCAGTGC TCCGCAGAGC TGAAACAATC GCTGATCGAT AACAAGATGA TCTACGGTGA AGGCAGCAAA GCGGGCATGA TGAACCCGAG CTACCCGCTT AACTATATGG AAAAACCGCT GACGCGCCTG ATGCTGGGGC GTTCCTGGTG GGATCTGAAC ATCAAGGTTG ATGTCGAGAA GTATCCGGGG GCGGTATCGG CTGAAGGTGA GGAGGTTACT GAAACCATCA ACCTGTACTC GAATCCGACC AAATGGTTTG CGGGTAACAT GCAGTCTACT GGCCTGTGGG CTCCGGCTCA GCAGGAAGTC AGCATTAAGT CCAATGCGAA AGTCCCTGTG ACTGTTACCG TGGCGCTGGC TGACGACCTG ACCGGGCGTG AGAAGCATGA GGTTGCGCTG AACCGTCCGC CAAGAGTGAC TAAAACATAC TCTCTGGATG CTAGCGGCAC GGTGAAGTTC AAGGTTCCTT ACGGTGGTCT GATTTATATC AAGAGCGACA GTAAAGAGGA GAAATCAGCC AACTTCACCT TTACTGGCGT GGTAAAAGCG CCGTTCTATA AAGACGGTAA ATGGAAAAAC GACCTGAAAT CCCCTGCGCC GTTGGGTGAG CTGGAGTCTG CGTCGTTCGT CTATACCACG CCGAAGAAGA ACCTTGAGGC CAGCAATTAC AAGGGCGGTC TGAAACAATT CGCTGAGGAT CTGGATACCT TTGCCAGCTC GATGAATGAC TTCTACGGTC GTGATGGCGA AAGCGGTAAG CACCGGATGT TTACCTATGA AGCATTGACG GGGCACAAAC ATCGTTTCAC CAACGATGTG CAGATCTCCA TCGGTGATGC GCACTCTGGT TATCCGGTGA TGAACAGCAG CTTCTCGCCG AACAGCACCA CGCTGCCGAC GACGCCGCTG AACGACTGGC TGATCTGGCA CGAAGTAGGG CACAACGCTG CAGAAACGCC GCTGACTGTA CCGGGCGCAA CTGAAGTGGC GAACAACGTG CTGGCGCTGT ACATGCAGGA TCGTTATCTC GGCAAGATGA ACCGTGTCGC TGACGATATT ACCGTTGCGC CGGAATATCT GGAGGAGAGC AACGGTCAGG CATGGGCGCG TGGCGGTGCG GGTGACCGTC TGCTGATGTA CGCGCAGCTG AAGGAATGGG CAGAGAAAAA CTTTGATATC AAACAGTGGT ATCCAGAAGG CTCTCTGCCA GCGTTCTACA GCGAGCGTGA AGGGATGAAA GGCTGGAACC TGTTCCAGTT GATGCACCGT AAAGCACGCG GCGATGATGT TGGCAATGAC AAATTTGGCA ACAGAAACTA CTGTGCCGAA TCCAACGGTA ACGCTGCCGA CACGCTGATG CTGTGTGCAT CCTGGGTCGC TCAGACGGAC CTTTCCGCAT TCTTTAAGAA ATGGAATCCG GGCGCGAATG CTTACCAGTT GCCGGGAGCG ACAGAGATGA GCTTCGAGGG CGGTGTGAGC CAGTCGGCTT ACAACACGCT CGCGTCACTC GATCTGCCGA AACCGAAGCA AGGGCCGGAA ACCATTAACA AGGTTACCGA GTATTCGATG CCTGCTGAAT AA
|
Protein sequence | MSATLLAGCD GGGSGSSSDT PPVDSGTGSL PEVKPDPTPN PEPTPEPTPD PEPTPEPIPD PEPTPEPEPE PVPTKTGYLT LGGSQRVTGA TCNGESSDGF TFKPGEDVTC VAGNTTIATF NTQSEAARSL RAVEKVSFSL EDAQELAGSD DKKSNAVSLV TSSNSCPANT EQVCLTFSSV IESKRFDSLY KQIDLAPEEF KKLVNEEVEN NAATDKAPST HTSPVVPVTT PGTKPDLNAS FVSANAEQFY QYQPTEIILS EGRLVDSQGY GVAGVNYYTN SGRGVTGENG EFSFSWGETI SFGIDTFELG SVRGNKSTIA LTELGDEVRG ANIDQLIHRY STTGQNNTRV VPDDVRKVFA EYPNVINEII NLSLSNGATL GEGEQVVNLP NEFIEQFNTG QAKEIDTAIC AKTDGCNEAR WFSLTTRNVN DGQIQGVINK LWGVDTNYKS VSKFHVFHDS TNFYGSTGNA RGQAVVNISN AAFPILMARN DKNYWLAFGE KRAWDKNELA YITEAPSIVR PENVTRETAT FNLPFISLGQ VGDGKLMVIG NPHYNSILRC PNGYSWNGGV NKDGQCTLNS DPDDMKNFME NVLRYLSNDR WLPDAKSNMT VGTNLDTVYF KKHGQVTGNS AAFGFHPDFA GISVEHLSSY GDLDPQEMPL LILNGFEYVT QVGNDPYAIP LRADTSKPKL TQQDVTDLIA YMNKGGSVLI MENVMSNLKE ESASGFVRLL DAAGLSMALN KSVVNNDPQG YPDRVRQRRS TPIWVYERYP AVDGKPPYTI DDTTKEVIWK YQQENKPDDK PKLEVASWQE EVEGKQVTQF AFIDEADHKT PESLAAAKKR ILDAFPGLEE CKDSDYHYEV NCLEYRPGTG VPVTGGMYVP QYTQLSLNAD TAKAMVQAAD LGTNIQRLYQ HELYFRTNGR KGERLSSVDL ERLYQNMSVW LWNKIEYRYE NDKDDELGFK TFTEFLNCYA NNAYDGGTQC SAELKQSLID NKMIYGEGSK AGMMNPSYPL NYMEKPLTRL MLGRSWWDLN IKVDVEKYPG AVSAEGEEVT ETINLYSNPT KWFAGNMQST GLWAPAQQEV SIKSNAKVPV TVTVALADDL TGREKHEVAL NRPPRVTKTY SLDASGTVKF KVPYGGLIYI KSDSKEEKSA NFTFTGVVKA PFYKDGKWKN DLKSPAPLGE LESASFVYTT PKKNLEASNY KGGLKQFAED LDTFASSMND FYGRDGESGK HRMFTYEALT GHKHRFTNDV QISIGDAHSG YPVMNSSFSP NSTTLPTTPL NDWLIWHEVG HNAAETPLTV PGATEVANNV LALYMQDRYL GKMNRVADDI TVAPEYLEES NGQAWARGGA GDRLLMYAQL KEWAEKNFDI KQWYPEGSLP AFYSEREGMK GWNLFQLMHR KARGDDVGND KFGNRNYCAE SNGNAADTLM LCASWVAQTD LSAFFKKWNP GANAYQLPGA TEMSFEGGVS QSAYNTLASL DLPKPKQGPE TINKVTEYSM PAE
|
| |