Gene Information |
Locus tag | EcHS_A2080 |
Symbol | |
ID | 5594403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2065622 |
End bp | 2067616 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921221 |
Product | hypothetical protein |
Protein accession | YP_001458765 |
Protein GI | 157161447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00000313758 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAAA GATATAATAC CGGCAATCCA AGACCTTCAA ATAGCATGAA GGATCTGAAT GATAACGCCC TGGCGTACGA TGATTTCCTG AACAGCGAAA GCGATACTTT TATAGATCGT TTTGGTAACG CCCAGGATAC GATAATTGGG GCTACTAAAA AAATGGCAGC TGCTACCGAC GCTGTTATTG ATGAAGCCCG CCAAAACCTG ATCCCTCTCA GCCGGCAGTA CATGACGCTG GCGGCGGCGC AGGCGGATAT TGCGAATATT CCGGCAGGTT CAACAACCTA TGTCCGCAGT CAGGACGGAA GCTCTCTGGC CGATGAGTAT ATCAACCTCG CTGGAACGCT GCAGCCAACC GGACGGCGGA TGGTTCGTGA CGACTACGCA TACCAGGTAT CGCCAGACAG CGTGACCCTG GCAGCATATG ATCCGGAGAC TTCCCGCGTG GCTCCATTTT TAAATACAAG CGGCAGATTA ATTCAAATCG GTCCTGACGG AAAATATTAC GAACTTTTAA CCCAACAAGA ATCCGAACTC TATGCGCTGG GCCGGGAGGG TTCTATACCG CAGTTTATTG GCGGTGAAAA AGTGTGGCGG ATGACGGTTG ATTCAACCAC AAACCAGATC GTTGAAGCTT ATACGGTTGG TGGGAAGCAC TGGATTTACT CAGACGGTGG CCTGGTAGCT GTTAATAACG GAAATGGCGG TGGTGGTGGC GACGATGATG CCAACCAGCT CCCTGAGTAT GGACTTCATT TGTCAGGGTC TACTGTGTAC CCCTACTCAG AGACAGTGCC TGTATGTTTT ATCTTTGTGA CTGCTGGGCA ATCCAACGCT CGAGGATATT GTCCTGACGC CGATCAAACC ATTGTCGCAG CAACGCCGAT ATATCCTGAT AACGCTTTCA TGCTCAGCGG CGGGGTTAGG CGTACAGGGA CACGCAGCAC TACTCTGGTG CCACTGGTTG AGGCAGTAAG TGGGACAGAT AAAGAAACGG CCGCAAGCGG CCTCGCGAAC ACCTTCATTC GCGATATGGC TGCAGCTACC GGAATCATGC CGCGCACGCT ATCAATCGTA TGTGCGCAGT CTGGTCAGGC TTACGAGTAC CAGAAACGGG GTAACCAGGT ATATCAGTAT CTGCTCGATT CAATCGAAGA CTGCGTAACG GCCTGTAAAG CACGCGGCTG GCTGCCGATT GTTCTCTGCG TTGACTGGAT GCAGGGAGAG TCCGACGAGG ACTGGTCAGG ATTACGAGAA GGAATGTATG AATCACGGAT GAGGCAGTAC CAGAGACAAA TCACCAGCGA CATCATCGCA AGAACGGGTC AAAACGAACC GCCGATTATC GCCATTACCC AGCTGGGGTA TGTCAATGAC GGGCATGGTG CATTTACAGG CCAGTACGCG CGACTGGCGT CGACGCGATT GCACGGAAAA GAGCAATTCA GGCTGGTCAA TAGTTTGTAC CAGTACGATT TTATTTCAGA CGGTCTGCAC TTGACGTGTG CGGGCCAGAA CCGGCGCGGA GCAGCTGTGG CGAGAGCGCT TCTCCAGGAG TGGTTTACGA GCGGCTGGTC AGGGATGGTT CCGACCAGTT TCGTGTGGAA CTCACCCACG CAGATACAAA TCAATGTCCC AGCGTATACG AACCTGGTGC TGGACACGAC TACGATCAAC ACCTCCGGTC TGGCCAATTA CGGCTTTAGC TACACGGATG AGACTGGTGC TCCACCTGCT ATATCGAGCA TCGCGATCAG CTCGGACGGC AAGGGCGTGC TGATTAACCT GGCGACCGCC 
CCCTCTGGAC GTTTTGGGCG CGTTTCCTAT GCGACAGCAG AAAACCCACT TCAGAGCGGC GCATCTGTAA AACCTTCCGG GCGGACTCTT GGTGCAAGAG GGTGTGTTCG ATCTTCCGCT GGAATCATAT GGGTGTATGA CACATCCGTG ACTCTTTACG ACTGGCTCCC CGCTTTTCGT ATTAACGTTT TCTGA
|
Protein sequence | MDKRYNTGNP RPSNSMKDLN DNALAYDDFL NSESDTFIDR FGNAQDTIIG ATKKMAAATD AVIDEARQNL IPLSRQYMTL AAAQADIANI PAGSTTYVRS QDGSSLADEY INLAGTLQPT GRRMVRDDYA YQVSPDSVTL AAYDPETSRV APFLNTSGRL IQIGPDGKYY ELLTQQESEL YALGREGSIP QFIGGEKVWR MTVDSTTNQI VEAYTVGGKH WIYSDGGLVA VNNGNGGGGG DDDANQLPEY GLHLSGSTVY PYSETVPVCF IFVTAGQSNA RGYCPDADQT IVAATPIYPD NAFMLSGGVR RTGTRSTTLV PLVEAVSGTD KETAASGLAN TFIRDMAAAT GIMPRTLSIV CAQSGQAYEY QKRGNQVYQY LLDSIEDCVT ACKARGWLPI VLCVDWMQGE SDEDWSGLRE GMYESRMRQY QRQITSDIIA RTGQNEPPII AITQLGYVND GHGAFTGQYA RLASTRLHGK EQFRLVNSLY QYDFISDGLH LTCAGQNRRG AAVARALLQE WFTSGWSGMV PTSFVWNSPT QIQINVPAYT NLVLDTTTIN TSGLANYGFS YTDETGAPPA ISSIAISSDG KGVLINLATA PSGRFGRVSY ATAENPLQSG ASVKPSGRTL GARGCVRSSA GIIWVYDTSV TLYDWLPAFR INVF
|
| |
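The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS includes its stop codon, and that the sequence fields are printed in space-separated blocks of ten characters. All function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record's sequence fields."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    length = gene_length(2065622, 2067616)   # Start bp, End bp from the record
    print(length)                            # 1995, matches Gene Length
    print(expected_protein_length(length))   # 664, matches Protein Length
    # First two blocks of the Gene sequence field; the CDS starts with ATG.
    print(clean_sequence("ATGGACAAAA GATATAATAC")[:3])
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare against the listed GC content, which the record reports rounded to a whole percent.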