Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1377 |
Symbol | |
ID | 5592876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1371803 |
End bp | 1373698 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640920532 |
Product | hypothetical protein |
Protein accession | YP_001458091 |
Protein GI | 157160773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.469999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGCAG GGCTGTTTGT ATCATCTCTA AGTTATGCAG AAAACACGGA GATCCCTTCT TATGAAGAAG GGATTTCGCT CTTTGATGTT GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG CGAAATCAGC AGATTAAACA CGGATTTTAT CGTGATTTAC CACGACTCTG GATGCAGCCT GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACCCGTGA TGGTATTCCT GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGATTA TTGTCGTGGG CGATAAACAA CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC CTGCGTGAGG GAGATTCAGA TCTGTTAATC TGGAACGTGA CTGGTAACCA CTGGCCGTTT GAAATCTATA AGACCCGATT TTCACTCAAG TTCCCTGATA TCGCGGGTAA TCCATTTAGC GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCTTTGAG GACGGAAGAA TTGAATCCAG CGATCCGTTT TATCGTGAAG ATTTCACGGT ACTCTACCGC TGGCCTCACG CTTTACTCAG CAATGCCCCG GCTCCACAAA CGACGAATAT TTTCAGCCAT ATTCTTTTAC CCTCCACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTATT CCTGGTTTGT GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGC CGGTAGATGT GATTGAAACC GATGTCATTC CGCCAGATTA CACGCCCGGC ATGTTACGTC TCGATGCGAA GCTGGTTTAC GACGATAAAG GTTTCTGTGC CGATATCGTA AATCTGATCG TGAAGGGAAA AATTCATCTG GAAGATCAGT ATGATAAGAA CCAGCAAATC CTGATTTGTG TTAATGAAGG CGCGACCAGA AATAATGAGG TATTACTGCC CGCAGAGCAG TTATTACTGG AAGCGTTATT TCGTAAAGGC GATAAGGTCG TTCTTACGGG GAGACGCAAC AGACTCTTAC GCAGGGCATT TTTACGGATG CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TCGTTTTACC GGCCTGATAC GTTTTTGCAA TGGGGTGGAC TGGCAATATT GGCGGTCATT CTCTGCGGTA ACCTGAGTCC CGTAGGTTGG GCAGGAATGA GTCTGGTAGG CGATATGTTT ATTATGATCT GCTGGATTAT TCCTTTTTTA TTTTGTTCCC TTGAGCTTTT ATTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGCA ATCATCACTT TGTATTTACC ACTGATTTGT TCAGGCGTGG CCTTCTATTC TCTCTATATC AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTGC CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG CAGCAACGTT ATGCCCGCGG TGAAGCTATC GTTAACTATC TTGCGCGTAA AGAGGCAGCA ACACACAGTG GACGTCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT TGGGCTATAT CGGCAAATTT GGGGAGGGAA TGGGCGTTAC GCATTGCCCC TTCGCTTTCT TCGGCGATTC GCGCTCCAGA GATTGCCCGT AACGGCGTTT TATTCTCATT ACAGACGCAC CTAAGTTGCG GGGCCAATAC CAGTTTGTTG GGGCGAAGTT ATTCCGGTGG TGGTGCTGGC GGCGGCGCGG GTGGCGGAGG CGGTGGTGGC TGGTAA
|
Protein sequence | MAGKFRCILL LIAGLFVSSL SYAENTEIPS YEEGISLFDV EATLQPDGVL DIKENIHFQA RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMIIVVGDKQ RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTRFSLK FPDIAGNPFS EIDLFTGEEG DTYRNGRIFE DGRIESSDPF YREDFTVLYR WPHALLSNAP APQTTNIFSH ILLPSTSSLL IWFPCLFLVC GWLYLWKRRP QFTPVDVIET DVIPPDYTPG MLRLDAKLVY DDKGFCADIV NLIVKGKIHL EDQYDKNQQI LICVNEGATR NNEVLLPAEQ LLLEALFRKG DKVVLTGRRN RLLRRAFLRM QKFYLPRKKS SFYRPDTFLQ WGGLAILAVI LCGNLSPVGW AGMSLVGDMF IMICWIIPFL FCSLELLFAR DDDKPCVNRA IITLYLPLIC SGVAFYSLYI NVGDVFFYWY MPAGYFSAVC LTGYLTGMGY IFLPKFTQTG QQRYARGEAI VNYLARKEAA THSGRRRKGE TRKLDYALLG WAISANLGRE WALRIAPSLS SAIRAPEIAR NGVLFSLQTH LSCGANTSLL GRSYSGGGAG GGAGGGGGGG W
|
| |