Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1436 |
Symbol | |
ID | 5591068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1431568 |
End bp | 1432965 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640920591 |
Product | hypothetical protein |
Protein accession | YP_001458150 |
Protein GI | 157160832 |
COG category | [R] General function prediction only |
COG ID | [COG3106] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAC TTAAAAATGA ACTCAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG CGCCTCGCTG TAACCGGACT TAGCCGCAGC GGCAAAACAG CGTTTATCAC CGCGATGGTT AATCAGTTGC TTAATATTCA CGCCGGAGCA CGTTTGCCGC TGTTAAGTGC GGTGCGTGAA GAGCGCCTGC TGGGCGTGAA ACGCATTCCC CAGCGTGACT TTGGCATTCC GCGCTTCACC TATGACGAAG GGCTGGCGCA GTTATATGGC GATCCACCCG CCTGGCCAAC GCCAACGCGC GGCGTCAGTG AAATCCGCCT GGCGCTACGT TTTAAATCGA ATGATTCGCT GCTACGCCAC TTTAAGGATA CCTCCACGCT GTATCTGGAA ATTGTGGATT ATCCCGGCGA ATGGTTGCTC GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCGC GCCAGATGAC GGGCTTACTC AATGGTCAGC GCGGAGAATG GTCGGCGAAA TGGCGAATGA TGAGCGAAGG GCTGGACCCG CTAGCGCCTG CCGACGAAAA CCGGCTGGCG GACATTGCCG CCGCGTGGAC CGATTATCTC CACCACTGTA AAGAGCAGGG GCTGCACTTT ATTCAGCCTG GGCGCTTTGT CTTGCCGGGA GATATGGCAG GTGCGCCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TACCTGGGGC GAGTCCAAAC TGGCGCAGGC CGATAAGCAT ACCAATGCCG GAATGCTGCG CGAGCGCTTT AATTATTACT GCGAGAAGGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC CGCCAGATTG TGCTGGTGGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT GATATGCGTC TGGCACTGAC GCAGCTGATG CAAAGTTTTC ACTACGGGCA GCGTACCCTG TTCCGGCGTT TGTTTTCGCC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCGGAC CATGTGACCA TCGATCAGCA CGCTAATATG GTTTCATTAC TGCAACAACT GATTCAGGAT GCCTGGCAAA ATGCGGCGTT CGAAGGGATC AGCATGGACT GCCTGGGGCT GGCGTCAGTT CAGGCGACCA CCAGCGGCAT TATTGATGTT AACGGTGAGA AAATCCCGGC GCTGCGTGGT AATCGACTTA GCGATGGCGC ACCGCTCACT GTTTATCCTG GCGAAGTTCC CGCACGTTTG CCTGGTCAGG CGTTCTGGGA TAAGCAAGGC TTCCAGTTTG AGGCATTTCG TCCGCAGGTG ATGGATGTCG ACAAACCACT ACCGCATATT CGTCTTGATG CTGCGCTGGA ATTTTTAATA GGAGATAAAT TGCGATGA
|
Protein sequence | MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR FKSNDSLLRH FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSAK WRMMSEGLDP LAPADENRLA DIAAAWTDYL HHCKEQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDTWG ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR
|
| |