Gene Information |
Locus tag | EcHS_A1841 |
Symbol | |
ID | 5592116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1857549 |
End bp | 1858856 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920985 |
Product | rhodanese-like domain-containing protein |
Protein accession | YP_001458537 |
Protein GI | 157161219 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2897] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
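The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it is consistent with the listed 1308 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1857549
END_BP = 1858856


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1308
print(expected_protein_length(length_bp))  # 435
```

Both results agree with the Gene Length (1308 bp) and Protein Length (435 aa) rows.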
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACGTG TTTCTCAAAT GACCGCGCTG GCAATGGCTT TAGGGCTGGC TTGCGCTTCT TCGTGGGCCG CTGAACTGGC GAAGCCTCTT ACACTTGACC AGCTTCAACA ACAAAATGGC AAAGCGATAG ATACTCGCCC CAGCGCGTTT TATAACGGCT GGCCACAAAC CTTAAATGGC CCTTCTGGTC ATGAACCTGC CGCCTTAAAC CTCTCTGCCA GCTGGCTTGA CAAAATGAGC ACCGAACAGC TCAACGCGTG GATCAAGCAA CATAACCTGA AAACCGATGC TCCGGTGGCG CTGTACGGTA ATGACAAAGA TGTCGACGCC GTCAAAACGC GACTGCAAAA AGCAGGTTTA ACGCATATCT CCATCCTGAG TGACGCGCTA AGCGAACCTT CCCGTCTGCA AAAACTGCCG CATTTTGAGC AGCTGGTTTA TCCGCAATGG CTGCACGACC TGCAACAAGG TAAAGAGGTT ACGGCGAAAC CTGCCGGTGA CTGGAAAGTC ATTGAAGCGG CCTGGGGCGC TCCTAAGCTT TACCTTATCA GCCATATTCC CGGCGCTGAC TACATCGATA CCAACGAAGT GGAAAGTGAA CCGCTGTGGA ACAAAGTTTC TGATGAACAA CTAAAAGCGA TGCTGGCAAA ACACGGCATT CGCCATGACA CCACGGTCAT TCTGTATGGG CGTGACGTAT ACGCTGCAGC GCGTGTGGCG CAGATTATGC TTTATGCTGG CGTGAAAGAT GTGCGCCTGC TGGATGGCGG CTGGCAAACC TGGTCCGACG CGGGACTGCC TGTTGAGCGC GGAACGCCAC CGAAAGTGAA AGCGGAACCG GATTTCGGCG TGAAGATCCC GGCACAACCG CAACTGATGC TTGATATGGA ACAGGCGCGT GGACTGCTGC ATCGCCAGGA TGCATCGCTG GTGAGCATTC GTTCGTGGCC AGAATTTATC GGTACGACCA GCGGTTACAG CTATATTAAA CCAAAAGGTG AAATAGCCGG AGCACGTTGG GGACACGCTG GTAGCGACTC GACGCATATG GAAGATTTCC ATAACCCGGA TGGCACCATG CGTAGCGCCG ATGATATTAC CGCTATGTGG AAAGCATGGA ATATCAAACC AGATCAGCAA GTTTCATTCT ACTGCGGCAC CGGCTGGCGC GCGTCCGAAA CCTTTATGTA CGCACGAGCA ATGGGCTGGA ATAACGTCTC CGTTTACGAC GGCGGCTGGT ACGAATGGAG CAGCGATCCA AAAAATTCGG TAGCAACCGG TGAACGCGGC CCGGACAGCA GCAAATAA
Protein sequence | MKRVSQMTAL AMALGLACAS SWAAELAKPL TLDQLQQQNG KAIDTRPSAF YNGWPQTLNG PSGHEPAALN LSASWLDKMS TEQLNAWIKQ HNLKTDAPVA LYGNDKDVDA VKTRLQKAGL THISILSDAL SEPSRLQKLP HFEQLVYPQW LHDLQQGKEV TAKPAGDWKV IEAAWGAPKL YLISHIPGAD YIDTNEVESE PLWNKVSDEQ LKAMLAKHGI RHDTTVILYG RDVYAAARVA QIMLYAGVKD VRLLDGGWQT WSDAGLPVER GTPPKVKAEP DFGVKIPAQP QLMLDMEQAR GLLHRQDASL VSIRSWPEFI GTTSGYSYIK PKGEIAGARW GHAGSDSTHM EDFHNPDGTM RSADDITAMW KAWNIKPDQQ VSFYCGTGWR ASETFMYARA MGWNNVSVYD GGWYEWSSDP KNSVATGERG PDSSK
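As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates a nucleotide string and computes its GC fraction. It is a minimal example rather than the method the database used. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; since this gene begins with ATG, the standard codon assignments are sufficient here. Only the first 30 and last 9 bases of the gene are used as input.

```python
# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30 = "ATGAAACGTG TTTCTCAAAT GACCGCGCTG"  # start of the gene sequence
last_9 = "AGCAAATAA"                           # end of the gene sequence

print(translate(first_30))  # MKRVSQMTAL
print(translate(last_9))    # SK
```

The two outputs match the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content row.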