Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1400 |
Symbol | |
ID | 5592703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1396540 |
End bp | 1397745 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920555 |
Product | hypothetical protein |
Protein accession | YP_001458114 |
Protein GI | 157160796 |
COG category | [S] Function unknown |
COG ID | [COG4950] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.0205416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTATCGC CGATCCGTCT TTCTCCCCTT CCCGCCTTGC GTCAGGATAA CGATTTCCTT TACGACCAAG GAGCGCCCAT GGAACAACGC CACATCACCG GCAAAAGCCA CTGGTATCAT GAAACGCAAT CCAGTACTGC GGAGTATGAC GTTCTGCCTC TGGTCCCGGA AGCCGCAAAG GTCAGCGATC CCTTTCTGCT CGACGTGATC CTTGATGAAG AAACGCTGGC CCCCTTCCTT TCATGGCTGG TCCCTGCGCG CGTTCTTGCA GTGGAATTGT TCCCTGACCA GCTTACCGTG ACCCGTTCAC AGACTTTCAC CGCTTATGAA CGCTTGTCTA CGGCCCTGAC GGTTGCTCAG GTTTGCGGCG TCCAGCGGTT ATGTAACTAC TATTCGGCGC GACTTACGCC GCTCCCCGGG CCTGATTCCT CCAGGGAAAG TAATCATCGG TTGGCACAAA TCACGCAATA TGCCCGCCAA CTGGTTAGCT CGCCTTCTAT TATCGACAAC CGATCGCGCC AGCATCTGAA TGACGTCGGT CTTACTGCCT GGGACTGTGT AATCATTAAC CAAATCATTG GATTTATTGG CTTTCAGGCG CGGACCATTG CGACATTTCA GGCTTATCTT GGACATCCGG TACGCTGGTT ACCCGGTCTG GAGATACAAA ACTATGCCGA CGCGTCACTG TTTACTGATG AATCAATACG CTGGCGAAGC AGCTATGAGG TGGAAAAACT ACCTGAAGAT TACACAAAAA GTTCAACTGC AGAACTTTGC CAACTGGCTG AAACACTCTC TCTCCACCCT ATTTCACTTT CCCTTCTTGA AAAGTTGTTA AACAGCACAC GGGTTAATAC ACAGCCGGAT AATCAGCTTG CGGCGTTGTT ATGCGCACGG ATAAATGGCA GCCCTGCTTG TTTTGCCGCC TGTATGGATT CAGTAAATGA ATATAAAAAA ATCAACACCC TTCTGCGCAA GGGCGAAAAT GAAATTAACC AATGGGCTGA CCGTCATTCT GTTGAGCACG CTACCGTTCA GGCGATACAA TGGCTGACCC GAGCACCCGA TCGCTTTAGC GCCGCCCAGT TCAGCCCTTT ACTCGAACAC GAAAAATCAT CAACGCAGAT TATTAATCTG CTGGTATGGA GCGGGCTGTG TGGCTGGATA AATCGCTTAA AAATCGCGTT GGGTGAGACA TATTAA
|
Protein sequence | MLSPIRLSPL PALRQDNDFL YDQGAPMEQR HITGKSHWYH ETQSSTAEYD VLPLVPEAAK VSDPFLLDVI LDEETLAPFL SWLVPARVLA VELFPDQLTV TRSQTFTAYE RLSTALTVAQ VCGVQRLCNY YSARLTPLPG PDSSRESNHR LAQITQYARQ LVSSPSIIDN RSRQHLNDVG LTAWDCVIIN QIIGFIGFQA RTIATFQAYL GHPVRWLPGL EIQNYADASL FTDESIRWRS SYEVEKLPED YTKSSTAELC QLAETLSLHP ISLSLLEKLL NSTRVNTQPD NQLAALLCAR INGSPACFAA CMDSVNEYKK INTLLRKGEN EINQWADRHS VEHATVQAIQ WLTRAPDRFS AAQFSPLLEH EKSSTQIINL LVWSGLCGWI NRLKIALGET Y
|
| |