Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1554 |
Symbol | |
ID | 5591690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1561196 |
End bp | 1562527 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640920708 |
Product | leucine-rich repeat-containing protein |
Protein accession | YP_001458264 |
Protein GI | 157160946 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.041299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACTG ACCTTATATT ACACAATCAT CCCAGGATGA AAACAATCAC TTTAAACGAC AACCATATTG CACATTTAAA CGCCAAAAAC ACTACAAAAC TGGAATATTT AAACTTAAGC AATAACAATT TACTGCCAAC CAATGACATT GATCAACTAA TATCATCAAA GCATCTTTGG CATGTATTAG TTAACGGCAT CAACAATGAT CCACTTGCAC AAATGCAGTA CTGGACTGCA GTAAGAAATA TAATTGATGA CACTAATGAA GTGACCATTG ATTTATCAGG ACTTAATTTA ACCACTCAAC CACCAGGGCT GCAAAACTTC ACCTCTATCA ATCTTGATAA TAACCAACTC ACACATTTTG ATGCAACCAA CTACGATAGA CTCGTAAAGC TAAGTCTGAA TAGTAATACT CTTGAGTCAA TAAATTTTCC TCAAGGCAGA AATGTAAGTA TTACACATAT ATCTATGAAT AATAATGCTC TCAGAAATAT TGATATAGAT AGGCTTTCAT CAGTTACTTA TTTTAGTGCG GCACATAATC AACTAGAGTT TGTGCAATTA GAATCTTGCG AATGGCTGCA ATACCTGAAT CTCAGCCATA ATCAATTAAC TGATATTGTT GCAGGTAATA AAGATGAACT CTTACTGCTG GATCTATCCC ATAATAAACT AACAAGTTTA CACAATGACT TATTTCCCAA CTTGAATACG TTACTTATTA ACAACAATTT GCTTTCTGAA ATTAAAATAT TCTATAGCAA CTTCTGCAAT GTTCAGACAT TAAACGCTGC TAACAACCAG TTGAAATATA TAAATCTTGA TTTCCTGACT TATCTTCCAT CTATCAAAAG TTTAAGACTG GACAATAATA AAATAACCCA CACTGATACT AATAATACAT CCGATATTGG AACTTTATTC CCCATAATAA AACAGAGCAA AAACTTAAAT TTTTTAAATG TTTCTGGGAA GAACAATTGC CCTACTATGC AGCTCATGTT ATTTAATTTA TTTTCCCCAG CACTTAAGCT TAATACTGGC CCGGCAATTC TTTCGCCTGG TGCATTTGAA GTTCACTCTG ACGGAATAGA TGTGGATAAC GAATTGTTTC ACTATCCTAT TAAAAAAGCA TATACCCCAT ATAATATACA CACTTACAAG ACAGAGGAAG TTGTAAACCA GAGGAATATA AAAGTTAAAA ACATGACCTT AGATGAAATA AACAATACTT ACTGTAATAA CGATTATTAC AATCAGGCAA TAAGAGAGGA ACCGATAGAC CTTCTGGACA GATCGTTTTC CTCCAGTTCA TGGCCTTTTT AG
|
Protein sequence | MITDLILHNH PRMKTITLND NHIAHLNAKN TTKLEYLNLS NNNLLPTNDI DQLISSKHLW HVLVNGINND PLAQMQYWTA VRNIIDDTNE VTIDLSGLNL TTQPPGLQNF TSINLDNNQL THFDATNYDR LVKLSLNSNT LESINFPQGR NVSITHISMN NNALRNIDID RLSSVTYFSA AHNQLEFVQL ESCEWLQYLN LSHNQLTDIV AGNKDELLLL DLSHNKLTSL HNDLFPNLNT LLINNNLLSE IKIFYSNFCN VQTLNAANNQ LKYINLDFLT YLPSIKSLRL DNNKITHTDT NNTSDIGTLF PIIKQSKNLN FLNVSGKNNC PTMQLMLFNL FSPALKLNTG PAILSPGAFE VHSDGIDVDN ELFHYPIKKA YTPYNIHTYK TEEVVNQRNI KVKNMTLDEI NNTYCNNDYY NQAIREEPID LLDRSFSSSS WPF
|
| |