Gene EcHS_A1554 details


Gene Information

Locus tag: EcHS_A1554
Symbol:
ID: 5591690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1561196
End bp: 1562527
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 33%
IMG OID: 640920708
Product: leucine-rich repeat-containing protein
Protein accession: YP_001458264
Protein GI: 157160946
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
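
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1561196
end_bp = 1562527

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in 3 bp codons; the last codon is assumed to be a stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1332 443
```

Under these assumptions the result agrees with the listed Gene Length (1332 bp) and Protein Length (443 aa).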


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.041299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACTG ACCTTATATT ACACAATCAT CCCAGGATGA AAACAATCAC TTTAAACGAC 
AACCATATTG CACATTTAAA CGCCAAAAAC ACTACAAAAC TGGAATATTT AAACTTAAGC
AATAACAATT TACTGCCAAC CAATGACATT GATCAACTAA TATCATCAAA GCATCTTTGG
CATGTATTAG TTAACGGCAT CAACAATGAT CCACTTGCAC AAATGCAGTA CTGGACTGCA
GTAAGAAATA TAATTGATGA CACTAATGAA GTGACCATTG ATTTATCAGG ACTTAATTTA
ACCACTCAAC CACCAGGGCT GCAAAACTTC ACCTCTATCA ATCTTGATAA TAACCAACTC
ACACATTTTG ATGCAACCAA CTACGATAGA CTCGTAAAGC TAAGTCTGAA TAGTAATACT
CTTGAGTCAA TAAATTTTCC TCAAGGCAGA AATGTAAGTA TTACACATAT ATCTATGAAT
AATAATGCTC TCAGAAATAT TGATATAGAT AGGCTTTCAT CAGTTACTTA TTTTAGTGCG
GCACATAATC AACTAGAGTT TGTGCAATTA GAATCTTGCG AATGGCTGCA ATACCTGAAT
CTCAGCCATA ATCAATTAAC TGATATTGTT GCAGGTAATA AAGATGAACT CTTACTGCTG
GATCTATCCC ATAATAAACT AACAAGTTTA CACAATGACT TATTTCCCAA CTTGAATACG
TTACTTATTA ACAACAATTT GCTTTCTGAA ATTAAAATAT TCTATAGCAA CTTCTGCAAT
GTTCAGACAT TAAACGCTGC TAACAACCAG TTGAAATATA TAAATCTTGA TTTCCTGACT
TATCTTCCAT CTATCAAAAG TTTAAGACTG GACAATAATA AAATAACCCA CACTGATACT
AATAATACAT CCGATATTGG AACTTTATTC CCCATAATAA AACAGAGCAA AAACTTAAAT
TTTTTAAATG TTTCTGGGAA GAACAATTGC CCTACTATGC AGCTCATGTT ATTTAATTTA
TTTTCCCCAG CACTTAAGCT TAATACTGGC CCGGCAATTC TTTCGCCTGG TGCATTTGAA
GTTCACTCTG ACGGAATAGA TGTGGATAAC GAATTGTTTC ACTATCCTAT TAAAAAAGCA
TATACCCCAT ATAATATACA CACTTACAAG ACAGAGGAAG TTGTAAACCA GAGGAATATA
AAAGTTAAAA ACATGACCTT AGATGAAATA AACAATACTT ACTGTAATAA CGATTATTAC
AATCAGGCAA TAAGAGAGGA ACCGATAGAC CTTCTGGACA GATCGTTTTC CTCCAGTTCA
TGGCCTTTTT AG
 
Protein sequence
MITDLILHNH PRMKTITLND NHIAHLNAKN TTKLEYLNLS NNNLLPTNDI DQLISSKHLW 
HVLVNGINND PLAQMQYWTA VRNIIDDTNE VTIDLSGLNL TTQPPGLQNF TSINLDNNQL
THFDATNYDR LVKLSLNSNT LESINFPQGR NVSITHISMN NNALRNIDID RLSSVTYFSA
AHNQLEFVQL ESCEWLQYLN LSHNQLTDIV AGNKDELLLL DLSHNKLTSL HNDLFPNLNT
LLINNNLLSE IKIFYSNFCN VQTLNAANNQ LKYINLDFLT YLPSIKSLRL DNNKITHTDT
NNTSDIGTLF PIIKQSKNLN FLNVSGKNNC PTMQLMLFNL FSPALKLNTG PAILSPGAFE
VHSDGIDVDN ELFHYPIKKA YTPYNIHTYK TEEVVNQRNI KVKNMTLDEI NNTYCNNDYY
NQAIREEPID LLDRSFSSSS WPF
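
As a rough illustration of how the two sequence blocks relate, the sketch below translates the first and last lines of the gene sequence codon by codon. It assumes the sequence is given 5' to 3' on the coding strand. The codon table is built from the standard NCBI layout (bases in T, C, A, G order); translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record or of any library.

```python
from itertools import product

# Standard NCBI codon layout: first, second and third base each cycle
# through T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence shown above.
first_block = translate("ATGATCACTG ACCTTATATT ACACAATCAT")
last_block = translate("TGGCCTTTTT AG")
print(first_block, last_block)  # MITDLILHNH WPF
```

The two results match the first ten and last three residues of the protein sequence listed above.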