Gene ECD_02898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02898 
SymbolygiY 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3037921 
End bp3039270 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content53% 
IMG OID 
Productsensory histidine kinase in two-component regulatory system with QseB 
Protein accessionACT44702 
Protein GI253979032 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.760394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA CCCAACGTCT TAGTCTGCGC GTCAGGCTGA CGCTAATCTT TTTAATTCTG 
GCCTCGGTGA CCTGGCTGCT TTCCAGCTTT GTCGCCTGGA AACAAACAAC GGATAACGTC
GATGAATTGT TCGACACCCA ACTGATGCTG TTTGCCAAGC GGTTAAGTAC GCTCGATCTC
AACGAAATCA ACGCGGCGGA TCGCATGGCA CAGACGCCAA ATAGATTAAA ACACGGTCAT
GTTGATGACG ATGCGCTGAC CTTTGCCATC TTTACCCACG ACGGCAGAAT GGTCCTTAAT
GATGGCGATA ACGGAGAAGA TATTCCCTAT AGCTATCAAC GGGAAGGTTT TGCTGACGGG
CAACTGGTCG GTGAAGACGA TCCTTGGCGT TTTGTCTGGA TGACCTCACC TGATGGCAAA
TATCGCATCG TTGTTGGCCA GGAATGGGAA TACCGTGAAG ACATGGCGCT GGCGATTGTT
GCCGGGCAAT TGATCCCGTG GCTGGTCGCA CTGCCGATTA TGTTAATCAT CATGATGGTA
CTACTGGGTC GTGAACTCGC GCCGCTGAAC AAACTGGCGC TGGCACTACG TATGCGTGAC
CCTGACTCGG AAAAACCACT AAACGCGACT GGCGTACCCA GCGAAGTGCG GCCACTGGTT
GAGTCGCTAA ATCAACTGTT CGCCCGCACA CATGCGATGA TGGTTCGTGA ACGACGCTTT
ACCTCCGACG CAGCTCACGA ACTTCGTAGC CCGTTAACGG CGCTGAAAGT GCAAACCGAA
GTTGCGCAGC TCTCTGACGA TGATCCGCAG GCGCGGAAAA AAGCACTGCT CCAATTACAT
TCCGGGATCG ATCGCGCTAC TCGTCTGGTT GATCAACTGC TCACGCTATC GCGGCTGGAC
TCACTGGATA ACCTTCAGGA CGTCGCGGAG ATCCCGCTTG AAGATCTCCT GCAATCGTCG
GTGATGGATA TTTACCACAC GGCGCAGCAG GCGAAAATTG ACGTGCGACT GACACTCAAT
GCCCACAGCA TCAAACGCAC CGGGCAACCG CTATTGCTAA GTTTGTTGGT GCGAAATTTG
CTGGATAACG CCGTGCGCTA CAGTCCACAG GGCAGCGTGG TAGACGTCAC GCTGAATGCT
GATAATTTCA TCGTGAGGGA TAACGGCCCC GGTGTGACAC CAGAGGCACT GGCGCGAATT
GGCGAACGCT TCTATCGCCC ACCCGGACAA ACCGCTACCG GCAGCGGGCT TGGGCTATCG
ATTGTCCAGC GAATCGCCAA ATTGCATGGC ATGAATGTTG AATTTGGGAA TGCGGAACAA
GGTGGATTTG AGGCGAAGGT AAGCTGGTAA
 
Protein sequence
MKFTQRLSLR VRLTLIFLIL ASVTWLLSSF VAWKQTTDNV DELFDTQLML FAKRLSTLDL 
NEINAADRMA QTPNRLKHGH VDDDALTFAI FTHDGRMVLN DGDNGEDIPY SYQREGFADG
QLVGEDDPWR FVWMTSPDGK YRIVVGQEWE YREDMALAIV AGQLIPWLVA LPIMLIIMMV
LLGRELAPLN KLALALRMRD PDSEKPLNAT GVPSEVRPLV ESLNQLFART HAMMVRERRF
TSDAAHELRS PLTALKVQTE VAQLSDDDPQ ARKKALLQLH SGIDRATRLV DQLLTLSRLD
SLDNLQDVAE IPLEDLLQSS VMDIYHTAQQ AKIDVRLTLN AHSIKRTGQP LLLSLLVRNL
LDNAVRYSPQ GSVVDVTLNA DNFIVRDNGP GVTPEALARI GERFYRPPGQ TATGSGLGLS
IVQRIAKLHG MNVEFGNAEQ GGFEAKVSW