Gene ECD_00589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00589 
SymbolcitA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp612602 
End bp614260 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID 
Productsensory histidine kinase in two-component regulatory system with citB 
Protein accessionACT42467 
Protein GI253976797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCAGC TTAACGAGAA TAAACAGTTT GCATTTTTCC AAAGACTGGC ATTTCCGCTG 
CGTATCTTTT TGCTGATTCT GGTGTTCTCA ATATTTGTCA TTGCAGCCCT GGCGCAATAT
TTTACGGCCA GTTTTGAGGA CTATTTAACG CTTCATGTAC GCGACATGGC AATGAATCAG
GCGAAAATTA TTGCCTCCAA TGACAGTGTC ATCTCTGCGG TGAAAACGCG TGACTACAAA
CGGCTGGCGA CCATCGCTAA CAAATTACAA AGAGATACCG ATTTTGATTA TGTGGTGATT
GGGGACCGGC ACTCGATCCG CCTTTACCAT CCTAATCCGG AGAAAATTGG TTATCCTATG
CAGTTCACCA AACAGGGCGC GCTGGAGAAA GGGGAGAGCT ACTTCATTAC CGGGAAAGGG
TCAATGGGGA TGGCGATGCG CGCCAAAACG CCAATCTTTG ATGACGATGG AAAAGTCATC
GGCGTGGTGT CGATTGGCTA CCTGGTGAGT AAAATCGATA GCTGGCGGGC TGAGTTTTTA
TTACCGATGG CAGGCGTGTT TGTCGTGCTG TTAGGGATTC TGATGTTGCT GTCGTGGTTC
CTGGCAGCGC ATATCCGTCG GCAAATGATG GGCATGGAGC CAAAGCAAAT CGCACGCGTG
GTCCGTCAGC AAGAGGCGCT GTTTAGTTCG GTTTATGAAG GGCTGATTGA GGTGGATCCG
CATGGTTACA TTACCGCCAT CAATCGTAAC GCAAGAAAGA TGCTGGGGCT GAGCTCTCCC
GGACGGCAAT GGTTGGGTAA ACCCATTGCT GAAGTGGTCA GGCCCGCCGA TTTCTTTACC
GAACAGATTG ATGAAAAACG TCAGGATGTG GTGGCTAACT TTAACGGTCT GAGCGTTATT
GCCAATCGGG AAGCTATTCG TTCAGGTGAT GATTTGCTGG GGGCCATTAT CAGCTTTCGT
AGTAAAGACG AAATTTCCAC CCTCAATGCG CAACTGACGC AAATAAAACA ATACGTCGAG
AGCCTTCGTA CATTGCGACA CGAGCATCTC AATTGGATGT CGACGCTCAA TGGTCTGTTG
CAGATGAAAG AGTATGATCG CGTGCTGGCG ATGGTGCAGG GGGAGTCTCA GGCCCAGCAA
CAGCTTATTG ACAGCCTGCG TGAGGCGTTT GCCGATCGCC AGGTGGCGGG GCTGCTTTTT
GGTAAAGTGC AGCGCGCCCG GGAACTGGGG CTAAAAATGA TCATTGTCCC CGGTAGCCAG
CTTTCGCAAC TGCCGCCAGG ACTGGATAGC ACCGAGTTTG CAGCCATTGT GGGCAATTTA
CTTGATAACG CCTTCGAAGC CAGCCTGCGT AGCGATGAAG GAAACAAGAT CGTTGAATTA
TTCCTCAGCG ATGAAGGCGA TGATGTGGTG ATTGAAGTCG CCGATCAGGG CTGCGGCGTT
CCAGAGTCTC TACGAGACAA AATATTTGAG CAGGGGGTCA GTACGCGTGC TGACGAGCCC
GGTGAACATG GCATTGGGTT GTACTTGATT GCCAGCTACG TAACGCGCTG CGGTGGTGTT
ATCACTCTCG AAGATAATGA TCCCTGCGGT ACCTTGTTTT CAATCTATAT TCCGAAAGTG
AAACCTAATG ACAGCTCCAT TAACCCTATT GATCGTTGA
 
Protein sequence
MLQLNENKQF AFFQRLAFPL RIFLLILVFS IFVIAALAQY FTASFEDYLT LHVRDMAMNQ 
AKIIASNDSV ISAVKTRDYK RLATIANKLQ RDTDFDYVVI GDRHSIRLYH PNPEKIGYPM
QFTKQGALEK GESYFITGKG SMGMAMRAKT PIFDDDGKVI GVVSIGYLVS KIDSWRAEFL
LPMAGVFVVL LGILMLLSWF LAAHIRRQMM GMEPKQIARV VRQQEALFSS VYEGLIEVDP
HGYITAINRN ARKMLGLSSP GRQWLGKPIA EVVRPADFFT EQIDEKRQDV VANFNGLSVI
ANREAIRSGD DLLGAIISFR SKDEISTLNA QLTQIKQYVE SLRTLRHEHL NWMSTLNGLL
QMKEYDRVLA MVQGESQAQQ QLIDSLREAF ADRQVAGLLF GKVQRARELG LKMIIVPGSQ
LSQLPPGLDS TEFAAIVGNL LDNAFEASLR SDEGNKIVEL FLSDEGDDVV IEVADQGCGV
PESLRDKIFE QGVSTRADEP GEHGIGLYLI ASYVTRCGGV ITLEDNDPCG TLFSIYIPKV
KPNDSSINPI DR