Gene EcolC_3025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3025 
Symbol 
ID6066019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3303342 
End bp3305000 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID641602441 
Productsignal transduction histidine kinase regulating citrate/malate metabolism 
Protein accessionYP_001725976 
Protein GI170021022 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCAGC TTAACGAGAA TAAACAGTTT GCATTTTTCC AAAGACTGGC ATTTCCGCTG 
CGTATCTTTT TGCTGATTCT GGTGTTCTCA ATATTTGTCA TTGCAGCCCT GGCGCAATAT
TTTACGGCCA GTTTTGAGGA CTATTTAACG CTTCATGTAC GCGACATGGC AATGAATCAG
GCGAAAATTA TTGCCTCCAA TGACAGTGTC ATCTCTGCGG TGAAAACGCG TGACTACAAA
CGGCTGGCGA CCATCGCTAA CAAATTACAA AGAGATACCG ATTTTGATTA TGTGGTGATT
GGGGACCGGC ACTCGATCCG CCTTTACCAT CCTAATCCGG AGAAAATTGG TTATCCTATG
CAGTTCACCA AACAGGGCGC GCTGGAGAAA GGGGAGAGCT ACTTCATTAC CGGGAAAGGG
TCAATGGGGA TGGCGATGCG CGCCAAAACG CCAATCTTTG ATGACGATGG AAAAGTCATC
GGCGTGGTGT CGATTGGCTA CCTGGTGAGT AAAATCGATA GCTGGCGGGC TGAGTTTTTA
TTACCGATGG CAGGCGTGTT TGTCGTGCTG TTAGGGATTC TGATGTTGCT GTCGTGGTTC
CTGGCAGCGC ATATCCGTCG GCAGATGATG GGCATGGAGC CAAAGCAAAT CGCACGCGTG
GTCCGTCAGC AAGAGGCGCT GTTTAGTTCG GTTTATGAAG GGCTGATTGC GGTGGATCCG
CATGGTTACA TTACCGCCAT CAATCGTAAC GCAAGAAAGA TGCTGGGGCT GAGCTCTCCC
GGACGGCAAT GGTTGGGTAA ACCCATTGCT GAAGTGGTCA GGCCCGCCGA TTTCTTTACC
GAACAGATTG ATGAAAAACG TCAGGATGTG GTGGCTAACT TTAACGGTCT GAGCGTTATT
GCCAATCGGG AAGCTATTCG TTCAGGTGAT GATTTGCTGG GGGCCATTAT CAGCTTTCGC
AGTAAAGACG AAATTTCCAC CCTCAATGCG CAACTGACGC AAATAAAACA ATACGTTGAG
AGCCTTCGTA CATTGCGACA CGAGCATCTC AATTGGATGT CGACGCTCAA TGGTCTGTTG
CAGATGAAAG AGTATGATCG CGTGCTGGCG ATGGTGCAGG GGGAGTCTCA GGCCCAGCAA
CAGCTTATTG ACAGCCTGCG CGAGGCGTTT GCCGATCGCC AGGTGGCGGG GCTACTTTTT
GGTAAAGTGC AGCGCGCCCG GGAACTGGGG CTAAAAATGA TCATTGTCCC CGGTAGCCAG
CTTTCGCAAC TGCCGCCAGG ACTGGATAGC ACCGAGTTTG CAGCCATTGT GGGCAATTTA
CTTGATAACG CCTTCGAAGC CAGCCTGCGT AGCGATGAAG GAAACAAGAG CGTTGAATTA
TTCCTCAGCG ATGAAGGCGA TGATGTGGTG ATTGAAGTCG CCGATCAGGG CTGCGGCGTT
CCAGAGTCTC TACGAGACAA AATATTTGAG CAGGGGGTCA GTACGCGTGC TGACGAGCCC
GGCGAACATG GCATTGGGTT GTACTTGATT GCCAGCTACG TAACGCGCTG CGGTGGTGTT
ATCACTCTCG AAGATAATGA TCCCTGCGGT ACCTTGTTTT CAATCTATAT TCCGAAAGTG
AAACCTAATG ACAGCTCCAT TAACCCTATT GATCGTTGA
 
Protein sequence
MLQLNENKQF AFFQRLAFPL RIFLLILVFS IFVIAALAQY FTASFEDYLT LHVRDMAMNQ 
AKIIASNDSV ISAVKTRDYK RLATIANKLQ RDTDFDYVVI GDRHSIRLYH PNPEKIGYPM
QFTKQGALEK GESYFITGKG SMGMAMRAKT PIFDDDGKVI GVVSIGYLVS KIDSWRAEFL
LPMAGVFVVL LGILMLLSWF LAAHIRRQMM GMEPKQIARV VRQQEALFSS VYEGLIAVDP
HGYITAINRN ARKMLGLSSP GRQWLGKPIA EVVRPADFFT EQIDEKRQDV VANFNGLSVI
ANREAIRSGD DLLGAIISFR SKDEISTLNA QLTQIKQYVE SLRTLRHEHL NWMSTLNGLL
QMKEYDRVLA MVQGESQAQQ QLIDSLREAF ADRQVAGLLF GKVQRARELG LKMIIVPGSQ
LSQLPPGLDS TEFAAIVGNL LDNAFEASLR SDEGNKSVEL FLSDEGDDVV IEVADQGCGV
PESLRDKIFE QGVSTRADEP GEHGIGLYLI ASYVTRCGGV ITLEDNDPCG TLFSIYIPKV
KPNDSSINPI DR