Gene EcolC_2985 details

Gene Information

Locus tag: EcolC_2985
Symbol:
ID: 6065846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3262953
End bp: 3264032
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 54%
IMG OID: 641602402
Product: PhoH family protein
Protein accession: YP_001725937
Protein GI: 170020983
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:
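
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3262953
end_bp = 3264032

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1080
print(protein_length)  # 359
```

Under these assumptions the computed values agree with the reported gene length of 1080 bp and protein length of 359 aa.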


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.189197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGCT TCCGGCGGCA TACAGATAAG AGGAACGGTT TGAACATAGA CACTCGCGAA 
ATCACCCTGG AGCCAGCAGA CAACGCGCGT CTGTTGAGCC TGTGCGGCCC GTTTGATGAC
AACATCAAGC AGCTCGAACG CCGTCTCGGC ATCGAGATCA ATCGCCGCGA CAACCACTTT
AAACTGACCG GCCGTCCGAT TTGCGTCACC GCTGCGGCAG ACATTCTGCG TAGCCTGTAT
GTCGATACAG CCCCGATGCG CGGTCAGATT CAGGATATCG AACCGGAACA GATCCACCTT
GCGATTAAAG AAGCAAGGGT ACTGGAGCAA AGCGCGGAGA GCGTGCCGGA GTACGGCAAA
GCGGTCAATA TCAAAACCAA ACGCGGCGTA ATTAAGCCGC GCACGCCAAA CCAGGCGCAG
TACATCGCCA ATATTCTCGA CCATGACATT ACCTTCGGCG TTGGCCCGGC GGGTACGGGT
AAAACCTACC TGGCAGTCGC TGCAGCAGTT GATGCCCTGG AGCGTCAGGA AATTCGCCGT
ATTCTGCTGA CTCGTCCGGC GGTCGAAGCC GGTGAGAAAC TGGGCTTCCT GCCTGGCGAT
TTAAGCCAGA AAGTAGACCC GTATCTGCGC CCACTGTACG ACGCGCTGTT TGAAATGCTG
GGCTTTGAGA AAGTCGAGAA ACTGATTGAG CGCAACGTTA TTGAAGTCGC GCCGCTGGCC
TATATGCGTG GTCGTACGCT GAATGACGCG TTTATCATTC TCGATGAGAG CCAGAACACC
ACCATCGAAC AGATGAAGAT GTTCCTGACC CGTATCGGTT TTAACTCAAA AGCGGTTATC
ACCGGCGACG TCACACAGAT CGACCTGCCG CGTAATACTA AATCAGGCTT ACGTCATGCC
ATCGAAGTGC TGGCCGATGT CGAAGAGATC AGCTTTAACT TCTTCCACAG CGAAGACGTG
GTTCGTCACC CGGTGGTGGC GCGTATCGTT AACGCCTATG AAGCCTGGGA AGAAGCCGAA
CAAAAACGTA AAGCGGCGCT GGCGGCAGAA CGCAAGCGCG AAGAACAGGA ACAAAAATGA
 
Protein sequence
MFSFRRHTDK RNGLNIDTRE ITLEPADNAR LLSLCGPFDD NIKQLERRLG IEINRRDNHF 
KLTGRPICVT AAADILRSLY VDTAPMRGQI QDIEPEQIHL AIKEARVLEQ SAESVPEYGK
AVNIKTKRGV IKPRTPNQAQ YIANILDHDI TFGVGPAGTG KTYLAVAAAV DALERQEIRR
ILLTRPAVEA GEKLGFLPGD LSQKVDPYLR PLYDALFEML GFEKVEKLIE RNVIEVAPLA
YMRGRTLNDA FIILDESQNT TIEQMKMFLT RIGFNSKAVI TGDVTQIDLP RNTKSGLRHA
IEVLADVEEI SFNFFHSEDV VRHPVVARIV NAYEAWEEAE QKRKAALAAE RKREEQEQK
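
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce this record. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). Only the first line of the gene sequence is used here, and the whitespace between the 10-base groups is stripped before translating.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTCAGCT TCCGGCGGCA TACAGATAAG AGGAACGGTT TGAACATAGA CACTCGCGAA"
print(translate(first_line))  # MFSFRRHTDKRNGLNIDTRE
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the reported GC content to be compared in the same way.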