Gene EcSMS35_0682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0682 
Symbol 
ID6145280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp691795 
End bp692835 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID641615572 
ProductPhoH family protein 
Protein accessionYP_001742778 
Protein GI170681748 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0121436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACATAG ACACTCGCGA AATCACCCTG GAGCCAGCAG ACAATGCGCG TCTGTTGAGC 
CTGTGCGGCC CGTTTGATGA CAACATCAAG CAGCTCGAAC GCCGTCTCGG CATCGAGATC
AATCGCCGCG ATAACCACTT TAAACTGACC GGCCGTCCGA TTTGCGTCAC CGCTGCGGCA
GACATTCTGC GTAGCCTGTA TGTCGATACT GCCCCGATGC GTGGTCAGAT TCAGGATATC
GAACCGGAAC AGATCCACCT TGCGATTAAA GAAGCGCGGG TACTGGAGCA AAGCGCGGAG
AGCGTGCCGG AGTACGGCAA AGCGGTCAAT ATCAAAACCA AACGCGGCGT AATCAAGCCG
CGTACGCCAA ACCAGGCGCA GTACATCGCC AATATTCTCG ACCATGACAT CACCTTCGGC
GTTGGCCCGG CGGGTACGGG TAAAACCTAC CTGGCAGTGG CTGCGGCAGT TGATGCCCTG
GAGCGTCAGG AAATTCGCCG TATTCTGCTG ACTCGTCCGG CGGTAGAAGC CGGTGAGAAA
CTGGGCTTCC TGCCTGGCGA TTTAAGCCAG AAAGTAGACC CGTATTTGCG CCCACTGTAC
GACGCGCTGT TTGAAATGCT GGGCTTTGAG AAAGTCGAGA AACTGATTGA GCGCAACGTT
ATTGAAGTCG CGCCGCTGGC CTATATGCGT GGTCGTACGC TGAACGACGC GTTTATCATT
CTCGATGAGA GCCAGAACAC TACCATCGAA CAGATGAAGA TGTTCCTGAC CCGTATCGGT
TTTAACTCAA AAGCGGTTAT CACCGGCGAC GTCACGCAGA TCGACTTGCC GCGTAATACT
AAATCAGGCT TACGTCACGC TATCGAAGTG TTAGCCGATG TCGAAGAGAT CAGCTTTAAC
TTCTTCCACA GCGAAGACGT GGTTCGTCAC CCGGTGGTGG CGCGTATCGT TAACGCTTAT
GAAGCCTGGG AAGAAGCAGA ACAAAAACGT AAAGCGGCGC TGGCAGCAGA ACGCAAGCGC
GAAGAACAGG AACAAAAATG A
 
Protein sequence
MNIDTREITL EPADNARLLS LCGPFDDNIK QLERRLGIEI NRRDNHFKLT GRPICVTAAA 
DILRSLYVDT APMRGQIQDI EPEQIHLAIK EARVLEQSAE SVPEYGKAVN IKTKRGVIKP
RTPNQAQYIA NILDHDITFG VGPAGTGKTY LAVAAAVDAL ERQEIRRILL TRPAVEAGEK
LGFLPGDLSQ KVDPYLRPLY DALFEMLGFE KVEKLIERNV IEVAPLAYMR GRTLNDAFII
LDESQNTTIE QMKMFLTRIG FNSKAVITGD VTQIDLPRNT KSGLRHAIEV LADVEEISFN
FFHSEDVVRH PVVARIVNAY EAWEEAEQKR KAALAAERKR EEQEQK