Gene Nmul_A2696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2696 
Symbol 
ID3785058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3100617 
End bp3101657 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID637812786 
ProductPhoH-like protein 
Protein accessionYP_413375 
Protein GI82703809 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAAAGAA GTTCGGTCGC TGCCAGAATT AAGACCTCAC CTCCATATTT GTGTCGCTTT 
ACCGCGCCAT TGAAGCCCAA ATCCGTAGAA ATTTCCTTTT CCCCCGCTGA CAACCAGCGT
CTGGCGAACC TGTGTGGTGT GCTGGATGAA AACCTGAGGC AGATCGAGAC GGTTCTCGAT
GTCGCAATTG CAAGGCGGGG AGAACATTTC AGTATCCGGG GGAAGCCACC CCAGACTTCA
CTTGCCGCGG AAGCTCTGCA GAACTTCTAC GATCAGGCGC ACCATCCTCT GGGCATCGAA
CAGATTCAAC TGGGCCTGAT CGAGGCGATG AATCCACATC ACCAGAAAAA ACAGGGGCCA
GATGCCAAGG AAATAGGGGA GCCCGCCCTG TATACGCGGC GTAGCGATTT GCGCGGACGC
ACACGCCGCC AAGTGGAGTA TCTGCACCAG ATCCAGACAC ATGACATCAC TTTTTCCATC
GGCCCCGCAG GCACCGGGAA AACGTATCTT GCGGTTGCAA GCGCAGTGGA TGCACTCGAG
CGGGATATCG TGGCGCGTAT CATACTGGTG CGGCCTGCGG TGGAAGCAGG CGAACGCCTG
GGATTTTTAC CGGGTGATAT GGTGCAGAAA GTGGATCCTT ATTTGCGTCC CCTTTACGAT
GCGCTCTACG ATCTGATGGG GTTCGATAAA ACCAGCAAAC AGTTTGAACG AAACGCAATC
GAGGTGGCTC CGCTTGCATT CATGCGCGGG CGAACGCTGA ACCAGTCTTT CATCATTCTG
GATGAGGCGC AAAATACCAC GCCGGAACAG ATGAAAATGT TCCTGACCCG CATCGGCTTT
GGCTCCAAGG CCGTCGTGAC AGGGGATATC ACGCAGATTG ACCTTGCAAA ACATCAGAAA
AGCGGCTTGG TGGAAGCCCA GCAGGTTCTT GAAAAAGTGC GGGGCATTGC CTTTACGCGG
TTCGATGCAG AGGATGTGGT GCGGCATCCG CTGGTGCAAA GAATTGTCAA TGCCTATGAG
AAATATGAAA GAAAGGAGTA G
 
Protein sequence
MQRSSVAARI KTSPPYLCRF TAPLKPKSVE ISFSPADNQR LANLCGVLDE NLRQIETVLD 
VAIARRGEHF SIRGKPPQTS LAAEALQNFY DQAHHPLGIE QIQLGLIEAM NPHHQKKQGP
DAKEIGEPAL YTRRSDLRGR TRRQVEYLHQ IQTHDITFSI GPAGTGKTYL AVASAVDALE
RDIVARIILV RPAVEAGERL GFLPGDMVQK VDPYLRPLYD ALYDLMGFDK TSKQFERNAI
EVAPLAFMRG RTLNQSFIIL DEAQNTTPEQ MKMFLTRIGF GSKAVVTGDI TQIDLAKHQK
SGLVEAQQVL EKVRGIAFTR FDAEDVVRHP LVQRIVNAYE KYERKE