Gene Nmul_A1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1946 
Symbol 
ID3785123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2237088 
End bp2238539 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content54% 
IMG OID637812033 
ProductPhoH-like protein 
Protein accessionYP_412633 
Protein GI82703067 
COG category[T] Signal transduction mechanisms 
COG ID[COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCCTA GAAAGCCGAG ATCTCCCACG GCTCCCAATT CATCTGCTTC CATCGCTCCG 
ATCCCCCGTC TGATGCCCAA GCTGTTTGTG CTCGACAGCA ACGTGCTCAT GCACGACCCC
ACCAGCCTGT TCCGCTTTCA GGAACATGAC ATCTATATCA CGATGACAAC GCTGGAGGAA
CTCGACAACA ACAAGAAAGG GATGTCGGAG GTTGCCCGCA ATGCACGTCA AACAAGCCGT
TTTCTGGATG AAATCGTCAG CAGCGCGATA ACCGATATTG ACGAAGGCAT TTCTTTACAG
TTGCATGGCA CCAAGAATGC AACCGGCAGG CTGTTCCTCC AGACGCAGGC GATCACGAAC
GTCTTGCCGG TATATCTGGC AAGCGGCAGC GCGGACAACC AGATTATCGG AGCAGTCAAG
TTTTTGCACG ATACCCACCA GAACCGCGTG GTCACACTGG TTTCCAAAGA CATCAACATG
CGGATCAAGG CGCGAGCCTT GGGGCTGGCA GCGGAGGACT ATTTCAATGA CAAGGTTCTG
GAAGACACTG ATGTCCTCTT CTCAGGCATC CAGGAACTGC CAGAGGATTT CTGGGATGAA
CACGGCAAGG ACATGGAGTC CTGGCAGCAG TCGGGGCAGA CATTTTATCG TGTGACCGGC
CCCCTTGCCG GGGGTTTTGT AATCAATCAA TTCGTCTATC TGGAGCATGA CAAACCGTTT
TACGCGCAGG TCAAGGAGAC TAGCGGAAGA ACTGCCGTTC TGCAGACGCT GAAAGATTAT
ACTCATCAAA AGAACAATGT GTGGGGCATC ACGGCGCGGA ATCGCGAGCA GAATTTCGCG
TTCAACCTGC TGATGAACCC GGAAGTGGAT TTCGTCACCC TGCTGGGGCA GGCAGGTACC
GGCAAGACGT TGCTTACACT GGCAGCAGGT CTGATGCAGA CACTGGAGCA CAAGGTATAC
TCCGAAATCA TCATGACGCG CGTGACGGTG CCAGTGGGGG AGGATATCGG ATTTCTGCCC
GGAACCGAGG AAGAAAAGAT GACTCCCTGG ATGGGGGCAC TGGAAGACAA CCTGGACGTG
CTCAACAAGA CGGATAGCAG CGCCGGAGAA TGGGGACGAG CAGCGACGCT CGATCTGATT
CGCTCCCGCA TCAAGATAAA ATCGCTCAAC TTCATGCGCG GGCGCACTTT CATCAATAAG
TTCCTGATAA TCGACGAAGC GCAGAACCTG ACACCCAAGC AGATGAAAAC GCTTATTACC
CGTGCCGGCC CTGCCACAAA GGTCGTGTGC CTGGGTAACA TCGCGCAGAT AGATACGCCC
TACTTGTCGG AGGGAAGCTC AGGGTTGACC TACGTGGTGG ACCGGTTCAA GGGATGGAAT
CACAGCGGGC ACGTCACGCT GCAACGTGGT GAGCGTTCCA GACTGGCGGA TTATGCTGCA
GAGATACTAT AA
 
Protein sequence
MSPRKPRSPT APNSSASIAP IPRLMPKLFV LDSNVLMHDP TSLFRFQEHD IYITMTTLEE 
LDNNKKGMSE VARNARQTSR FLDEIVSSAI TDIDEGISLQ LHGTKNATGR LFLQTQAITN
VLPVYLASGS ADNQIIGAVK FLHDTHQNRV VTLVSKDINM RIKARALGLA AEDYFNDKVL
EDTDVLFSGI QELPEDFWDE HGKDMESWQQ SGQTFYRVTG PLAGGFVINQ FVYLEHDKPF
YAQVKETSGR TAVLQTLKDY THQKNNVWGI TARNREQNFA FNLLMNPEVD FVTLLGQAGT
GKTLLTLAAG LMQTLEHKVY SEIIMTRVTV PVGEDIGFLP GTEEEKMTPW MGALEDNLDV
LNKTDSSAGE WGRAATLDLI RSRIKIKSLN FMRGRTFINK FLIIDEAQNL TPKQMKTLIT
RAGPATKVVC LGNIAQIDTP YLSEGSSGLT YVVDRFKGWN HSGHVTLQRG ERSRLADYAA
EIL