Gene Nmul_A2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2232 
Symbol 
ID3784933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2535510 
End bp2536811 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content56% 
IMG OID637812320 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_412916 
Protein GI82703350 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR02966] phosphate regulon sensor kinase PhoR 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGGTT TCTGGGGGCA TTTTGCCAAT CTCGTACTGA TTGCCGTTGT AAGCGGAGTA 
CTGATGACTG TGTCCGGCGC AGCCGCGGCG CTGACATTTT ATGTCGTTGT TCTGTTCCTG
CTGGTCCTGC GCCACTCGCA CAATCTGAAG CGCCTGGACC GCTGGCTTCG TAGTGAAGAA
GCTGTGCTAC CCGACAGCTC CGGCAGATGG GGAGATGTAT TCGCGCGCCT GGCCCGGCTC
ATGCGCGATC AAAAACAAAC CCACCAAAAC CTCAGTTCCG CTCTGGAGCG CCTGCGGAGC
GCCACCTCAG CCATGCCCGA GGGAGTGGTC ATCCTCGATG AAATGGACCG CATCGAATGG
TGCAATCCAG TGGCCGAAAA ACATCTGGGC ATCAATGCCA GTCTCGATAC CGGGCAGCAT
ATCACGCATC TGATGCGTCA AACCCAGTTC GCGGAATATC TTGCCGCGCG GAATTACAAG
GAACCCCTGG TGATCAAACA ACCGCGGCAG CACGAGTTGA CGCTCTCGCT GCAGTTCGTT
CCTTATGGCG ACAAGCAGAA ACTGCTCCTC AGCCGTGATA TAACCAAGCT TGAAAGAATC
CAGACCATGC GCCGCGATTT TGTCGCCAAC GTCTCGCATG AACTGCGCAC GCCTTTGACC
GTCATCGGTG GGTTTCTCGA AACGCTGTCG GACGATAATC AACCCGATCC CGATACCCGC
AAATGGGCAT TGGAATTGAT GAGCGAGCAG ACGCGACGCA TGCAAAGCCT GGTGGAGGAT
CTGTTGACTC TGTCGAGGCT CGAGAATACG GAAAACCAGG TGCGCGAGGA GCACGTCAAC
ATACCGGAAA TGTTGCGAAC GCTGTATGAG GAAGCGAAAT CCCTGAGTGG AGGACACCAT
CGCATCACGC TCGAGCTCGA TACAACCACA AAACTCCTGG GCAACCTGTT TGACCTGCGC
AGCGCCTTCA TCAATCTGAT CAGCAATGCC ATACGCTATA CTCCTGATGG AGGGAATATA
ACGCTGCGCT GGGCAATTCA GGACGGGAAA GGTGTATTTT CGGTGCAGGA CACGGGGATA
GGGATTGAAC CCGAACATAT CTCGCGCCTG ACGGAGCGTT TTTATCGCGT CGACCGTGGC
CGATCGCGTG AAACCGGAGG CACCGGCCTC GGTCTTGCCA TTGTCAAGCA CGTGCTCAGC
GGTCACCAGG CGAAGCTGGA AATTACCAGC GAACCGGGCA AAGGCAGTCG CTTCAGCGCA
GTGTTTCCGG CTACCCGGCT TCTGATACAG CAAGGTGAGT AA
 
Protein sequence
MSGFWGHFAN LVLIAVVSGV LMTVSGAAAA LTFYVVVLFL LVLRHSHNLK RLDRWLRSEE 
AVLPDSSGRW GDVFARLARL MRDQKQTHQN LSSALERLRS ATSAMPEGVV ILDEMDRIEW
CNPVAEKHLG INASLDTGQH ITHLMRQTQF AEYLAARNYK EPLVIKQPRQ HELTLSLQFV
PYGDKQKLLL SRDITKLERI QTMRRDFVAN VSHELRTPLT VIGGFLETLS DDNQPDPDTR
KWALELMSEQ TRRMQSLVED LLTLSRLENT ENQVREEHVN IPEMLRTLYE EAKSLSGGHH
RITLELDTTT KLLGNLFDLR SAFINLISNA IRYTPDGGNI TLRWAIQDGK GVFSVQDTGI
GIEPEHISRL TERFYRVDRG RSRETGGTGL GLAIVKHVLS GHQAKLEITS EPGKGSRFSA
VFPATRLLIQ QGE