Gene Nmul_A1287 details


Gene Information

Locus tag: Nmul_A1287
Symbol:
ID: 3784324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1479126
End bp: 1480838
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 54%
IMG OID: 637811374
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_411982
Protein GI: 82702416
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
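
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 1479126
END_BP = 1480838
GENE_LENGTH_BP = 1713
PROTEIN_LENGTH_AA = 570


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 1480838 - 1479126 + 1 = 1713 bp, and 1713 / 3 - 1 = 570 aa.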


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAAC CCCTGATTCG CGTTCTACTC GTCGAAGACG ATGCCGTGGA TCGTATGGCG 
TGTCGACGTG CTCTTGCCCG GGACCCCGAC TACGAATTCG TGCTTTCCGA AGCCGAATCA
GGCCGGGAGG GATTGCAACT TGCCCATGAG CAGAAGCCGG ACTGCGTCCT GCTTGATTAC
CATTTGCCGG ATGTGAATGG TCTTGAGTTT CTGGCAGCGC TGACCGACGA AAGTGGCGAT
GTTTCCATTC CGGTCATGAT GCTGACAGGT ACGGACAATG CCGCCATTGC GGTAGAAGCT
ATGAAGCGCG GGGCGAAGGA TTACCTGATC AAGGACCTCG AGCACCAATA CCTTGAATTA
TTGCCTGCCG TTATCCAGCG CGTACTGAGC GACCGCCGGA TTCGGCTGGA AAAAAAGCAG
GCGGAAGAGA AACTCGCCCA GGCGGAAGCC AAATATCGCT CTCTGGTCGA AACCATTCCG
GCAATCGTCT ACATTGCTGC ACTGGATGGG AGCAACCGTT TCCTTTACGT CAGCTCGCGT
ATCAATATGC TTGGTTTTTC AGCGGAACAG TGGTTGAACG ATCCGACTAT CCTGCTTGCA
CAGATTCATC CGGACGACCG GTCGCACGCA CTCGAGGAGC GGGCAAAAAG CCGCGCAACG
TGCGCCCCTC TGCGCTGTGA GTACCGTTTG CTTACTCAGG ATGAAAGAGT CTTGTGGTTC
CGCGACGAGG CGAATGTTGT GCGGGATGAA TCGGGACGCT CACTGTTTCT GCAGGGTATC
CTGGTTGATA TCACGGAAAG CAAACAGGCA GAAGAGGAGT TGAAGCAGCA TCGTTTTCGT
CTCGAAGAGC TCGTGGCCAA GCGCACCGAT GAACTGGCGC GGGTGAACGA AGAGTTGCGG
CGCGATATCA CCCACCGCAA GCTGATCGAA GAAGAACTGA TCAAAGCCAA GACAGAGGCG
GAAAAGGCCA ATCTGGCCAA ATCGGATTTT CTTTCCAGCA TGAGCCACGA ATTACGCTCA
CCCCTGAATG TCATGCTTGG TTTCGCCCAA TTGATGGAAT CGAGTTCTCC GCCGCCAACA
TTCACGCAAA TGACCATGCT CAAGGAGATC AATTCTGCAG GATGGTATCT GCTTGAATTG
ATAAATAAAA TCCTCGATCT CGCCGCAATC GAGTCCGGCA AACTGATTGT TGCGCAGGAA
CCGATGTGCA TCGGAGAGGT GATGGCCGAG TCCCGGACCA TGGTCGAACC GCAGGCCCAG
GAGCAAAACA TACGGCTGAT CTTTCCGCCA TCATCGGACA TGAACTTCTT CGTTCGGGCA
GACCGAACGC GGGTGAAGCA GGTTCTGATC AATCTTCTTT CCAACGCAAT CAAATACAAC
CGCGAGGGTG GCATGGTCGA AGTGAGTTGC TCCATGACCA AACCCGGGCG TCTGCGAATA
AGTGTGCGCG ATACGGGGAC GGGGCTGCCG CCGGAAAAAC TGACGCAGCT TTTTCAGCAA
TTCAACCGGC TTGGACAGGA AGCAGGGTCT GTGGAAGGCA CAGGCATTGG ACTGGTAGTG
ACGAAACAAC TGGTTGAACT GATGGGCGGA TCAATCGGTG TGGAAAGCAC TGTCGATGTC
GGCAGTGTGT TCTGGTTTGA ACTCATGGTT GATGAACCTT CCCAGGCTGC TTCCGATATA
GATAAAAACG AGGAGTTCAG TACTGAAGGT TAA
 
Protein sequence
MTKPLIRVLL VEDDAVDRMA CRRALARDPD YEFVLSEAES GREGLQLAHE QKPDCVLLDY 
HLPDVNGLEF LAALTDESGD VSIPVMMLTG TDNAAIAVEA MKRGAKDYLI KDLEHQYLEL
LPAVIQRVLS DRRIRLEKKQ AEEKLAQAEA KYRSLVETIP AIVYIAALDG SNRFLYVSSR
INMLGFSAEQ WLNDPTILLA QIHPDDRSHA LEERAKSRAT CAPLRCEYRL LTQDERVLWF
RDEANVVRDE SGRSLFLQGI LVDITESKQA EEELKQHRFR LEELVAKRTD ELARVNEELR
RDITHRKLIE EELIKAKTEA EKANLAKSDF LSSMSHELRS PLNVMLGFAQ LMESSSPPPT
FTQMTMLKEI NSAGWYLLEL INKILDLAAI ESGKLIVAQE PMCIGEVMAE SRTMVEPQAQ
EQNIRLIFPP SSDMNFFVRA DRTRVKQVLI NLLSNAIKYN REGGMVEVSC SMTKPGRLRI
SVRDTGTGLP PEKLTQLFQQ FNRLGQEAGS VEGTGIGLVV TKQLVELMGG SIGVESTVDV
GSVFWFELMV DEPSQAASDI DKNEEFSTEG
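
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how the two relate, the code below strips the block spacing, computes GC content, and translates the DNA up to the first stop codon. It assumes the codon assignments of NCBI translation table 11, which match the standard table for elongation codons; alternative start codons are not handled, which is adequate here because the gene begins with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (codon assignments shared by
# NCBI translation tables 1 and 11); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGACCAAAC CCCTGATTCG CGTTCTACTC"))  # MTKPLIRVLL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence shown above, and `gc_percent` can be compared against the GC content reported in the gene record.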