Gene Nmul_A2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2015 
Symbol 
ID3784565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2313198 
End bp2314814 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content52% 
IMG OID637812104 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_412702 
Protein GI82703136 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGT CCTCCCTTTC TATCGAGTCA TATGCTGAAT CAGAACGCTT TCAGCTATTC 
GTTGCCAGCG TAACTGACTA CGCGCTTTAC ATGCTCAACC CTGAAGGCCG CGTCTGCAGT
TGGAATGCTG GTGCGCAGAG GTTTAAAGGC TATACGGCTG AAGAAATCAT AGGTCAGCAC
TTCTCCCGGT TTTATACAGA GGAAGACAGG GCGGCCAATA TTCCATTCAA GGCCTTGCAA
ACGGCAGCCA AAGAGGGAAG ATTCGAAGAT GAAGGCTGGC GCGTGCGCAA GGATGGCAAT
CGGTTCTGGG CCAGCATCGT CATTGATCCA ATTCGGGATC CCGAGGGCAT GCTGATTGGT
TTCGCTAAAA TCACCCGTGA TATTACCGCG CGCAAGAAGG CAACCGAGGC TCTGCACGCC
AGTGAGGAGC AATTCCGGTT ACTGGTCGAG GGCGTGACGG ACTACGCGCT CTACATGCTG
TCGGTAGATG GCACCATTAC CAATTGGAAT CCAGGGGCAC GTCGGATTAC TGGCTTCGAT
CAAACTGAAG CCGTTGGCAC TCATTTCTCC CGCTTTTATA TACAGGAAGA CAAGGCCAAG
GACTTGCCAT TGGTGGCATT ACAGACTGCG GAAGCGGATG GTCGCTTTGA AGGCGAAGGC
TGGCGGGTAC GGAAAGACGG CTCCAGGTTT TGGGCAAATG TAGTGATAGA CCCAATTAGA
AACGCTCTTG GCGAATTGAT TGGTTTTGCA AAAATCACGC GCGATATCAC GGGAAAGCGA
GAGGCCGAGC AGGCGCTGGA GCGTGCCAAA GAAGCCTTGT TCCAGTCCCA GAAGCTGGAA
GCGATCGGCA AATTGACGGG CGGGATTGCT CACGACTTCA ATAACCTGCT TAACGTCATT
GTCAACGGGA TTGAAATCAT TGCAAAGCAA GCACAGACGC CAACTTCCAC CCGGATGCTC
GAAAGCATGC AGCGCGCAGC CGCTCAGGGG ACGATGTTAA CGCAACAATT GTTGACGTTT
GCTCGCAAGC AACCCTTAAA GCAAGATAAG TACAATTTGA ATCACGTCAT ACGTTCTTTT
GAACCGGTAC TTCGCAGAGC CAATAAAGGT TCTGTTGAGT TTGATGTGAA ACTTGATCCG
CTTTTACCAC CGGTAATCAT CGATGCGCCG CACTTCGAGG CGGCATTATT AAACCTGGTT
ATCAATGCGC GTGACGCTAC GCCCGATGGG GGCGCTATTA CGTTGAGTAC TGAACAGCTC
GAACTGGATG AAAAGGAAAT CAATGAGCTG CCAGCAGGAC GCTATGTGAA AGTTACTGTG
AAAGATACCG GTACAGGGAT GCTGCCGGAA GTAGCGGCCC AAGCCGTGGA GCCGTTCTTT
ACCACTAAGG AGGTTGGCAA AGGGTCAGGG CTGGGGCTGA GTCAGGTGTA CGGGACGATC
AAGCAATTCG GCGGGGATAT GGTAATTGAA ACGGCGGTGG GCAAAGGCAC TGCTATTTCC
CTATTCGTGC CAGCGCTGGA AGGGGATACG AATGAAGGTT CCGGGGGCCT GGCAAGCGGA
AATGAGAAGG CATTGGTGGT GGATGATCAG GCGGATCTTC TGGAGATCAC CACGTAA
 
Protein sequence
MTLSSLSIES YAESERFQLF VASVTDYALY MLNPEGRVCS WNAGAQRFKG YTAEEIIGQH 
FSRFYTEEDR AANIPFKALQ TAAKEGRFED EGWRVRKDGN RFWASIVIDP IRDPEGMLIG
FAKITRDITA RKKATEALHA SEEQFRLLVE GVTDYALYML SVDGTITNWN PGARRITGFD
QTEAVGTHFS RFYIQEDKAK DLPLVALQTA EADGRFEGEG WRVRKDGSRF WANVVIDPIR
NALGELIGFA KITRDITGKR EAEQALERAK EALFQSQKLE AIGKLTGGIA HDFNNLLNVI
VNGIEIIAKQ AQTPTSTRML ESMQRAAAQG TMLTQQLLTF ARKQPLKQDK YNLNHVIRSF
EPVLRRANKG SVEFDVKLDP LLPPVIIDAP HFEAALLNLV INARDATPDG GAITLSTEQL
ELDEKEINEL PAGRYVKVTV KDTGTGMLPE VAAQAVEPFF TTKEVGKGSG LGLSQVYGTI
KQFGGDMVIE TAVGKGTAIS LFVPALEGDT NEGSGGLASG NEKALVVDDQ ADLLEITT