Gene Nmul_A1585 details

Gene Information

Locus tag: Nmul_A1585
Symbol:
ID: 3784465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1817278
End bp: 1819650
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 59%
IMG OID: 637811671
Product: RNA binding S1
Protein accession: YP_412278
Protein GI: 82702712
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
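
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1817278
end_bp = 1819650
gene_length_bp = 2373
protein_length_aa = 790

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is assumed to be the stop.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, codons, coding_codons)  # 2373 791 790
```

Under these assumptions the span equals the listed gene length, and 791 codons minus one stop codon gives the listed 790 aa.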


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCCAT CCATTGAACA ACGTCTTTCC CTTGAACTCG GCGCGAAGCC TGCACAGGTT 
AACGCGGCCA TTGCCTTGCT CGATGAAGGT GCCACCGTGC CCTTTATTGC ACGTTACCGC
AAGGAGGTGA CTGGCGGGCT GGACGATGCG CAGTTGCGCC TGCTGGAAGA ACGGTTGCGT
TATCTGCGTG AACTGGAAGA CAGGCGCGCC GCGATTATCG CCTCGATAGA AGAGCAGGGG
AAAATGACGC CCGCGCTGCT TGCTTCCATC CTGCAGGCCG AGGATAAGAC ACGGCTGGAA
GATCTGTATC TCCCCTTCAA GAAAAAGCGG CGCACCAAGG CGCAGATCGC GCTCGAAGCG
GGGCTGGAGC CACTGGCAGA TGCGCTGCTT GCCGATCCGA CGCTACAACC CGAGGAAGAG
GCCATCAAGT ACTTGAAGCC GCCCTTCGCT ACCGAGCAGG GGGATAATCC CGGGGTACCG
GATGTGAAAG CTGCGCTCGA GGGAGCACGC CAGATACTGA TGGAGCGTTT CGCGGAGGAT
GCCGAGTTGC TTCAGTGGCT GCGCGAGTAC CTGCTGGACC ATGGGGTGGT GGAGTCGAAA
GTCGCGAGCG ACAAGAATGG CGGTAAGGGT AAGGAAGAGG AGGGCGCCAA ATATTCCGAT
TATTTCGATT ACTCCGAACC GCTCAGCGCT ATTCCTTCGC ACCGGGCGCT GGCGCTTTTT
CGGGGGCGAC GTGAAGAAAT TTTACGCGTT GCCCTGCGTC TGGATTCGGA GGCGGAGAAA
CCGAAGTGGG ATGCACCGCA TAACCCGTGC GAGGCGCGCA TTGCTGTCCG GTTCGGTATT
GCGGACAAAG GGCGGCCTGC CGATGCGTGG CTGATGGACA CGGTGCGCTG GACCTGGCGG
GTGAAGAGTT TTCCGCATCT GGAACTCGAT CTTATGGGTT CGTTGCGCAC ACGCGCCGAG
ACCGAAGCGA TCCAGGTCTT TGCGCGCAAC CTGAAAGCCC TGCTCATGGC CGCTCCCGCC
GGGCCTCGCG TGACAATAGG TCTAGACCCC GGCTTGCGCA CAGGGGTGAA AGTCGCAGTA
GTGGATGCGA CAGGGCGGGT CATGGAAACG GCCACCATTT ATCCACATCA ACCAAGGAAT
GATTGGGAGG GGTCCCTTCA TGTTCTCGGC ACGCTTGCGG AAAAATTTCG GGTATCGCTG
ATAGCCATAG GCAATGGCAC CGCTTCGCGC GAGACCGACA AGCTGGCAAA AGACCTGATC
AAGCGCCGGC CCGATCTCAA GCTCACCTCT ATCGTGGTTT CGGAAGCGGG GGCTTCGGTT
TATTCCGCCT CCGATCTGGC CTCCAGAGAG TTCCCCGATA TGGATGTGTC GCTGAGAGGA
GCGGTTTCCA TTGCGCGGCG CCTGCAGGAC CCTCTGGCGG AGCTGGTCAA AGTCGATCCG
AAATCGATTG GCGTAGGCCA ATACCAGCAT GATGTCGGGC AAACCCAGCT CGCGCGCTCG
CTCGATGCCG TGGTCGAAGA CTGCGTCAAT GCGGTAGGCG TGGACGTCAA TACGGCCTCC
GCGCCGCTGC TCGAACGCGT TTCGGGGCTT AACCCGGCTG TCGCACAAAG CATCGTTGTC
TATCGCGAGG AAAACGGGAT GTTTGCCTCG CGTGAAGCTC TGCACCAGGT GCCGCGCCTG
GGTGAGAAAA CCTTCGAGCA GGCGGCAGGC TTCTTGCGGG TGATGCATGG CGAGAACCCG
CTTGATGCGT CGGCAGTGCA TCCCGAGTCG TATCCCGTCG TGCAAAGAAT CCTTTCCGAC
TTGAAGCAGG AAATCAGGTC GATCATCGGC AATAACAAAT TATTGAAGTC GCTCAATCCG
GCGAGGTATG CGGATGATCG ATTCGGCGTG CCGACCGTCA CCGACATCGT GAAGGAGCTG
GAAAAGCCGG GCCGTGATCC CCGGCCGGAA TTCATCACTG CCGCATTCAA GGAAGGCGTG
AACGAGATTT CCGATCTGCA GCCGGGTATG CTGCTGGAAG GCGTGGTAAC CAACGTGGCT
GCCTTCGGCG CGTTCGTCGA TATCGGGGTG CATCAGGATG GGCTGGTGCA CATCTCCGCG
CTCGCCGACA AATTCGTCAA AGACCCGCAC ACGGTCGTCA AGGTGGGGCA GGTGGTGAAG
GTCAAAGTGC TGGAAGTCGA TGAAAAGCGT AAGCGCATTG CCCTTACGAT GAGGTTGGCA
GATGCGCCAG CACCACAGAC ACAGGAGGCG CGAGGGGCTG GCAAGCGTGA GCAGCCAAGG
AATAGGAAGG ACCGCTCAGC CAAACCCCAG CAGAAACAGG ATTCCAGGGC TGATACGGCG
ATGGCAGCAG CGTTTGCGAG ATTGAAGGGT TGA
 
Protein sequence
MLPSIEQRLS LELGAKPAQV NAAIALLDEG ATVPFIARYR KEVTGGLDDA QLRLLEERLR 
YLRELEDRRA AIIASIEEQG KMTPALLASI LQAEDKTRLE DLYLPFKKKR RTKAQIALEA
GLEPLADALL ADPTLQPEEE AIKYLKPPFA TEQGDNPGVP DVKAALEGAR QILMERFAED
AELLQWLREY LLDHGVVESK VASDKNGGKG KEEEGAKYSD YFDYSEPLSA IPSHRALALF
RGRREEILRV ALRLDSEAEK PKWDAPHNPC EARIAVRFGI ADKGRPADAW LMDTVRWTWR
VKSFPHLELD LMGSLRTRAE TEAIQVFARN LKALLMAAPA GPRVTIGLDP GLRTGVKVAV
VDATGRVMET ATIYPHQPRN DWEGSLHVLG TLAEKFRVSL IAIGNGTASR ETDKLAKDLI
KRRPDLKLTS IVVSEAGASV YSASDLASRE FPDMDVSLRG AVSIARRLQD PLAELVKVDP
KSIGVGQYQH DVGQTQLARS LDAVVEDCVN AVGVDVNTAS APLLERVSGL NPAVAQSIVV
YREENGMFAS REALHQVPRL GEKTFEQAAG FLRVMHGENP LDASAVHPES YPVVQRILSD
LKQEIRSIIG NNKLLKSLNP ARYADDRFGV PTVTDIVKEL EKPGRDPRPE FITAAFKEGV
NEISDLQPGM LLEGVVTNVA AFGAFVDIGV HQDGLVHISA LADKFVKDPH TVVKVGQVVK
VKVLEVDEKR KRIALTMRLA DAPAPQTQEA RGAGKREQPR NRKDRSAKPQ QKQDSRADTA
MAAAFARLKG
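
As a spot check that the protein sequence corresponds to the gene sequence, the first and last codons can be translated by hand. The sketch below builds a codon table from the standard NCBI ordering of amino acids; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper `translate` and the two excerpts are illustrative, with the excerpts copied from the first and last lines of the gene sequence above.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    nt = nt.replace(" ", "").upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 33 nt of the gene sequence, as printed in the record.
first_nt = "ATGCTGCCAT CCATTGAACA ACGTCTTTCC"
last_nt = "ATGGCAGCAG CGTTTGCGAG ATTGAAGGGT TGA"

print(translate(first_nt))  # MLPSIEQRLS
print(translate(last_nt))   # MAAAFARLKG*
```

Both results agree with the start and end of the listed protein sequence, with the trailing TGA read as the stop codon.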