Gene Nmul_A2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2236 
Symbol 
ID3784937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2539020 
End bp2540294 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content53% 
IMG OID637812324 
Producthypothetical protein 
Protein accessionYP_412920 
Protein GI82703354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACA TCAGCAAAAT CCGCTCGGGG ATATACCGGA AATACTTCAG CACTATGCTG 
GCATTAAGTT TTCTGTTTTT GCTGCCCGAG GCATTCGCGA TGAATGGTCA TATTGCGGAT
GGTCCCAAGT GGGTTATCGA TGACACCAAG TGGTTTACTC TCGGCATCGG TTTCCGTGGT
TCGGGCGTAT GGGTCGAAAA CAGGGATACC GGTAATTTCC AGAGCGGTTT CAGCATCGAC
AATGCCCGTG TCTATCTCAA CGGACAGATC CACAAGTATG TCAAATTCGA AAGCTACACC
GAATGTACTT TCTGCAATAA CACCCATCCC GAGGATACCC CCAGGATGTC CTACAACGTT
CTGGCCGCAA TCGGAAAGGT CGAGATCAAC CGCTTTGTCA ATTTCTGGGG TGGGCGCATG
CTGGTGCCCA CGGAGCGGGG CGAATTGAGT GCCCCTTTTT ATCACGCGAC ACACGATGCC
ATCAAAACGC CGTTCTTCCC CCAAGGATTC AGTACTAAAT TCGGCAGCCT CGGCGCAGGC
CGGTATGGAC ATGATGACGG TGGGACCTTC TGGGGGAGCG TCGAGCCCGG CTTCATCAAA
GGCACCTTGG GCTACGCGCT CGGCGTGTAC AGGGGCTTGC AGTCATCCAC GGCAGCGCGC
ATGGGACCCA ATCAGGGGGA TAGTGTGGCA TGGGCCGGGC GTCTTACCTA CAATTTCCTG
AACCCCGAGC CGAATCCGGG TTATTACACC CGTAATACCT ACTTCGGCCA GGCTGGCGAC
ATTCTGGCGC TCGCGGCCGG TACTTCATAT CAAAAGGATG GTGCCGGATC GTTTGCGCAT
CCCAGCGATT TCCTGGGTCT CGTCGGCGAT GTCCTGTTTG AAAAGGTCCT GCCAAAAAAT
ATGGGTGTAG TTACCGTCAA CGGTGATTAC AAGCAATTCT ATGCCAATTA CTCGCCGCTG
GCCTTTGCCG ATCCGGACTG CTTCTGCATA TTCGACGGAA AATCATGGGG TGTCACCGGG
CTCTACCTGC TTCCCGTCAA GGTAGGGATC GGGCAATTTC AGCCTTATGG GAGATTTACC
AGAGTTCAGC CTGACAACAG CAGCAAACGG GAAGAAATCG AGGCTGGGGT GAATTATGTC
ATCAGCGGCT TCAACGCCCG TATTTCAGCG TACTACCAGC ACGGTGATCT TCGCACCAAA
GGCATCAACT ATGCGCCGGA TGTAACAGGT GACAAGGTCG ATGTTTTTAA ACTGGCATTC
CAGCTGCAAA TGTGA
 
Protein sequence
MKYISKIRSG IYRKYFSTML ALSFLFLLPE AFAMNGHIAD GPKWVIDDTK WFTLGIGFRG 
SGVWVENRDT GNFQSGFSID NARVYLNGQI HKYVKFESYT ECTFCNNTHP EDTPRMSYNV
LAAIGKVEIN RFVNFWGGRM LVPTERGELS APFYHATHDA IKTPFFPQGF STKFGSLGAG
RYGHDDGGTF WGSVEPGFIK GTLGYALGVY RGLQSSTAAR MGPNQGDSVA WAGRLTYNFL
NPEPNPGYYT RNTYFGQAGD ILALAAGTSY QKDGAGSFAH PSDFLGLVGD VLFEKVLPKN
MGVVTVNGDY KQFYANYSPL AFADPDCFCI FDGKSWGVTG LYLLPVKVGI GQFQPYGRFT
RVQPDNSSKR EEIEAGVNYV ISGFNARISA YYQHGDLRTK GINYAPDVTG DKVDVFKLAF
QLQM