Gene Nmul_A1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1836 
Symbol 
ID3785945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2118087 
End bp2119229 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content52% 
IMG OID637811923 
Productintegrase catalytic subunit 
Protein accessionYP_412525 
Protein GI82702959 
COG category[L] Replication, recombination and repair 
COG ID[COG2826] Transposase and inactivated derivatives, IS30 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAGCG TATATGGTGG CATGACTGAT GAACGCAAAG CGTGTATTTG GCGGTTATGG 
CAGCAAGGGG TTGCTATGAG TGTAATTGCT AGAGATATTG CAAAGCCGCC TGCGACGGTA
TATTCGTATC TTCTCTACCA TGGAGGCATA AAGCCGAGGC AACGATCTCG TCGATCTGGT
TGCCTGTCGC TGAAGGAACG TGAAATGATT TCTCGTGGAT TGGCTAGTTG CAAAAGCCTG
CGCAGGATTA GCCAGGAACT TGGTCGGGCT GCCTCTACGA TATCAAGAGA AATTGCCCGC
AATGGCGGAC CTGAAAAATA TCGGGCATGC CATGCCGAGA AAGCTTTTCT CAAGCGCAGT
CGACGCCCCA AGCCCACATT GCTTTCCCAG GATGAGGAGC TAAGAGGCGT GGTAACAGCA
CTGCTGGAGG CTGATTGGTC GCCAGAACAG ATAACCGGAT GGCTCAAGCG ACACTCTTCT
GACGGAAAAG CGATGTGTGT ATCGCATGAG ACGATCTACA AATCCCTGTT CATTCAAACT
CGTGGCGTAC TACGCCAGGA ACTGAAGAAG CACTTGCGCA CCAAAAGAAT GTTTCGTCAC
GCCAAGTCCC ACCGGGTTGC AGGCAGAGGA CACATTACCG ATGCGATTTC TATTCGAGAG
CGCCCTGCAC AGGTGGAAGA CAGGGCCCTG CCGGGGCATT GGGAAGGAGA CCTGCTTATA
GGCTCGAGTA ATAGTGGCAT TGCTACGATG GTCGAGAGAT ACTCCAGATT CACCGTGCTT
TGCAAAGTGC AGGACAAGCG CGCTGAAAGT GTTGTTCAGT CCTTGATAAC CCAGATGCGC
ATGCTTCCTG AGCAACTGCG TAAGAGCCTG ACATGGGATA GAGGCCAGGA ACTTGCCGCA
CACAAGCGAT TTACCATGGC CACCAATATG GCCGTCTATT TCTGCGATCC GAGCAGCCCA
TGGCAAAGGG GAACCAATGA GAATACCAAT GGCCTGCTAA GACAATACTT TCCAAAAGGA
ACGAGTTTGG CGCCATACAC ACAGTGTCAA CTGAATGAGG TCGCCGAAAA ACCAAACTCT
CGCCCGAGGA AAACCTTGGA TTTTAGAACA CCCGCCCAAG TACTGAATGA AGCGTTGCAC
TGA
 
Protein sequence
MASVYGGMTD ERKACIWRLW QQGVAMSVIA RDIAKPPATV YSYLLYHGGI KPRQRSRRSG 
CLSLKEREMI SRGLASCKSL RRISQELGRA ASTISREIAR NGGPEKYRAC HAEKAFLKRS
RRPKPTLLSQ DEELRGVVTA LLEADWSPEQ ITGWLKRHSS DGKAMCVSHE TIYKSLFIQT
RGVLRQELKK HLRTKRMFRH AKSHRVAGRG HITDAISIRE RPAQVEDRAL PGHWEGDLLI
GSSNSGIATM VERYSRFTVL CKVQDKRAES VVQSLITQMR MLPEQLRKSL TWDRGQELAA
HKRFTMATNM AVYFCDPSSP WQRGTNENTN GLLRQYFPKG TSLAPYTQCQ LNEVAEKPNS
RPRKTLDFRT PAQVLNEALH