Gene Nmul_A1692 details


Gene Information

Locus tag: Nmul_A1692
Symbol:
ID: 3784618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1932053
End bp: 1933195
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 51%
IMG OID: 637811778
Product: integrase catalytic subunit
Protein accession: YP_412382
Protein GI: 82702816
COG category: [L] Replication, recombination and repair
COG ID: [COG2826] Transposase and inactivated derivatives, IS30 family
TIGRFAM ID:
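
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 1932053
end_bp = 1933195
gene_length_bp = 1143
protein_length_aa = 380


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```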


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.204308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAGCG TATATGGTGG CATGACTGAT GAACGCAAAG CGTGTATTTG GCGGTTATGG 
CAGCAAGGGG TTGCTATGAG TGTAATTGCT AGAGATATTG CAAAGCCGCC TGCGACGGTA
TATTCGTATC TTCTCTACCA TGGAGGCATA AAGCCGAGGC AACGATCTCG TCGATCTGGT
TGTCTGTCGC TGGAGGAACG TGAAATGATT TCTCGTGGAT TGGCTAGTTG CAAAAGCCTG
CGCAGGATTA GCCAGGAACT TGGTCGGGCT GCCTCTACGA CATCAAGAGA AATTGCCCGC
AATGGCGGAC CTGAAAAATA TCGGGCATGC CATGCCGAGA AAGCTTTTCT CAAGCGCAGT
CGACGCCCCA AGCCTACATT GCTTTCCCAG GATGAGGAGC TAAGAGGCGT GGTAACAGGA
CTGCTGGAGG CTGATTGGTC GCCAGAACAG ATAACCGGAT GGCTCAAGCG ACACTCTTCT
GACGGAAAAG CGATGTGTGT ATCGCATGAG ACGATCTACA AATCCCTGTT CATTCAAACT
CGTGGCGTAC TACGCCAGGA ACTGAAGAAG CACTTGCGCA CCAAAAGAAT GTTTCGTCAC
GCCAAGTCCC ACCGGGTTGC AGGCAGAGGA CACATTACCG ATGCGATTTC TATTCGAGAA
CGCCCTGCAC AGGTGGAAGA CAGGGCCCTG CCTGGGCATT GGGAAGGAGA CCTGCTTATA
GGCTCGAGTA ATAGTGGCAT TGCTACGATG GTCGAGAGAT ACTCCAGATT CACCGTGCTT
TGCAAAGTGC AGGACAAGCG CGCTGAAAGT GTTGTTCAGT CCTTGATAAC CCAGATGCGC
ATGCTTCCTG AGCAACTGCG CAAGAGCCTG ACATGGGATA GAGGCCAGGA ACTTGCCGCA
CACAAGCGCT TTACCATGGC CACCAATATG GCCGTCTATT TCTGCGATCC GAGCAGCCCC
TGGCAAAGGG GAACCAATGA GAATACCAAT GGCCTGCTAA GACAATACTT TCCAAAAGGA
ACGAGTTTGG CGACATACAC GCAGTGTCAA CTGAATGAGG TCGCCGAAAA ACTAAACTCT
CGCCCGAGGA AAACTTTGGA TTTTAGAACA CCCGCCCAAG TACTGAATGA AGCGTTGCAC
TGA
 
Protein sequence
MASVYGGMTD ERKACIWRLW QQGVAMSVIA RDIAKPPATV YSYLLYHGGI KPRQRSRRSG 
CLSLEEREMI SRGLASCKSL RRISQELGRA ASTTSREIAR NGGPEKYRAC HAEKAFLKRS
RRPKPTLLSQ DEELRGVVTG LLEADWSPEQ ITGWLKRHSS DGKAMCVSHE TIYKSLFIQT
RGVLRQELKK HLRTKRMFRH AKSHRVAGRG HITDAISIRE RPAQVEDRAL PGHWEGDLLI
GSSNSGIATM VERYSRFTVL CKVQDKRAES VVQSLITQMR MLPEQLRKSL TWDRGQELAA
HKRFTMATNM AVYFCDPSSP WQRGTNENTN GLLRQYFPKG TSLATYTQCQ LNEVAEKLNS
RPRKTLDFRT PAQVLNEALH
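
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the layout, compute a GC fraction, and translate a gene sequence. It assumes the codon assignments of translation table 11 match the standard genetic code (the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG). All function names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGGCTAGCG TATATGGTGG CATGACTGAT GAACGCAAAG CGTGTATTTG GCGGTTATGG"
)
print(len(first_line), translate(first_line))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` would allow the GC content and protein sequence fields of the record to be compared with computed values.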