Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1836 |
Symbol | |
ID | 3785945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2118087 |
End bp | 2119229 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637811923 |
Product | integrase catalytic subunit |
Protein accession | YP_412525 |
Protein GI | 82702959 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.100026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAGCG TATATGGTGG CATGACTGAT GAACGCAAAG CGTGTATTTG GCGGTTATGG CAGCAAGGGG TTGCTATGAG TGTAATTGCT AGAGATATTG CAAAGCCGCC TGCGACGGTA TATTCGTATC TTCTCTACCA TGGAGGCATA AAGCCGAGGC AACGATCTCG TCGATCTGGT TGCCTGTCGC TGAAGGAACG TGAAATGATT TCTCGTGGAT TGGCTAGTTG CAAAAGCCTG CGCAGGATTA GCCAGGAACT TGGTCGGGCT GCCTCTACGA TATCAAGAGA AATTGCCCGC AATGGCGGAC CTGAAAAATA TCGGGCATGC CATGCCGAGA AAGCTTTTCT CAAGCGCAGT CGACGCCCCA AGCCCACATT GCTTTCCCAG GATGAGGAGC TAAGAGGCGT GGTAACAGCA CTGCTGGAGG CTGATTGGTC GCCAGAACAG ATAACCGGAT GGCTCAAGCG ACACTCTTCT GACGGAAAAG CGATGTGTGT ATCGCATGAG ACGATCTACA AATCCCTGTT CATTCAAACT CGTGGCGTAC TACGCCAGGA ACTGAAGAAG CACTTGCGCA CCAAAAGAAT GTTTCGTCAC GCCAAGTCCC ACCGGGTTGC AGGCAGAGGA CACATTACCG ATGCGATTTC TATTCGAGAG CGCCCTGCAC AGGTGGAAGA CAGGGCCCTG CCGGGGCATT GGGAAGGAGA CCTGCTTATA GGCTCGAGTA ATAGTGGCAT TGCTACGATG GTCGAGAGAT ACTCCAGATT CACCGTGCTT TGCAAAGTGC AGGACAAGCG CGCTGAAAGT GTTGTTCAGT CCTTGATAAC CCAGATGCGC ATGCTTCCTG AGCAACTGCG TAAGAGCCTG ACATGGGATA GAGGCCAGGA ACTTGCCGCA CACAAGCGAT TTACCATGGC CACCAATATG GCCGTCTATT TCTGCGATCC GAGCAGCCCA TGGCAAAGGG GAACCAATGA GAATACCAAT GGCCTGCTAA GACAATACTT TCCAAAAGGA ACGAGTTTGG CGCCATACAC ACAGTGTCAA CTGAATGAGG TCGCCGAAAA ACCAAACTCT CGCCCGAGGA AAACCTTGGA TTTTAGAACA CCCGCCCAAG TACTGAATGA AGCGTTGCAC TGA
|
Protein sequence | MASVYGGMTD ERKACIWRLW QQGVAMSVIA RDIAKPPATV YSYLLYHGGI KPRQRSRRSG CLSLKEREMI SRGLASCKSL RRISQELGRA ASTISREIAR NGGPEKYRAC HAEKAFLKRS RRPKPTLLSQ DEELRGVVTA LLEADWSPEQ ITGWLKRHSS DGKAMCVSHE TIYKSLFIQT RGVLRQELKK HLRTKRMFRH AKSHRVAGRG HITDAISIRE RPAQVEDRAL PGHWEGDLLI GSSNSGIATM VERYSRFTVL CKVQDKRAES VVQSLITQMR MLPEQLRKSL TWDRGQELAA HKRFTMATNM AVYFCDPSSP WQRGTNENTN GLLRQYFPKG TSLAPYTQCQ LNEVAEKPNS RPRKTLDFRT PAQVLNEALH
|
| |