Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0039 |
Symbol | |
ID | 3784028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 41919 |
End bp | 43004 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810108 |
Product | hypothetical protein |
Protein accession | YP_410740 |
Protein GI | 82701174 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3569] Topoisomerase IB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.204308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTCT CTCCGAATAA GGAGGAGTTG CGGCCCCAGG CGGTGGAAAT TCTGGCGGCG CAAGCGGCGG CCGCGGGGTT GATCTATGTC TCCGGCGATG AGCCTGGGAT CAGAAGGTTG GCCTCCGGCG AGGGATTTCG TTATGTCGGT CCGGATGAAA AGCCCTTGAG TGATAAGGGC GAGCTTGAGC GCATTGCCCG GCTCGCCATT CCTCCCGCTT ATACCGACGT CTGGATTTGC CCTCATCCTC TGGGCCACTT GCAGGCCACC GGACTGGATG CGCGCAGCCG CAAGCAGTAC CGCTACCATT CCGATTGGCG CGCGGTGCGG GACAGCATCA AATTTGACCG CATGGTGGAA TTCGGCGAAG CTTTACCTCA ACTGCGTGAG CGTGTACACC TTGATCTGAA AAAACGAGGT TTGCCGCAAA AAAAGGTTGT TGCTGCAATA GTGAGACTGC TTGATACGAC GCAAGTGCGC ATCGGTAACC TGTCTTACGC CCGCGATAAC CACAGTTTCG GACTTACAAC CCTGCGCAAG CGGCATCTTG CCTTCATCAA CCCCAGGCGC GCATTATTGA AATTCCGGGG CAAGAGCCGG GTCGAGCACG AAGTCACGAT CGGTGACAGG CGCATCATCA CCATTATCCG GGCTTGCCAA GAGCTGAGGG GCCAGCATCT TTTCCAGTAC CTGGATGAGA GCGGCAAGCG ACGGCCGGTC GGAGCGGAGC AGGTAAACGC CTACCTCCAC GAAGTCATGG AAGCCGAATT TACCGCGAAG GATTTCCGAA CATGGAGCGG AACCTTGCGG GCGTTCGAAA TCATGCTCAA TACGCCGTTG CCTGAATTCC CGACCAAGCG CGCTCTCAAG GCCGGCATCG TTGCGGCGAT ACGGCAAGTC GCGGAAGATC TGCGCAATAC CCCTGCCGTC TGCCGGAAAT CCTACATCAA TCCGGCGGTG TTTTCTGCCT GGCAGAAAGG CGATCTTCAT CGCTGCGTGC AGGAAATGGA AGAAGCGGCG CATGCCGCCC ATGACCCGCT CGAGTCTGCC GTGCTGTCTT TTCTGCGGAA ACAAAAGACT GGATAA
|
Protein sequence | MSFSPNKEEL RPQAVEILAA QAAAAGLIYV SGDEPGIRRL ASGEGFRYVG PDEKPLSDKG ELERIARLAI PPAYTDVWIC PHPLGHLQAT GLDARSRKQY RYHSDWRAVR DSIKFDRMVE FGEALPQLRE RVHLDLKKRG LPQKKVVAAI VRLLDTTQVR IGNLSYARDN HSFGLTTLRK RHLAFINPRR ALLKFRGKSR VEHEVTIGDR RIITIIRACQ ELRGQHLFQY LDESGKRRPV GAEQVNAYLH EVMEAEFTAK DFRTWSGTLR AFEIMLNTPL PEFPTKRALK AGIVAAIRQV AEDLRNTPAV CRKSYINPAV FSAWQKGDLH RCVQEMEEAA HAAHDPLESA VLSFLRKQKT G
|
| |