Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0722 |
Symbol | |
ID | 3786068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 837372 |
End bp | 840164 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810804 |
Product | hypothetical protein |
Protein accession | YP_411421 |
Protein GI | 82701855 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3857] ATP-dependent nuclease, subunit B |
TIGRFAM ID | [TIGR03623] probable DNA repair protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.851206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCCA TTCCTCATTC TCTACAAATT CCTTTCCCCG AAGTTGTTGT AAACCGTATC GCCGGATTGA AAGCCGGCGA TACGGTGGTG ACTCCCAACA CGCGGCTCGC CATCACCCTC AGGCGCGAGT TCGATCGAAT CCAGGCAGCG CAAGGAATTT CTGCCTGGGC TGCAGCCGAC ATTCTGCCCG TTTCCGCATT TATCGAGCGC ATCTACGAAG ATGCGCTTTA TTCCACTCAT GCATCTCATC TGCCCCAGCT GCTTACATCT GCACAGGAGC AGTTCTTGTG GGAGGAGGTT GTCGTCCAAT CGGCCGCTCC CCTCTTGCTC TCGGTGGCAG AAAGCGCACG GTTGGCCCGG GAGGCGTGGA GGCTGGCCCA TGAGTGGCAG TTGCTCCCCC GGTTCGGGAA ATTCGTGTTG AACGAGGATT GCAGGGCATT CAGCGCCTGG TCGCATCGCT ATAAAAATGT AACCCGCAAG GCTCGTCAGA TCGACCATGC ACGTCTTTGC ACGCTTGTAA CCGGGTTGTG CGGTTCCCCT GGTATCGCGA TTCCGCAACG CCTGATTTGT TATGGTTTCG ACAGCGTTAC GCCGCAGTGG GCTGCGCTGC TGGCAAAGCT TGAGGAAGCG GGTTGCGAGG TCGATGAGAT GACACGAACC GATCCGGCCG CGCACCGCTT CGAAGTGCCG TACTCCCCAA ACCGGACTGT GCACCGGGTG GAATGTGACG TCAGCAATGA CGAGATTTAT CAGGCCGCGG TGTGGGCGCG GGCCCGAATC GAGAAGGATA GCACGGCTCG CGTCGGCATA GTGGTCCCCC CGCTTTCGGA GTACCGGAGT AGCCTCGTAC GAATCTTTAG CTCGGTGATG GATCCGGATG TCCGTCAAGC CCTTCCCGGC GTACCGCGGC ATATGCCATT CAACATATCC CTGGGCGTTG CGCTGAGTTC GTATCCGCTG GTACATACTG CATTCCTTGT GCTTGAGCTT ATCCAGGGAG AAATCGAGTT CGAGCGTGCA AGCCTCTTGT TACGATCTCC CTTCCTGGCG GGAGCGGAGA CTGAAATGCT TCCCCGCGCA CGGCTGGATG CAAAATTGCG CAAGCGTGCC GAACCTGTGA TTACACTCGA ACGCCTGCTT GTACTGGTAA AGCGTGAAAA CAACGCGAAC TGTCCCCTCC TCGTGAAGCT GTTGTGGGAC TGCGAAAAGT TCTGCAAAAA AATGTCGACA GATGTGCAGC GCCCTTCCAC GCTGGCAAGA GCAATTTCCG ATATCCTGCA GCTCGTCGGA TTTCCAGGTG AAAGAGCTCC TGACTCTTCA GAATATCAGA CGCTCAAAAA ATGGCAGGAC GTCCTCGCGG GTTTAGCCCT GCTTGATAAC GTTACCATCG GCATGAGCTA TCGCCAGGCT ATCTCACATC TGCGCGGGGT GGCTGCCGAT ACCTTGTTTC AGCCGGAGAC GCCGGATGTT CCCATCCAGA TACTCGGGGT CTTTGAAGCT GGCGGAATGA TGTTCGATCA TCTCTGGGTG ATGGGATTAT CGGATGAGGC CTGGCCGCTG CAACCACGCC CGAACCCGTT CCTGCCGATC GAGCTGCAGC GCGCCGAAAG ACTGCCCCTG GGCTCTCCGG CAGCCACTCT CGAACTTGCG TCCCGTTTTA CGGAGGGATG GCTGGCCGCA TCGAAGGAAG TTATTCTGAG TCACGCGCGG CATAGCGGAA GCAGTGATAC ACGTGAGCTG GCGCCAAGTC CGTTGATCGC ACATATTGCT ACCAGCGATA TCAGGCTTTT CGGTCTGCTT CCGGATTATG TGAAACATCG CGATCTGATT CATCAGGGAC GCCGACTTGA GCGAATAAAG GATGAGCCGG CGCCCGGGAT AGTCTCTGGC AAAGGAGGAG AGGGCAGAGC CCGCGGCGGG GTGGCGGTGA TAAAAGACCA GGCCGCCTGT CCATTCCGCG CGTTGGCGCT ACATCGTTTG CATGCGGAAG GTTTGAAGAC GCCTCATCCT GGACTTGATG CGGGAGAGCG CGGCACGCTG GTTCACGAGA TGCTTGCCCA GGTATGGGAG CGGATCAAGA ATAAGAGCGC ACTCGACGCT TTGGCTGAGG ACGAGTTGAG AACGCTGCTG ACCAGTGCCG CGGGGGAAGT CATTGCACGC ATGAGCGCGA GGCATTCACA TACTTTGTCA GAACGTCTTG CCCGGGTCGA GCAGCGGCGG CTGGTAAGAT TGGCCCGAGA ATGGCTCGAA GAAGAGAGGA AGCGAGGCGA TTTCACTGTA GTTTCCACCG AGGCTAGAAA GAGCATCGAA ATCGGCGGGC TTGCATTGGA CACCCGGCTG GATCGTGTGG ATGAACTCGC CGACGGTCGC CGGATTGTCA TCGACTACAA AACACGGGCG CCTTCCGTGA ACGCGATGCT GGGGGAGCGC CCGGACGAGC CTCAACTGCC GCTCTATCTT GTGGCCGCTG AGGTGAACGC TGCAGCCATT GCCTTTGCGC AAGTCAGGGC CGGGGAAATG CGGTACACCG CGCTTGCCCG CGACAACGAT CTTTTACCGG ACGTAAAAGG GTTTGCGCAA TCACGCCATA TTGATCGATA TGACTCGTGG GAAGCGCTCG TCGCAGCGTG GTTTGAGGCT CTTGTAAGCG TCGTTACCGA TTTTTCCAAC GGTCATTCGC AAGTCGACCC CAAGAAATAT CCACAGACCT GTCGTACATG CGATGTTCAG CCCTTATGTC GCATTTACGA GCGCGCAGCC ACGGATTTCA CCGCGCAAGG GGAGGAAGAA TGA
|
Protein sequence | MSAIPHSLQI PFPEVVVNRI AGLKAGDTVV TPNTRLAITL RREFDRIQAA QGISAWAAAD ILPVSAFIER IYEDALYSTH ASHLPQLLTS AQEQFLWEEV VVQSAAPLLL SVAESARLAR EAWRLAHEWQ LLPRFGKFVL NEDCRAFSAW SHRYKNVTRK ARQIDHARLC TLVTGLCGSP GIAIPQRLIC YGFDSVTPQW AALLAKLEEA GCEVDEMTRT DPAAHRFEVP YSPNRTVHRV ECDVSNDEIY QAAVWARARI EKDSTARVGI VVPPLSEYRS SLVRIFSSVM DPDVRQALPG VPRHMPFNIS LGVALSSYPL VHTAFLVLEL IQGEIEFERA SLLLRSPFLA GAETEMLPRA RLDAKLRKRA EPVITLERLL VLVKRENNAN CPLLVKLLWD CEKFCKKMST DVQRPSTLAR AISDILQLVG FPGERAPDSS EYQTLKKWQD VLAGLALLDN VTIGMSYRQA ISHLRGVAAD TLFQPETPDV PIQILGVFEA GGMMFDHLWV MGLSDEAWPL QPRPNPFLPI ELQRAERLPL GSPAATLELA SRFTEGWLAA SKEVILSHAR HSGSSDTREL APSPLIAHIA TSDIRLFGLL PDYVKHRDLI HQGRRLERIK DEPAPGIVSG KGGEGRARGG VAVIKDQAAC PFRALALHRL HAEGLKTPHP GLDAGERGTL VHEMLAQVWE RIKNKSALDA LAEDELRTLL TSAAGEVIAR MSARHSHTLS ERLARVEQRR LVRLAREWLE EERKRGDFTV VSTEARKSIE IGGLALDTRL DRVDELADGR RIVIDYKTRA PSVNAMLGER PDEPQLPLYL VAAEVNAAAI AFAQVRAGEM RYTALARDND LLPDVKGFAQ SRHIDRYDSW EALVAAWFEA LVSVVTDFSN GHSQVDPKKY PQTCRTCDVQ PLCRIYERAA TDFTAQGEEE
|
| |