Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1969 |
Symbol | |
ID | 3784993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2263566 |
End bp | 2265311 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637812058 |
Product | hypothetical protein |
Protein accession | YP_412656 |
Protein GI | 82703090 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCCC GTGCTTTTAA TTCCCTTATG ATGGGACTGG TATTGGCTGG GACAAGCGTC TTTTGCCCGT CCTCAGAAGC GCGTGACTCG GCTGTGGGCG AGCAAAGCCG AACCCCTGCG GCGCAGGGAT CAAACCACAA GCATTATGAA GCTTCACCCA TGGCGAATCA TCCAGGACCG GATGGGCAAC TGGCCCCCAG GTTGCAGAAC CTCGGCAATC ATGCGTTTCC AGTCAGCACG CCAAATAAAA GAGCACAGCT CTTTATCAAT CAGGGCATCA ATCTGGCTTA CGGCTTCAAC CATGCCGAAG CGCGACGCGC ATTCCGCGAA GCCGCGCGAC TCGATCCCGC GCTGGCAATG GCATACTGGG GTCAGGCGCT GGTACTGGGC CCCAACATCA ATGCCCCCAT GGATCCCAAC GATGAGCCCG ATGCCTTGAA GCTGGTGCAA AAGGCGAAAT CGCTGATGGA GTCGATCTCG GAAAGGGAAC AGGCACTGAT CAGAGCGCTG GAGAAACGTT ATTCAGGCGA TTCAGGGGAT CGCGCGGCAA ATGACAAGTC TTATGCTGAA GCAATGCGGA CGGTCCATCG GCGTTTTCCC GCTGACCCTG ATATCGCAAT GCTCTATGTG GAATCAGTGA TGGATCTCCG CCCCTGGGGA TACTGGATGC GGGACGGCCA TCCTCATGCG GGAACGGCTG AAATCGTGGC GCTGACCGAG GAAGTATTGC GCCGCCATCC CGCGCATCCC GCCGCATTGC ACATGCAAAT TCATCTAATG GAGCCCACGA ACACGCCCGA GCGAGCGGAG AAAGCCGCGG ATGTCCTGCT CCCGCTGATG CCTGCAGCGG GTCACATGAT ACACATGCCG TCGCACATTT ATCAGCGGGT GGGACGGTAT GGGGATGCGA TAAAAAGCAA CCGATTGGCA ATAGCGGCCG ACGAGGACTA CATCGCTCAA TGCCAGGCAC AGGGACTGTA CCCGATGGCC TACTACCCGC ATAACATCCA CTTTCTTTCG TTTTCTGCCA CTGCAAACGG TCAAAGCAGA ATGGCGATCG AATCCGCCCG CAAGACCGCC AGCAGGATAG ACGATGCCAC GCTGAAGGAA ATGCCCCTGA CTGCCGTGTT CCGCATGACA CCCTACTGGG CTCTCGCGAG GTTCGGACAC TGGCAGGAGA TACTCGATGA GCCCTCTCCT CCCGTCACGA ACGCCTTCCT CAAGGGGAGC TGGCATTATG TCCGGGGTCT CGCATTTGTC GCAACCGGGC GTCTTTCCCA AGCCGAGCAG GAACTGGGAA CCTTGCGCGA GATCATGAAA GACCCGAGCC TGGACGGTGC GCTTTTCTCC AAGAATACGC CGCGCACCGT GCTGAGGATT GCTCCGGAAG TACTGGCCGG TGAAATTGAC GCCGCTCGCG GTAAATTCGA TTCGGCCATA GCGCATCTTG AACGCGCGAT CCGCTACGAG GATGCTCTGG TTTACACGGA ACCCGCTGAG TGGCACTATC CGCCGCGGCT CGCGCTGGGC GCGATCCTAC TTGAAGCCGG ATATCCCGAT GAAGCAGAGA CGGTCTACTG GAGCGACCTG CAACGCAATC GCGACAGTGG TTGGACTCTT TTCGGGCTGC TGCAGGCCCT GCGCGCCCAG AAAAAGGAAG CCGAAGCCGA AGTTATTGAG GCGCGCTTCA AAAGGGCATG GGAGCAGGCT GACGTAAAGC TGACGGCGTC ACGCATGGGG CGATAG
|
Protein sequence | MESRAFNSLM MGLVLAGTSV FCPSSEARDS AVGEQSRTPA AQGSNHKHYE ASPMANHPGP DGQLAPRLQN LGNHAFPVST PNKRAQLFIN QGINLAYGFN HAEARRAFRE AARLDPALAM AYWGQALVLG PNINAPMDPN DEPDALKLVQ KAKSLMESIS EREQALIRAL EKRYSGDSGD RAANDKSYAE AMRTVHRRFP ADPDIAMLYV ESVMDLRPWG YWMRDGHPHA GTAEIVALTE EVLRRHPAHP AALHMQIHLM EPTNTPERAE KAADVLLPLM PAAGHMIHMP SHIYQRVGRY GDAIKSNRLA IAADEDYIAQ CQAQGLYPMA YYPHNIHFLS FSATANGQSR MAIESARKTA SRIDDATLKE MPLTAVFRMT PYWALARFGH WQEILDEPSP PVTNAFLKGS WHYVRGLAFV ATGRLSQAEQ ELGTLREIMK DPSLDGALFS KNTPRTVLRI APEVLAGEID AARGKFDSAI AHLERAIRYE DALVYTEPAE WHYPPRLALG AILLEAGYPD EAETVYWSDL QRNRDSGWTL FGLLQALRAQ KKEAEAEVIE ARFKRAWEQA DVKLTASRMG R
|
| |