Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0801 |
Symbol | |
ID | 3785845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 917121 |
End bp | 918647 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637810887 |
Product | regulatory protein GntR, HTH |
Protein accession | YP_411500 |
Protein GI | 82701934 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0129569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTC CCATTGAACT CGACCATAAC AGTCGTCAAT CGCTGCAAGG CCAGATTTTT GACCAGTTAC GCCACCTTAT CCTCGGCGGT AAGCTAAAAC CCGGTACACT CATCCCGGCG AGTCGCGTAC TTGCCGAACA ACTGGGTATT TCTCGTAATA CTGTTTTGCT GGTTTATGAT CGTTTGATTG CCGAAGGCTA TCTTCAGGCC CGGAAGGCAA TTGGAACGTA TGTAAACCTT GAGCTTCCTG AAACCTGCCT TTCTGCCACC CGCAGGGTGA CCCCCTCCTC TCCTGGTGAT GAAGAATTTG TCAAACAGCC GGTAATCCCA TTTATCGGCC ATATACACGC CAACATCACT GTTCAGCATC TCGATTTTGA TTTTTGTCCG GATCGCATCG GTCTCGACCT GTTTCCTCAT AAGATATGGC GCCGCTTTGT AAATAAGACA TTCATGTCTG CAAACATGCA CTTCGAGGAG TGTTGCGATC CCGGTGGGCT ATCGGAACTG CGGGAAGTCA TCGCTGCCCA CCTTGGAGTC GCGCGGGGTA TCAGCGTTTC ATCAGATCAG ATAATTATTA CGGCCGGCTC TCAGGAGGGA TTGAATCTTG CTGCTCGCCT TCTTGTAAAA GAGGGGACAA TGGTTGCAAC GGAAAATCCC TGCTACCAAG GCGCCACTTC GCTTTTCGAG AGTTACTATG CAAAATTGAT TCCTGTGCCT GTGGATGAGG CAGGGCTCGA CGTTGAACAA CTACCAAAGG AAGGAGTTGC TCTTTTATTT GTGACTCCTT CACATCAATT TCCCCTGGGC TTCACCTTGC CCATCGAGCG CCGGTTGAAG CTCCTGGATT GGGCGCGGCG TTGTGGGGCA TTCATCATAG AGGACGATTA CGATTCAGAT TTTCGATATC GCGGATCGCC CCTGACCGCT CTCATGGGAC TGGATGATTA TGGTTGTGTC ATGTATCTGG GTACTTTCTC CAAATCCATG GGTGCGGGCC TGCGCCTAGG CTACCTAGTG GTACCCAAAG CATTGATTCC TGCTGCCCGT TCTGCAAAAG CTCTGCTTAA TAATGGGCAT GCCTGGCTGG ATCAAGCAAC TATGGCCGAA TTTATCCGCA GCGGCGCTTA TGGCAATCAT CTGAGACGCA TGCGAAATAT GTATGGTAAG CGCCGTGATT GTCTGGTGGA AGCACTGCGG GAACATTTTG GGGCCATCCG CCTTAGCGGC CTTGAGAGTG GCCTGCATCT GGCATGGCAT CTTCCTGATC ACTATCCACC TGCAGCTCAG CTCCAGACAA TTGCATTACG TCATGGAGTA AGAATCTACT CGATCGGAGG GGGTACGGGC TATGATTACG GTGGCTGTCG CTACAGCGCC CGTACGCTCG TGTTAGGTTT TTCTTCCCTC AACGAATACC AGATCCGGGC TGGTGTTCGG CGAATTGCTG ATGCTTTCGC GGATATGTCA CTCAACCCCC TTTCCATGCG AGAGAAAGTG GATAACGTAG CGACACCCTC TCTTTAG
|
Protein sequence | MAIPIELDHN SRQSLQGQIF DQLRHLILGG KLKPGTLIPA SRVLAEQLGI SRNTVLLVYD RLIAEGYLQA RKAIGTYVNL ELPETCLSAT RRVTPSSPGD EEFVKQPVIP FIGHIHANIT VQHLDFDFCP DRIGLDLFPH KIWRRFVNKT FMSANMHFEE CCDPGGLSEL REVIAAHLGV ARGISVSSDQ IIITAGSQEG LNLAARLLVK EGTMVATENP CYQGATSLFE SYYAKLIPVP VDEAGLDVEQ LPKEGVALLF VTPSHQFPLG FTLPIERRLK LLDWARRCGA FIIEDDYDSD FRYRGSPLTA LMGLDDYGCV MYLGTFSKSM GAGLRLGYLV VPKALIPAAR SAKALLNNGH AWLDQATMAE FIRSGAYGNH LRRMRNMYGK RRDCLVEALR EHFGAIRLSG LESGLHLAWH LPDHYPPAAQ LQTIALRHGV RIYSIGGGTG YDYGGCRYSA RTLVLGFSSL NEYQIRAGVR RIADAFADMS LNPLSMREKV DNVATPSL
|
| |