Gene Information |
Locus tag | Nmul_A1246 |
Symbol | |
ID | 3786022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1430388 |
End bp | 1433450 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811331 |
Product | transcriptional regulator |
Protein accession | YP_411941 |
Protein GI | 82702375 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
| |
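The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced with a single terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1430388
end_bp = 1433450
gene_length_bp = 3063
protein_length_aa = 1020

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: unspliced CDS, one codon per residue plus one stop codon.
expected_protein_length = span // 3 - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the computed span agrees with the listed gene length, the span is a whole number of codons, and the derived residue count agrees with the listed protein length.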
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.571861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACAA CTTACTTGGA ATTCCATCCT TTCAGGCTGG ATAAGATCAA CGCCATCCTA TGGCGCAACG ATCAAGTAGT ACCGTTACGT CCGAAAAATT TCGCTATGTT GTGTTACCTG GCGGAGCGGG CTGGCACCCT GGTGACCAAG GATGAGTTGC TTGACGCGGT ATGGCAGCGC CGCTTCGTCG GAGAAGCAGT GCTGAAAGTA TGTATCAACG AAGTGCGGCG GGCTCTTGGA GACAGTGTTT CCGCTCCCAC CTATCTTTTA ACCGTTCCCA AACGCGGCTA CCGTTTCATC GCGCAGGTTA CTGAAGTCAA GTCGTCAGAG GAAGTGGAAG AAGTAATCTG CCCCGTTTTC CCCAAAAACC AGCGCTCTGA CAAGGTCGCA TATTGGATAG ACCGCCCGTC TCCACAAGCT CGCTTGCTGA CAATCTGGCA AAAATCACTG GTGGGTTCGC GCCAGATCGT TTTTGTTACC GGGGAAAGTG GAATCGGCAA GAGCACGCTG ATCGAGATGT TTCTCTCCAC AATATCCAGT CAAAGCCCGG GGGTTTTGCG CATGCGCTGC ATAGAGCGCT TTGGCCAGGG TGAAGCCCTG CTCCCGATGA TCGAGGCAAT TGAAAAGCGT TGCAATGCGC CAGGGGGGGG AAAACTCGTT GAACTGTTAT ATCGTCACGC ACCGGTTTGG CTTGCGCAAT TGCCGTCCGT GCTTCGCCCC GAGGAGCGTG TGGCGCTTCA GCAGGAGATT TTTGGCGCAA GCCGGGAACG CATGGTACGG GAAGGTTGTG AACTGCTGGA AACCTTGAGC AAAGTGCCCC TGATTCTCGT GCTTGAGGAT CTTCATTGGA GTGATCATGC GACTCTTGAT TTTTTAAGCT TGCTTGCGCA GCGTCATGTG CCAGCATACT TGATGGTGCT CGCCACCTAT CGCCCCATCG ATGCAAGCCA GCGGGTGCAC CCAGTTACAG AAGTTCACCG GGATTTGCAG TTGCGAGGAA TCTGTTCTGA AGTCGCCCTT GAGCCGTTTT CCTGCAATGA AGTCAAACAT TACCTCACTC GACGTTGCCC AGGCATAAAT ATTCCCGATT CGATCAGTCA GGCACTTTTC ATAAGAACCG GTGGACATCC TTTGTTCATA TCCAATCTGA TCGAATATTT AATGGAGCAA CATCAATGGT CACCGTTATC CCAGCAGATC GGAATCGATA CAGCGCTGCC GGAAACGATC CGCCGCGTCA TCGAGCGTGA AATCGAACGG CTCAGCCATG ATGAACAACG GGTGCTGACG GTGGCGAGTG TCGCGGGAAT GCGGTTTAGT GGGAACCTGC TTTGCAGTGT TCTGGGTATG GAGATCGCCG AAGCAGACCG CTGCTGCAAT GCCCTGGTTA GACGAGGCCA GATATTGGTG TCCGATGGAA TGGAGCAAAG CACGAAAGGA GTTGTCGCGG GCTACTATGC ATTCCGCCAT GCCCTGTACC TCGAAGTCTT CTACCAACGG CTTTCTCCCT CCGAAACGAT ACGAATGCAC CTTCGCATCG GAGAATACCT TGAAAAGGCA TACGGCGAGC AAAACGTGGA GCATGCAGGG GAACTCGCCC TGCATTTTGA AAACGGATGG GATTGGCTTC GCGCCATCCG CTATCTCGTG CAGGCGGCTG ACAACAGCAC CAGGCGCTTT GCCAATCGGC AGGCACATGA CTATCTGGCG CGCGCCGTTC GAATGATAGA ACGCTTGCCT GATGAGCAAC AAGCGAAAAC ACGCATAAGC CTCCTCAAGC AATCAGCGGC GGTAAGACGC 
TCAATGGGCG ATATGGCAGG AGCAAAAACT GATCTGGAGA AAATGCTGGC AGCCGCCAAA GCTTTGGGAG ACAGCCGGGA ACAGGCGGTG GGACTCCTCG AATTGAGCCG CGTCCTTGTT TTGTTGAACC GTCTTGAATG TCTCGAGTTT GCGGAGCAGG CGGTCTCTGC TTCCACAGCA CTTGAAGACA AGGTTTTTCA TTCAATCGTC AAGGGCATGT GGGGTGGCTT GAATCTGTTG CTCCGGCCAT GGCGGGAGGA TTATTCTGCC GCTTGTCACG AGTCTATGGA TGCGGTGCGC GCAACGGGAA ATCCCCTGGC TCTTCACTCG CGCTTGACTC AGCATATTTA CGTTGAACTG CTTGCCTCGA ATTACCAGGC TGCTGCGACG ACAGCAGTCG AAGCTCTGGC GTTATCACGC GTAATGGGAG ATGGCTACAT GTTCATCGCG GGCCATTATT ACTATGGATT AACGCTGCTG CATAAGGGTG AATGGGGCAG ATTGCGCGAA ACCGCAGAGC AAAGCAGGCG CGCGTTCGAG GGGCATGACG CTGGCTTGCT GCTTCGGTTG CATCGCCACA TCCTGCTGGG ATGGTTGCAT GTGGTAAGTG GCGATTTTTC GGGTGCCAAA GCGTATTGCG AGGAAGCGCT GTCAGAAGGC GTCGGTGCCT GGGCTGACTT CGTTTCGGTT CATTGTTCCG CAATCCTGGG AAAGGCCTTG CACGGGCTAA AAGATTATGC GGGAGCCATT CGATGCTTTG ACGCTTTTTT TCAGGCGGAA AAGAATAATG CTCTCCCGAT ATTCTCCAAC TATTTCTTCC CTGCTTGCTT GGGAATGGGA GAAACATGGC TCGCACTGGG AAAACTGGAT AAGGCGCGTC GCTATGCGCA GCGCTTATAT GATCTTTCCA GCGGTCCCTC TGAACGCACC TACCTTGCAC TCAGTCATCG CCTGTTTGCC GAGATTGCGA TAGTAGAAGA GAGCTGGGAC GAAGTACATT CGCATATTAC AAAAGCACTT GAAATCGTGG AAAATGCGGA AATTCCCCTT GCAGCCTGGA GGGTCTACGC GACTGGAGAA AAACTGCATT ATCGGCAGGA TGACGGGAAA AGGATGGGTT ACTACCGATC AAAAAAACAG GATGAAATCG ATCAACTCCT CAACTCTCTT CAACCGTCGG ATCCTTTAAG GAAACATTTG CTGAATCTTG CTGGAACCCA TGAATGCGAT TCTCTTTCAC CGGTCAGATA CAGCCCGCTT TAA
|
Protein sequence | MQTTYLEFHP FRLDKINAIL WRNDQVVPLR PKNFAMLCYL AERAGTLVTK DELLDAVWQR RFVGEAVLKV CINEVRRALG DSVSAPTYLL TVPKRGYRFI AQVTEVKSSE EVEEVICPVF PKNQRSDKVA YWIDRPSPQA RLLTIWQKSL VGSRQIVFVT GESGIGKSTL IEMFLSTISS QSPGVLRMRC IERFGQGEAL LPMIEAIEKR CNAPGGGKLV ELLYRHAPVW LAQLPSVLRP EERVALQQEI FGASRERMVR EGCELLETLS KVPLILVLED LHWSDHATLD FLSLLAQRHV PAYLMVLATY RPIDASQRVH PVTEVHRDLQ LRGICSEVAL EPFSCNEVKH YLTRRCPGIN IPDSISQALF IRTGGHPLFI SNLIEYLMEQ HQWSPLSQQI GIDTALPETI RRVIEREIER LSHDEQRVLT VASVAGMRFS GNLLCSVLGM EIAEADRCCN ALVRRGQILV SDGMEQSTKG VVAGYYAFRH ALYLEVFYQR LSPSETIRMH LRIGEYLEKA YGEQNVEHAG ELALHFENGW DWLRAIRYLV QAADNSTRRF ANRQAHDYLA RAVRMIERLP DEQQAKTRIS LLKQSAAVRR SMGDMAGAKT DLEKMLAAAK ALGDSREQAV GLLELSRVLV LLNRLECLEF AEQAVSASTA LEDKVFHSIV KGMWGGLNLL LRPWREDYSA ACHESMDAVR ATGNPLALHS RLTQHIYVEL LASNYQAAAT TAVEALALSR VMGDGYMFIA GHYYYGLTLL HKGEWGRLRE TAEQSRRAFE GHDAGLLLRL HRHILLGWLH VVSGDFSGAK AYCEEALSEG VGAWADFVSV HCSAILGKAL HGLKDYAGAI RCFDAFFQAE KNNALPIFSN YFFPACLGMG ETWLALGKLD KARRYAQRLY DLSSGPSERT YLALSHRLFA EIAIVEESWD EVHSHITKAL EIVENAEIPL AAWRVYATGE KLHYRQDDGK RMGYYRSKKQ DEIDQLLNSL QPSDPLRKHL LNLAGTHECD SLSPVRYSPL
|
| |
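The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it builds the standard codon table by hand, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits. It also assumes the gene sequence is listed in coding orientation (it begins with ATG even though the record gives the strand as "-"). The names `CODON_TABLE`, `translate`, and `gene_prefix` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
gene_prefix = "ATGCAGACAA CTTACTTGGA ATTCCATCCT"
print(translate(gene_prefix))  # MQTTYLEFHP
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would allow the whole protein to be compared in the same way.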