Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1830 |
Symbol | |
ID | 3785939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2102035 |
End bp | 2105451 |
Gene Length | 3417 bp |
Protein Length | 1138 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811917 |
Product | amino acid adenylation |
Protein accession | YP_412519 |
Protein GI | 82702953 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAGCA GCAGTCTAGT TCAAAGACGC GCCCGCCTGA CCCCCGAGCA ACGGGAGAGG CTGGCGCAGC GGCTGGCCGG AGCTCATGCT CCAGCACTTC AATCGAATAT CCCTTGCCGC AATGCTTCCG CGCGGGTGCC GCTCTCATAC GCACAGGAGC GTCACTGGTT TTTATGGCAA TTGGAGCCGT TGAGCACGGC TTATCATTTG AGCGGGGGAT TGCGGCTGAC GGGCAGGGTG GATATTGAAG CGCTGCGTTG GAGCTTTGCG GCGCTGGGCA GGCGGCATGA GTCGTTGCGT ACGATATTCA GGGTCAATTC GGAAGGGTTG CCGGAGCAGA TCATCGAAGA CGAGCCGCGG CTTGAAATTC CGCTGACCGA CTTTTCCGGA CTGCCGCTGG AACAAGCCAG AGCGCAAGCC GGTGAAGAAG CGGGCCGGAT AGCCGGCACG CCCTTTGATC TGACGCAAGG CCCGCTGCTT CGGGTTGCCC TCATCCGCAT TGCAGCGGAA GAACATCTTC TCGTGGTGGT GATGCACCAC ATCATCTCGG ACGCCTGGTC CAACCGCATT GTCATTGACG AATTTGCCGC CCACTATCGG GCACGGGTGC AGCAGGAGCA GGAGGGGGAG AAACAGGGGC AGGAACCCTC CCTGCCGGCC CTGCCGATCC AGTATGCCGA TTACGCGATA TGGCAGCGCA ACTGGCTGGA AGCGGGAGAA AAAGAGCGCC AGCTGGCCTA CTGGCGCAGC CAGTTGGGGG AAGAGCACCC GGTATTGCAA TTGCCCACCG ATCACCCCCG ATCTTCCAGG GCCAGTTACC GTGCGGCGCG CCACACCTTC ACATTACCTG CGGGTCTGGT TACACGCTTG CAGCGTCAGG CGCAAAGCCA GGGAGCGACC CTGTTCATGG CGCTGCTCTC GGGCTTTCAA GGCCTGCTCT ATCGCTATAC CGGCCAGCGG GATATCCGCG TGGGCGTGCC GATTGCCAAC CGGCATCGGG CTGAAATAGA AAACATCGTC GGCTTCTTCG TCAATACCCA GGTATTGCGC ACCCTCATGG ATGGGCGCAT GTCCCTGCAT ACGTTGCTCG ATCAGACGCG GGAAGCAGCG CTGGGTGCCC AGACCCACCA GGATTTGCCG TTCGAGCGAC TGGTTGAAGC CCTGCAACCC GAACGCAACC TGAATCAGAA TCCTCTGTTT CAGGTCATGT ACAACCACCT GCGCGAAGAC TACCGGGCAC TCGAGCAATT GCCCGGGCTC AAGGTGGAAA ATCACGAGCT GAGCGAGCAG GCGGCGCAGT TCGAACTGAC CCTGGATACG GTCGAGCAGC CCGATGGCAG GCTGGAAGCC ACCTTCACCT ATGCCGCCGA GCTGTTTGAA CCTGCCACCA TTGGGCGGCT TGGCAACCAT TATCTGCTTC TTCTGGAGCA ACTGGCCGAG CATCCGCAGC AGAACCTTGG CGACATCGAC ATCCTCAGTG AAGCCGAGCG GGCGCAGCTC AAGGCCTGGG GGATCAACGA GCAGCGCTAC GCCAATACCG AGCCCGTGCA CAGGCTGATC GAGCGGCAGG TTGAAGTCCA GCCGGAAGCG ATTGCCCTGA TCTTTGGCGA TGTCGAATTG AGCTACGGCG AGCTGAACCG AAGGGCGAAC CGCCTGGCGC ACCGTTTGAT CAGGCTTGGG GTTGGGCCGG AGGTCAAGGT GGGCATTGCG GTGGAGCGCT CGATCGACAT GGTGGTGGGG TTGCTTGCCA CCCTGAAGGC GGGCGGAGCA TATGTGCCGC TTGATCCGGA ATATCCGCAG GAGCGGCTGG CCTACATGGT GGCAGACAGT GGCATCGGGC TGTTGCTGAC GCAAAGCCGG GTTCGATCCG CCATTCCCCA TTCCGACCAA TGCGTGGTAC TGGAGCTGGA CAGGCTCGAT CTCGAGGAGG AATCCGGCAG CAACCCGCAA GTCGCCCTGC ATGGATACAA CCTTGCCTAC ATCATCTATA CCTCAGGCTC CACAGGTAAA CCAAAGGGCG TAAGTGTAGC GCATCATGCG CTGGTTGAGC ATGCACAGGT AGCGGTAGGC TTCTTCGGTC TTGGTTCCAC AGACCGGATG TTGCAATTTT CCACCATCAA CTTCGATGGG TTTATCGAAC AGCTTTTCCC CCCCTTGTGC GCGGGAGCCG CCGTTGTCTT GCGCGGCCCG GCGCTGTGGG ACAGCGAGAC TTTCTATCGC GAGCTGATCG AAAAGCGCAT CACGGTTGCC GATCTTACCA CCGCCTACTG GTTCATGCTG GTGCAGGATT TTGCCAGAGG GGGTCCACGC GACTACGGGT TGTTACGCCA GGTTCATGCG GGCGGTGAGG CCATGTCGCC TGAAGGACTC AAAGCCTGGA GCGAGGCGGG ATTCGACGGT GTGACCCTGC TGAATACCTA CGGTCCGACC GAAGCCGCTG TGACCGCGAC CGTATGGAAT TGCAGCGATT ATTCGCAGGG TAACGAAATA TCCTCCCAAG TGTCCATTGT CCCTATTGTG TCGATTGGCA GTCCGCTTGC CGCCCGTCAT ATCTATCTGC TGGACGCCAA CCTGACTCCT GTTTCCCCTG GAATTCCCGG TGAGCTGTGC ATAGGAGGGG AATTGCTCGC TCGCGGCTAT CTCAACCGTG GAGGATTGAC GGCGGAGCGT TTCATAGCCG ATCCCTTCGA TGGAGGAGGC GGACGACTCT ACCGCACGGG AGATCTGGCA AGATGGCGCT CGGACGGGCA GATCGAATAT CTGGGGCGGC TGGATCATCA GGTCAAGATA CGGGGATTCC GCATCGAGCT GGGCGAAATC GAAATGCAAC TGCTGGCGCA ACCGGAAGTC AGGGAAGCGG TGGTGGTTGC CAGGGAAAGT GCCCGCGGCT CCAATCCTGC GGGAGGAGCA AGACTCGTTG CCTATGTTTC CTTGCATGCG GAAGCGGAGA TGGAAGTTGG GCGACTGCGT GAAGCGTTGG GCAAGGTTTT GCCAGACTAC ATGCTGCCCT CAATGATTGT GGTGCTGGAG AGTCTGCCGC TCAATCCGAG CGGCAAGGTA GACCGCAAGG CCTTGCCCGA GCCGGAGTTT ACCCATACGG AGCATTATGA GGCGCCGCGG GGGGAAGCGG AAGAGGTGCT GGCAGGTATC TGGGCGCAGG TGCTGGGTGT GGCGCAGGTG GGACGGCATG ACAACTTCTT TGAACTGGGG GGACATTCGC TCGCTATCCT CCAGGTTCAG CAGAAACTGC AACAAGCCCT ATCCATTTCG TTGCCTTTGC GGCTGCATTT CGAGAATCCC CTGCTGAAGG ATATTGCTTC TGCCATCCAG GAAAAACGGT CCCGGGCATC CGAAAAAGAC GCGGAGCAGG AGGACCTGTT GGGAATGGCG GAATTGCTTG ATTTACTGGA GAGTTGA
|
Protein sequence | MHSSSLVQRR ARLTPEQRER LAQRLAGAHA PALQSNIPCR NASARVPLSY AQERHWFLWQ LEPLSTAYHL SGGLRLTGRV DIEALRWSFA ALGRRHESLR TIFRVNSEGL PEQIIEDEPR LEIPLTDFSG LPLEQARAQA GEEAGRIAGT PFDLTQGPLL RVALIRIAAE EHLLVVVMHH IISDAWSNRI VIDEFAAHYR ARVQQEQEGE KQGQEPSLPA LPIQYADYAI WQRNWLEAGE KERQLAYWRS QLGEEHPVLQ LPTDHPRSSR ASYRAARHTF TLPAGLVTRL QRQAQSQGAT LFMALLSGFQ GLLYRYTGQR DIRVGVPIAN RHRAEIENIV GFFVNTQVLR TLMDGRMSLH TLLDQTREAA LGAQTHQDLP FERLVEALQP ERNLNQNPLF QVMYNHLRED YRALEQLPGL KVENHELSEQ AAQFELTLDT VEQPDGRLEA TFTYAAELFE PATIGRLGNH YLLLLEQLAE HPQQNLGDID ILSEAERAQL KAWGINEQRY ANTEPVHRLI ERQVEVQPEA IALIFGDVEL SYGELNRRAN RLAHRLIRLG VGPEVKVGIA VERSIDMVVG LLATLKAGGA YVPLDPEYPQ ERLAYMVADS GIGLLLTQSR VRSAIPHSDQ CVVLELDRLD LEEESGSNPQ VALHGYNLAY IIYTSGSTGK PKGVSVAHHA LVEHAQVAVG FFGLGSTDRM LQFSTINFDG FIEQLFPPLC AGAAVVLRGP ALWDSETFYR ELIEKRITVA DLTTAYWFML VQDFARGGPR DYGLLRQVHA GGEAMSPEGL KAWSEAGFDG VTLLNTYGPT EAAVTATVWN CSDYSQGNEI SSQVSIVPIV SIGSPLAARH IYLLDANLTP VSPGIPGELC IGGELLARGY LNRGGLTAER FIADPFDGGG GRLYRTGDLA RWRSDGQIEY LGRLDHQVKI RGFRIELGEI EMQLLAQPEV REAVVVARES ARGSNPAGGA RLVAYVSLHA EAEMEVGRLR EALGKVLPDY MLPSMIVVLE SLPLNPSGKV DRKALPEPEF THTEHYEAPR GEAEEVLAGI WAQVLGVAQV GRHDNFFELG GHSLAILQVQ QKLQQALSIS LPLRLHFENP LLKDIASAIQ EKRSRASEKD AEQEDLLGMA ELLDLLES
|
| |