Gene Nmul_A1830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1830 
Symbol 
ID3785939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2102035 
End bp2105451 
Gene Length3417 bp 
Protein Length1138 aa 
Translation table11 
GC content59% 
IMG OID637811917 
Productamino acid adenylation 
Protein accessionYP_412519 
Protein GI82702953 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAGCA GCAGTCTAGT TCAAAGACGC GCCCGCCTGA CCCCCGAGCA ACGGGAGAGG 
CTGGCGCAGC GGCTGGCCGG AGCTCATGCT CCAGCACTTC AATCGAATAT CCCTTGCCGC
AATGCTTCCG CGCGGGTGCC GCTCTCATAC GCACAGGAGC GTCACTGGTT TTTATGGCAA
TTGGAGCCGT TGAGCACGGC TTATCATTTG AGCGGGGGAT TGCGGCTGAC GGGCAGGGTG
GATATTGAAG CGCTGCGTTG GAGCTTTGCG GCGCTGGGCA GGCGGCATGA GTCGTTGCGT
ACGATATTCA GGGTCAATTC GGAAGGGTTG CCGGAGCAGA TCATCGAAGA CGAGCCGCGG
CTTGAAATTC CGCTGACCGA CTTTTCCGGA CTGCCGCTGG AACAAGCCAG AGCGCAAGCC
GGTGAAGAAG CGGGCCGGAT AGCCGGCACG CCCTTTGATC TGACGCAAGG CCCGCTGCTT
CGGGTTGCCC TCATCCGCAT TGCAGCGGAA GAACATCTTC TCGTGGTGGT GATGCACCAC
ATCATCTCGG ACGCCTGGTC CAACCGCATT GTCATTGACG AATTTGCCGC CCACTATCGG
GCACGGGTGC AGCAGGAGCA GGAGGGGGAG AAACAGGGGC AGGAACCCTC CCTGCCGGCC
CTGCCGATCC AGTATGCCGA TTACGCGATA TGGCAGCGCA ACTGGCTGGA AGCGGGAGAA
AAAGAGCGCC AGCTGGCCTA CTGGCGCAGC CAGTTGGGGG AAGAGCACCC GGTATTGCAA
TTGCCCACCG ATCACCCCCG ATCTTCCAGG GCCAGTTACC GTGCGGCGCG CCACACCTTC
ACATTACCTG CGGGTCTGGT TACACGCTTG CAGCGTCAGG CGCAAAGCCA GGGAGCGACC
CTGTTCATGG CGCTGCTCTC GGGCTTTCAA GGCCTGCTCT ATCGCTATAC CGGCCAGCGG
GATATCCGCG TGGGCGTGCC GATTGCCAAC CGGCATCGGG CTGAAATAGA AAACATCGTC
GGCTTCTTCG TCAATACCCA GGTATTGCGC ACCCTCATGG ATGGGCGCAT GTCCCTGCAT
ACGTTGCTCG ATCAGACGCG GGAAGCAGCG CTGGGTGCCC AGACCCACCA GGATTTGCCG
TTCGAGCGAC TGGTTGAAGC CCTGCAACCC GAACGCAACC TGAATCAGAA TCCTCTGTTT
CAGGTCATGT ACAACCACCT GCGCGAAGAC TACCGGGCAC TCGAGCAATT GCCCGGGCTC
AAGGTGGAAA ATCACGAGCT GAGCGAGCAG GCGGCGCAGT TCGAACTGAC CCTGGATACG
GTCGAGCAGC CCGATGGCAG GCTGGAAGCC ACCTTCACCT ATGCCGCCGA GCTGTTTGAA
CCTGCCACCA TTGGGCGGCT TGGCAACCAT TATCTGCTTC TTCTGGAGCA ACTGGCCGAG
CATCCGCAGC AGAACCTTGG CGACATCGAC ATCCTCAGTG AAGCCGAGCG GGCGCAGCTC
AAGGCCTGGG GGATCAACGA GCAGCGCTAC GCCAATACCG AGCCCGTGCA CAGGCTGATC
GAGCGGCAGG TTGAAGTCCA GCCGGAAGCG ATTGCCCTGA TCTTTGGCGA TGTCGAATTG
AGCTACGGCG AGCTGAACCG AAGGGCGAAC CGCCTGGCGC ACCGTTTGAT CAGGCTTGGG
GTTGGGCCGG AGGTCAAGGT GGGCATTGCG GTGGAGCGCT CGATCGACAT GGTGGTGGGG
TTGCTTGCCA CCCTGAAGGC GGGCGGAGCA TATGTGCCGC TTGATCCGGA ATATCCGCAG
GAGCGGCTGG CCTACATGGT GGCAGACAGT GGCATCGGGC TGTTGCTGAC GCAAAGCCGG
GTTCGATCCG CCATTCCCCA TTCCGACCAA TGCGTGGTAC TGGAGCTGGA CAGGCTCGAT
CTCGAGGAGG AATCCGGCAG CAACCCGCAA GTCGCCCTGC ATGGATACAA CCTTGCCTAC
ATCATCTATA CCTCAGGCTC CACAGGTAAA CCAAAGGGCG TAAGTGTAGC GCATCATGCG
CTGGTTGAGC ATGCACAGGT AGCGGTAGGC TTCTTCGGTC TTGGTTCCAC AGACCGGATG
TTGCAATTTT CCACCATCAA CTTCGATGGG TTTATCGAAC AGCTTTTCCC CCCCTTGTGC
GCGGGAGCCG CCGTTGTCTT GCGCGGCCCG GCGCTGTGGG ACAGCGAGAC TTTCTATCGC
GAGCTGATCG AAAAGCGCAT CACGGTTGCC GATCTTACCA CCGCCTACTG GTTCATGCTG
GTGCAGGATT TTGCCAGAGG GGGTCCACGC GACTACGGGT TGTTACGCCA GGTTCATGCG
GGCGGTGAGG CCATGTCGCC TGAAGGACTC AAAGCCTGGA GCGAGGCGGG ATTCGACGGT
GTGACCCTGC TGAATACCTA CGGTCCGACC GAAGCCGCTG TGACCGCGAC CGTATGGAAT
TGCAGCGATT ATTCGCAGGG TAACGAAATA TCCTCCCAAG TGTCCATTGT CCCTATTGTG
TCGATTGGCA GTCCGCTTGC CGCCCGTCAT ATCTATCTGC TGGACGCCAA CCTGACTCCT
GTTTCCCCTG GAATTCCCGG TGAGCTGTGC ATAGGAGGGG AATTGCTCGC TCGCGGCTAT
CTCAACCGTG GAGGATTGAC GGCGGAGCGT TTCATAGCCG ATCCCTTCGA TGGAGGAGGC
GGACGACTCT ACCGCACGGG AGATCTGGCA AGATGGCGCT CGGACGGGCA GATCGAATAT
CTGGGGCGGC TGGATCATCA GGTCAAGATA CGGGGATTCC GCATCGAGCT GGGCGAAATC
GAAATGCAAC TGCTGGCGCA ACCGGAAGTC AGGGAAGCGG TGGTGGTTGC CAGGGAAAGT
GCCCGCGGCT CCAATCCTGC GGGAGGAGCA AGACTCGTTG CCTATGTTTC CTTGCATGCG
GAAGCGGAGA TGGAAGTTGG GCGACTGCGT GAAGCGTTGG GCAAGGTTTT GCCAGACTAC
ATGCTGCCCT CAATGATTGT GGTGCTGGAG AGTCTGCCGC TCAATCCGAG CGGCAAGGTA
GACCGCAAGG CCTTGCCCGA GCCGGAGTTT ACCCATACGG AGCATTATGA GGCGCCGCGG
GGGGAAGCGG AAGAGGTGCT GGCAGGTATC TGGGCGCAGG TGCTGGGTGT GGCGCAGGTG
GGACGGCATG ACAACTTCTT TGAACTGGGG GGACATTCGC TCGCTATCCT CCAGGTTCAG
CAGAAACTGC AACAAGCCCT ATCCATTTCG TTGCCTTTGC GGCTGCATTT CGAGAATCCC
CTGCTGAAGG ATATTGCTTC TGCCATCCAG GAAAAACGGT CCCGGGCATC CGAAAAAGAC
GCGGAGCAGG AGGACCTGTT GGGAATGGCG GAATTGCTTG ATTTACTGGA GAGTTGA
 
Protein sequence
MHSSSLVQRR ARLTPEQRER LAQRLAGAHA PALQSNIPCR NASARVPLSY AQERHWFLWQ 
LEPLSTAYHL SGGLRLTGRV DIEALRWSFA ALGRRHESLR TIFRVNSEGL PEQIIEDEPR
LEIPLTDFSG LPLEQARAQA GEEAGRIAGT PFDLTQGPLL RVALIRIAAE EHLLVVVMHH
IISDAWSNRI VIDEFAAHYR ARVQQEQEGE KQGQEPSLPA LPIQYADYAI WQRNWLEAGE
KERQLAYWRS QLGEEHPVLQ LPTDHPRSSR ASYRAARHTF TLPAGLVTRL QRQAQSQGAT
LFMALLSGFQ GLLYRYTGQR DIRVGVPIAN RHRAEIENIV GFFVNTQVLR TLMDGRMSLH
TLLDQTREAA LGAQTHQDLP FERLVEALQP ERNLNQNPLF QVMYNHLRED YRALEQLPGL
KVENHELSEQ AAQFELTLDT VEQPDGRLEA TFTYAAELFE PATIGRLGNH YLLLLEQLAE
HPQQNLGDID ILSEAERAQL KAWGINEQRY ANTEPVHRLI ERQVEVQPEA IALIFGDVEL
SYGELNRRAN RLAHRLIRLG VGPEVKVGIA VERSIDMVVG LLATLKAGGA YVPLDPEYPQ
ERLAYMVADS GIGLLLTQSR VRSAIPHSDQ CVVLELDRLD LEEESGSNPQ VALHGYNLAY
IIYTSGSTGK PKGVSVAHHA LVEHAQVAVG FFGLGSTDRM LQFSTINFDG FIEQLFPPLC
AGAAVVLRGP ALWDSETFYR ELIEKRITVA DLTTAYWFML VQDFARGGPR DYGLLRQVHA
GGEAMSPEGL KAWSEAGFDG VTLLNTYGPT EAAVTATVWN CSDYSQGNEI SSQVSIVPIV
SIGSPLAARH IYLLDANLTP VSPGIPGELC IGGELLARGY LNRGGLTAER FIADPFDGGG
GRLYRTGDLA RWRSDGQIEY LGRLDHQVKI RGFRIELGEI EMQLLAQPEV REAVVVARES
ARGSNPAGGA RLVAYVSLHA EAEMEVGRLR EALGKVLPDY MLPSMIVVLE SLPLNPSGKV
DRKALPEPEF THTEHYEAPR GEAEEVLAGI WAQVLGVAQV GRHDNFFELG GHSLAILQVQ
QKLQQALSIS LPLRLHFENP LLKDIASAIQ EKRSRASEKD AEQEDLLGMA ELLDLLES