Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1417 |
Symbol | |
ID | 3786615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1625469 |
End bp | 1627406 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811505 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_412112 |
Protein GI | 82702546 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000978143 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGAGA AGAAGGGTGA AGGCAGCAAA ACCATAAGTC TCACGCTAGG ATTGATGATT GATTTCAAGA TACATGGAGC CATGCAGATG GGGCAGAAGC GGAAGACAGA TATCATTTCA TGCGGAGAGG CTCAGACACT GGCAGGACTC TTTTCCATTC GAATCAAGCG TACCCCACAA GCGATCGCGT ACCGGCAATT TGATGTCGAG AGCGGGGAAT GGCGGGAATA CAACTGGCAG GAAATGGGTA CGCGGGTGAG GCGCTGGAAG CGCGCTTTGA TGCGGGAGAA TCTCGAAGCG GGTGATCGTG TCGCAATCCT GCTACGCAAT TCCATTGAAT GGGTCTGCTT TGATCAGGCG GCACTCGCTG TCGGGCTCGT GGTGGTGCCT CTTTATCCCT CCGATGCGCC AGATAACATT GCCTACATTC TCGAGGATTC CGGCAGCCGG TTACTTCTGG TGGGCACTCA AAAGCGTTGG GAAACACTGG CCTCCCGATG CAAGGATGCC GGATTAGGCA AGATACTATG CGTTGAACAT CCGTCAGGAG ACGGTGGCGA GGGCAGGGTG CTACAGGGTG TAGGTGAATG GCTGAAGGCA GCAGATGAGG GTGCCAGCGA TGAGGAGGAG AGGGGCAACT CTGGCGACAA GGGTAATTCT CAACCCTCCG ATTCTCACGC GCTCGCTACA CTTGTTTACA CTTCTGGAAC CACCGGCAAG CCCAAGGGTG TCATGCTTTC ACACTTCAAT GTGCTTTGGA ATGCGGAGGC AACCCTTCAA GCGATATCCG GCTATCCGGA AGACGTTTAT CTCTCGCTTC TGCCGCTCTC GCATATGCTT GAGCGCACTG CCAGCTATTA CGTTCCTCTC ATGGCGGGGA GCAGCGTAGC CTATGCCCGT TCACTAAAAG ATTTGCCAGA GGATTTGAAA TCCGTACGGC CTGGTATATT CGTTGCCGTG CCGCAGGTTT ATGTAGGTAT TCGCAATAAA ATGAACCAGC AGGTGCAGGA AAGAGGATGG GTTGCCAGGT TGTTGCTCGA CTGGACTGTT GCACTTGGCT GGAAACGCTT CACCGTCGTG CAAGCACAGG GGAAGGAGAG ACTATGGCAG CGCGTTGCGT GGCCTATTCT GCGTCAATTG GTAGCCGCCA AGGTGCTGGC CGCATTCGGG GGGAGGCTCC GGCTAGCCGT AAGCGGAGGT GGCCCGCTCC ATGCGGATGT TTCCAGGTAT TTTATAGGAC TGGGTTTGCC GCTTCTGCAA GGGTACGGAC TGACCGAAGC TTCACCCATT CTGACAGCCA ATCGCTTGCA GGATAATATG CCCGGATCAA CGGGGAGCGC ATTGCTTGGC GTAGAGCTGC GTATCGGCGA GCAGCGTGAA CTGTTGGCCC GAAGTCCTGG CGTCATGCTG GGCTACTGGA ACAGACCCGA AGAAACCCGC GCTGCGATTG ATGCAGAGGG GTGGCTGCAT ACCGGTGATC AGGCCCGTAT TTCTGACAAT CATGTATTTA TCAGCGGACG AATCAAAGAG ATTCTGGTCA CTTCCAGTGG TGAAAAAGTG CCCTCGGGAG ATCTGGAGAT GTCTATCGTT CAAGAACCCT TGTTTGACCA GGTAATGGTG GTTGGCGAAG GAAGACCTTA TTTGACCGCA CTGGCTGTAG TGAACAAGAG GGAATGGCGG AATCTTGCCT CCAGCCTGGG GCTGAAAACG GACGAGGTCC AATCTCTGAG CCATTCGGCT ACCCGAGCAG CCGCTTTGAA AAGGATCAAG GCAACCTTGC GCGGTTTCCC CAAATACGCC CGAATTCGGG CGGTATATCT GTCACAGGAA CCCTGGAAGG TGGAAGACGG CCTGCTGACA CCCACTCTGA AACTGAAACG TTCAGAAATC GAAAAGCGCT TCGCGACCCA GATTACCGAA CTGTACGAAA AAGGATGA
|
Protein sequence | MTEKKGEGSK TISLTLGLMI DFKIHGAMQM GQKRKTDIIS CGEAQTLAGL FSIRIKRTPQ AIAYRQFDVE SGEWREYNWQ EMGTRVRRWK RALMRENLEA GDRVAILLRN SIEWVCFDQA ALAVGLVVVP LYPSDAPDNI AYILEDSGSR LLLVGTQKRW ETLASRCKDA GLGKILCVEH PSGDGGEGRV LQGVGEWLKA ADEGASDEEE RGNSGDKGNS QPSDSHALAT LVYTSGTTGK PKGVMLSHFN VLWNAEATLQ AISGYPEDVY LSLLPLSHML ERTASYYVPL MAGSSVAYAR SLKDLPEDLK SVRPGIFVAV PQVYVGIRNK MNQQVQERGW VARLLLDWTV ALGWKRFTVV QAQGKERLWQ RVAWPILRQL VAAKVLAAFG GRLRLAVSGG GPLHADVSRY FIGLGLPLLQ GYGLTEASPI LTANRLQDNM PGSTGSALLG VELRIGEQRE LLARSPGVML GYWNRPEETR AAIDAEGWLH TGDQARISDN HVFISGRIKE ILVTSSGEKV PSGDLEMSIV QEPLFDQVMV VGEGRPYLTA LAVVNKREWR NLASSLGLKT DEVQSLSHSA TRAAALKRIK ATLRGFPKYA RIRAVYLSQE PWKVEDGLLT PTLKLKRSEI EKRFATQITE LYEKG
|
| |