Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0095 |
Symbol | |
ID | 3785820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 99069 |
End bp | 101060 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810165 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_410796 |
Protein GI | 82701230 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATG TCTCCGCCAG CCGGCAGCAG GTTCGATCGG CATTATCTCC GGCAGCGACG CCCAGCGACG CGGAAATCCC TACAACATCT GGAAGACAGG ACATGGCGGG CATCATTCCG GTGGAAACTG CGGGAACTCT GGATGGATTG TTCCGCGAGC GGGTGCGCCG CACGCCCGAA GCGATAGCCT ATCGGGATTA TGATCGATCG AGCGGCAAGT GGCGTGATCT TTCCTGGGCG CGAATGGATG AGCGCATTGC ACACTGGGTG ATTGCGCTAT CGAAGGAAAA GCTTAAACCG GGTGACCGCG TCGGGATAAT GTTAAGAAAC TGCCCGGAAT GGGTAATGTT CGAACAGGCT GCGCTGAGGC TTGGGCTGGT GGTAGTGCCG CTCTATCCCA CTGATCGCCC GGATAATGCG GCTTATATCC TTCAGGATGC AGGCGTCAAG GTACTTCTGC TGGAGGAACT GGGGCAGTGG CTTGCTTTCT CGGAAGTGCG TGACCAGATG ACAGGGCTTG TACGCGTCAT CATTATCCAG GGCAGCGTCA GGCAGGGTCA TAGAGGAAAC GAGCTCGTGC TTGCGCTGCA TGACTGGCTG CCGGAGTGGT TCCCCGGCGA AAGACCGACC GAGTCGTTGC CAAGCGTGGA ACCCCTCTAT CAGCTTCCTG CGGAGCCGGA GCATCAGGCT CCGGAAGAAG TCCCGCCCAG CCGTGTCCTC GCACGTGACC CGCACCAGCT GGCTACCATC ATTTATACAT CGGGTACATC CGGGCATCCC AAGGGCGTGA TGCTCAGCCA TCATAATATA CTGACCAATG CCCATAGCTG CCTGCAGGTC GTACCGATCG AAGAAAGCGA CGTGCTGCTC TCCTTTTTAC CGCTGTCCCA CACATTTGAA CGGACTGCCG GTTATTACGT GCCGATGATG CGGGGCTCGA CAGTGGCGTA TGCGCGCTCG ATTCCCCAGT TGCAGGAGGA TCTGCTGATC ATCCGGCCGA CCATCCTCGT TTCGGTGCCC CGAATATATG AGCGCGTATA TGCCGGCATT CGCGCCAAGC TGGCGGAGGG GCCCCTGCTG AGCCGCAGGC TGTTCGACCT TGCAGTCGAG ATCGGGTATA ACCGGTTCGA GTATCAGCAG GGGCGCGCCG AGAAGCATTT TTCCCATGCG CTGTGGCCGT TGCTGGAAAT ACTGGTGGCA AAGAAGGTGA TGAGCAAACT GGGAGGGCGG CTGCGCGCAG CAATGAGCGG CGGAGCCGCA TTATCGAGCG AGGTTTCCCG CATATTTATC GGCCTGGGCC TGCCCATCCT CCAGGGCTAC GGTATGACGG AAAGCAGCCC GGTGGTTTGC TGCAATACGA TCGAGGACAA TGTTCCGGCG AGTGTCGGAC GGCCTATTCC AGGGGTGGAA GTGAAGCTCG GCGAGCAGAA TGCCTTGCTT ATTCGCGGAC CCAACGTCAT GCTCGGCTAC TGGAACAATG AGGAGGCCAC GCGAGCGGTA ATGACCCCGG ACGGGTGGCT CAATTCGGGC GATATCGCGG AAATCGATGA GGCCGGGCAT ATCGCCATTA CCGGCCGGGT AAAGGAAATC ATTGTCATGT CGACCGGGGA AAAGATTCCA CCTGCCAACA TGGAAGCGGC GATTCTACGC GATCCGTTAT TTGAACAGGT CATGGTGGTT GGCGAGGGAC GTCCTCATTT AGCGGTGCTG GCAGTTCTCA ACTCCGGAAA TTGGGAAAGC ATGGCTGACG AATACCATCT CGACCGGAAC TGGCGCCGTC TAGGGGGCGA TCCGAAGTTC GAAGAAATCC TGCTGGAACG GATCGCTTAT CAGATCAAAG GATTTCCGGG TTATGCCAAA ATATATCGCA TTGCCGTGGT GGCCGAGCCG TGGACAGTCG AGAATCAGAT GTTGACCCCG ACGTTGAAAC TGCGGCGCAC TTATGTGCAG AACCACTACA AAAGCGAGGT GGACAGGCTC TACACGAAGT AA
|
Protein sequence | MNNVSASRQQ VRSALSPAAT PSDAEIPTTS GRQDMAGIIP VETAGTLDGL FRERVRRTPE AIAYRDYDRS SGKWRDLSWA RMDERIAHWV IALSKEKLKP GDRVGIMLRN CPEWVMFEQA ALRLGLVVVP LYPTDRPDNA AYILQDAGVK VLLLEELGQW LAFSEVRDQM TGLVRVIIIQ GSVRQGHRGN ELVLALHDWL PEWFPGERPT ESLPSVEPLY QLPAEPEHQA PEEVPPSRVL ARDPHQLATI IYTSGTSGHP KGVMLSHHNI LTNAHSCLQV VPIEESDVLL SFLPLSHTFE RTAGYYVPMM RGSTVAYARS IPQLQEDLLI IRPTILVSVP RIYERVYAGI RAKLAEGPLL SRRLFDLAVE IGYNRFEYQQ GRAEKHFSHA LWPLLEILVA KKVMSKLGGR LRAAMSGGAA LSSEVSRIFI GLGLPILQGY GMTESSPVVC CNTIEDNVPA SVGRPIPGVE VKLGEQNALL IRGPNVMLGY WNNEEATRAV MTPDGWLNSG DIAEIDEAGH IAITGRVKEI IVMSTGEKIP PANMEAAILR DPLFEQVMVV GEGRPHLAVL AVLNSGNWES MADEYHLDRN WRRLGGDPKF EEILLERIAY QIKGFPGYAK IYRIAVVAEP WTVENQMLTP TLKLRRTYVQ NHYKSEVDRL YTK
|
| |