Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1039 |
Symbol | |
ID | 3785166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1200673 |
End bp | 1202043 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811123 |
Product | hypothetical protein |
Protein accession | YP_411734 |
Protein GI | 82702168 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.649912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAC TCATTCGCAC TCTTTTTGTC AGTGTCCTGG CGATGCTGTT TCTGCTGCCG TCAATTGTTC GAGCTTCCCC TGTCGTGGTG TTGAAAATCG ATGGCCCCAT TGCTCCGGCC AGCGCCGATT TCATCCAGCG CGGACTGGAA CGCGCCGCGA ACGAGAACGC GCTGCTTGTG GTGTTACAAC TGGACACGCC GGGAGGCCTC GACACTTCCA TGCGGCAGAT CATCCGCGGT GTCCTGGCTT CACCCTTGCC CGTGGCGACT TTTGTGGCGC CGAGTGGGGC TCGGGCAGCC AGTGCCGGCA CTTATATCCT GTATGCCAGC CATATCGCCG CGATGGCTCC GGGCACCAAC CTGGGTGCCG CAACCCCAAT AGAAATGGGT GGCTTTCCCC GATCAGAGCC TGAACCCAGG CCCCAACCGA AATCAGGCGA GAACCGCGCG CAAAATCCGG AAAAAGAAGG CCAGCTTCCC GTAAAGGACG AAATGTCCCG CAAAATGATT CATGATGCTG CGGCGTACAT ACGCGGCCTG GCGCAAATGC GCGGGCGTAA TGTGGAATGG GCGGAAAGAG CCGTCCGCGA AGCGGTCAGT CTGTCGGCCT CCGAAGCGCT GCACCTCAAA GTTGTCGATT ATATCGCTAC CGACATCGCG GATCTGCTGA AGCAGCTCAA TGGCAGGCAA GTGAACGTAC TGGGCCAGGA CCGTAAGCTC GATACCACTT CCGCCACCAT GGAAGTCGTG GAACCCGACT GGCGCACGCG GCTGCTCGCC ATCATCACCA ATCCAAGCGT CGCCTATGTC CTGATGCTTA TCGGCATTTA CGGACTGTTC TTCGAGTTTG CCAACCCCGG TTTTGTGCTA CCCGGCGTGG CAGGCGCCAT CTGCCTGCTT ATCGCCCTGT ATGCCTTCCA GTTACTGCCG GTGAGCTATG CCGGACTTGC CTTGATCCTG CTCGGGATCG GGTTCATGGT GGCCGAAGTA TTCCTGCCCA GCTTCGGGGC CCTCGGTATC GGCGGAATTA TCGCTTTCGT CGTGGGCTCC CTGATGCTGA TTGACAGTGA GGCGCCCGGT TTCGGTATCC CCTGGACGCT CGTCGGCGGA GTAGCCTTCG CCAGTGCCCT GTTCCTGATT TCAGTAATCG GCATGGCGCT CAAGACCCGC CGGAAACCGT TGCTGAGTGG TCAGGAGCAT ATGGTCGGCT CAGTGGGAGA AATGCTGGAA GACACTTCCG GTGACGGCAT GGCGCGTATC CGCGGCGAGT TGTGGACCGT GCACTCCGCC CAACCGCTGG TTCGGGGCCA GAAGGTGCGT GTGACCGGAA TCGACGGCCT CATCCTGCAT GTAACCGCAG CAGAAAAATA A
|
Protein sequence | MKLLIRTLFV SVLAMLFLLP SIVRASPVVV LKIDGPIAPA SADFIQRGLE RAANENALLV VLQLDTPGGL DTSMRQIIRG VLASPLPVAT FVAPSGARAA SAGTYILYAS HIAAMAPGTN LGAATPIEMG GFPRSEPEPR PQPKSGENRA QNPEKEGQLP VKDEMSRKMI HDAAAYIRGL AQMRGRNVEW AERAVREAVS LSASEALHLK VVDYIATDIA DLLKQLNGRQ VNVLGQDRKL DTTSATMEVV EPDWRTRLLA IITNPSVAYV LMLIGIYGLF FEFANPGFVL PGVAGAICLL IALYAFQLLP VSYAGLALIL LGIGFMVAEV FLPSFGALGI GGIIAFVVGS LMLIDSEAPG FGIPWTLVGG VAFASALFLI SVIGMALKTR RKPLLSGQEH MVGSVGEMLE DTSGDGMARI RGELWTVHSA QPLVRGQKVR VTGIDGLILH VTAAEK
|
| |