Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1143 |
Symbol | |
ID | 3784256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1313896 |
End bp | 1315143 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811228 |
Product | cupin region |
Protein accession | YP_411838 |
Protein GI | 82702272 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000830013 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCAC CGTTGCCCCG CTGTATGTCG TGCAGCCATT GTTACCTTCA CCAAAACGCT CACACCGGTA ATCCCATGAC GAAAATCCGG CTTTTAGGCG GCCTCTCGCC CAGCGACTTC CTTCAGGATC ATTGGCAGAA AAAACCTTTG CTGATACGCA AAGCCTTGCC GGATTTCAGC GGACTGCTGG ATGCCAATGA GCTTATCGAC CTGGCCTGTC AGGAAGATGC GCAATCGCGT CTGGTTACCC GTAGAAACGG CCGATGGGAG GTGAGGCATG GCCCCTTCGC ACCTCGCGCT TTCGCACGGC TGCCGCAGAA AGGCTGGACT CTGCTGGTGC AGGACGTCAA TCACTTCCTT CCGGCGGCGC GTGAACTGCT GCTGAAATTC AACTTCATTC CACATTCCCG GCTCGATGAT CTGATGGTCA GCTACGCTCC CGAAGACGGG GGCGTGGGGC CCCACTTTGA CTCCTACGAC GTTTTTCTGC TGCAAGGAAC AGGCCGCAGA CGCTGGCGAA TATCGGGCCA GAAGGACAGA ACGCTGGTGG CCGCCGCACC GCTCAAGATT CTGCAGGATT TCAGGCCGGA GCAGGAATGG GTACTGGAAC CAGGCGACAT GCTGTATTTG CCGCCCGGCT ATGCGCACGA TGGAGTTGCG GTGGAACCCT GCATGACCTA TTCCATCGGT TTTCGCGCAC CCACCTATCA GGAGCTCGCG ATGCAGTTTC TCGTTCATCT CCAGGACAGC TGTGAAATAG CGGGTATCTA CGAGGATCCG GATCTCAGGA TTCAAACTCA TCCCGGACAA ATCAGTTCCG CGATGCTGGA TCAGGTCAAC GCGGCGCTCG ACAAAATCGA GTGGGACAAC GTTGAAGTGG AACGTTTTAT CGGTATGTAT TTAACCGAAC CCAAACCTCA CGTTTTTTTT ATGCCTCCTC AGGAGCCGAT ATCCGAACGG AAATTCGTGC ATCAGATAAG AAAAGGAAAA CTGCAACTGG ATCTGAAAAG CCGCATGCTC TTCAGGGAAA ACAGAATTTT CCTGAACGGA GACGTATATG AAGTAGGAAA AACCGCACAA CGGATACTAG GAGAGCTGGC CGATCGTCTT GCCTTATCCC CTGTGAGAGA TATCGATGCC GAAACACAGG CGCTGCTATA TCAGTGGTAC CTCGATGGTT ACGTCGTTTA TGTTGAAGAT ACCGGGGCAG TTGAGGAAAT CCAGGAATCG ATAATAGAAA GAAAGTAA
|
Protein sequence | MIAPLPRCMS CSHCYLHQNA HTGNPMTKIR LLGGLSPSDF LQDHWQKKPL LIRKALPDFS GLLDANELID LACQEDAQSR LVTRRNGRWE VRHGPFAPRA FARLPQKGWT LLVQDVNHFL PAARELLLKF NFIPHSRLDD LMVSYAPEDG GVGPHFDSYD VFLLQGTGRR RWRISGQKDR TLVAAAPLKI LQDFRPEQEW VLEPGDMLYL PPGYAHDGVA VEPCMTYSIG FRAPTYQELA MQFLVHLQDS CEIAGIYEDP DLRIQTHPGQ ISSAMLDQVN AALDKIEWDN VEVERFIGMY LTEPKPHVFF MPPQEPISER KFVHQIRKGK LQLDLKSRML FRENRIFLNG DVYEVGKTAQ RILGELADRL ALSPVRDIDA ETQALLYQWY LDGYVVYVED TGAVEEIQES IIERK
|
| |