Gene Information |
Locus tag | Nmul_A0104 |
Symbol | |
ID | 3786371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 109727 |
End bp | 111196 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637810174 |
Product | Sel1 repeat-containing protein |
Protein accession | YP_410805 |
Protein GI | 82701239 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
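
The length fields in this record are consistent with the coordinates. If Start bp and End bp are read as a 1-based, inclusive interval, the gene length is End minus Start plus 1, and for an unspliced CDS the protein length is the number of codons minus one stop codon. The sketch below checks that arithmetic. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API, and the inclusive-coordinate and single-stop-codon conventions are assumptions that happen to match the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record: Start bp 109727, End bp 111196.
length_bp = gene_length(109727, 111196)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.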
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTAT CCAAATCAGT GGCAACATCC GGGTTCGTAA CGGCTGCCCC GGTATTGATG TTGTTCGCCT CCCTGGCTAT CGCGGGGGAT TTCGAGGATG GGATGAAGTT CGTCCTCAGC AAGGACTATA CCAAGGCAAT GCAATCTTTC CGGAAAGCGG CCAATGCAGG AAATGCTGAC GCCCAGTTCA ATCTGGGCGT GCTGTATTCA CGGGGCCGCG GCGTGCCACA GGATCATGAG CAAGCCGCCA AGTGGTATCG CAGGGCGGCG GAGCAAGGGG ACGCACCGGC ACAATCCATG CTGGGGTATA TGTATCTGAA AGGCCAGGGC GTCCCGCAGG ATTATCAACA GGCAATGTTC TGGTATTTCC GAGCAGCCGA CAGCGGATAT GCGGTGGCGC AATACAATCT CGGGGTAATG TATGCAAAAG GCCAGGGCGT GGAAAAGGAT TATCGGCACG CCCTCTCCTG GTATCTGAAA GCTGCGGAGC AGGGACACGC ACCTGCGCAG GCAATCATGG GATTCATGTA TCTCAAGGGG CAGGGGGTCG AGCAGGATGA CCATCAGGCT GTATCCTGGT ATCGCAAGGC AGCCGAGCAA GGGTATGGCG AAGCGCAATA TGCTCTTGGC GTGCTCTACG CCAAGGGCCG GGGAGTAGCG CAGAGCAACC AGGAAGCCGC CTCCTGGTAC CGCAAGGCTG CTGAGCAGGG GAACACGGAT GCACAGTTCA ATCTCGGCAT GATGTTCGCC ACGGGAGAAG GAGTCACGCA GGATTATCGG CAGGCAGCGT CCTTGTATCG CCAGGCGGCC GATCAGGGAT ATGCGCGGGC CCAGTTCAAA CTCGGGGTGG CAAATGCCAA AGGGCTCGGT ATTCCGGAGG ACGCTTACGA AGCAGCGGCA TGGTACCGCA AGGCGGCCGA GCAGGGCTAT GCTCCTGCCC AGTTCAATCT GGGCGTGATG TATGCGACGG GTAAAGGCGT CATTAGGGAT GAGCGGCAGG CGGTATCATG GTATCGACAG GCGGCCGAGC AAGGAGACCC GGATGCGCAA TATAACCTGG GGGTAAGGTA TGACACGGGA CGGGGCATCG AAAAGGATCC ACAACAGGCA GTAGCCTGGT ATCGCAAGGC GGCAGAGCAA GGCTATGCAC GGGCACAATA CAGCGTGGGC GTGAAGTATG ACAGCGGGCA GGGAGTGCCG CAAGATTACG CGCAGGCGCT AGCCTGGTAC CTGAAGGCCG CGGAGCAGGG GCATGCGGGC GCCCAGACCA ATCTCGGCGT GCTGTATTAC AACGGCAATG GCGTGAAGCA GGATTATGTG GAAGCCGACA AGTGGTTCAG CATCGCCAGC GCCGGCGGCT ACGAGGATGC CAAAGAGAAT CGCGAACTGA TGGAAAAGCT GATGACACCG ATGCAAATCG CCGATGCGCG ACGGGAGGCG GATGAATGGG CAAGAGCACA CCAACGGTAA
|
Protein sequence | MHLSKSVATS GFVTAAPVLM LFASLAIAGD FEDGMKFVLS KDYTKAMQSF RKAANAGNAD AQFNLGVLYS RGRGVPQDHE QAAKWYRRAA EQGDAPAQSM LGYMYLKGQG VPQDYQQAMF WYFRAADSGY AVAQYNLGVM YAKGQGVEKD YRHALSWYLK AAEQGHAPAQ AIMGFMYLKG QGVEQDDHQA VSWYRKAAEQ GYGEAQYALG VLYAKGRGVA QSNQEAASWY RKAAEQGNTD AQFNLGMMFA TGEGVTQDYR QAASLYRQAA DQGYARAQFK LGVANAKGLG IPEDAYEAAA WYRKAAEQGY APAQFNLGVM YATGKGVIRD ERQAVSWYRQ AAEQGDPDAQ YNLGVRYDTG RGIEKDPQQA VAWYRKAAEQ GYARAQYSVG VKYDSGQGVP QDYAQALAWY LKAAEQGHAG AQTNLGVLYY NGNGVKQDYV EADKWFSIAS AGGYEDAKEN RELMEKLMTP MQIADARREA DEWARAHQR
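
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any calculation such as the GC content field. The following is a minimal sketch under stated assumptions: `clean_sequence`, `gc_percent`, and `ends_with_stop` are hypothetical helper names, GC content is taken as the share of G and C among all bases, and the stop codons are those of translation table 11 (TAA, TAG, TGA). It is illustrated on short excerpts of the gene sequence rather than the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# First two blocks and last two blocks of the gene sequence above.
print(gc_percent("ATGCATCTAT CCAAATCAGT"))      # 35.0
print(ends_with_stop("CAAGAGCACA CCAACGGTAA"))  # True
```

Passing the complete Gene sequence value to `gc_percent` would give a figure to compare against the GC content field in the record.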