Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0682 |
Symbol | |
ID | 5773734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 621596 |
End bp | 623224 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641316318 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001582016 |
Protein GI | 161528190 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGTCTC AATCTAGATT AGAAATCCAA GGTCAGGCAC CATTCAAGCA ATCTGGCGTC TATGAGGCTC AAATTGCAAT CGTTGGCAGC CGACCTTCTG ATGCACAAAT TAAATCTAAA CAATTCACAT CTCTTTGGCG TGGAAATTTC CATCTACGAG TTAAAGATGG AGTATTTTCT GAAACAATTG GTTCTCCTGA AAACCCAATT CCATCATCTG TCTTAGAATT AGAAACAATT TGGATAGTTG TAACTGATCT GTTTTCATCA CTACATTCTG TATTTGATGT CCCATTATCA AAACCAAAAT CTTCCCCAAA ACCTCCTGAA TCAAAACCTT CTGAAACTAA ATCAAATCTT GAAACTCCAA AACAAACACG TTCAACAAAA TCAACTCAAA GCACTCCTGG CGTACGAGGT AGCCCTGGTG AAAAAGGCGC ACCTGGATTA CAAGGTCCAA CTGGTGACAA AGGTCCACAA GGTCCTCCCG GCCCAACTGG TCCTCCCGGT GACAAAGGTC CTGACGGTCC ACAAGGTCCT CCCGGTGACA AAGGTCCAAC TGGTGACAAA GGCCCTGCAG GTGACAAAGG TATTGCTGGA GATAAAGGAA TTACTGGTGA CAAAGGAATT ACTGGAGATA AAGGTGATAA AGGTGACAAA GGAATTCCTG GTCCCGTTGG TGATAAAGGT GACAAAGGAG CAACTGGTCC AATCGGAGAA AAAGGTCCAA CTGGATTAAA AGGCCCAATC GGTGAAAAAG GAGAAAAAGG TCCAACTGGT CCTCCCGGCG ACAAAGGATT ATCTGGATTA AAAGGTCCTC CCGGTGAAAA AGGAGAAAAA GGTCCAACTG GTCCTCCCGG CGATAAAGGT TTAACTGGTC CTGTCGGAAC TCCTGGAGAA AAAGGTCCAA CCGGACAAGC AGGAGTTCAA GGAGAAAAAG GAATTCAAGG TGGTCCTGGT CCAATCGGAG AAAAAGGTCC AGCAGGTCCA GTTGGTGACA AAGGTCCAAT CGGTCCAGCA GGTCCTCTTG GCGACAAAGG ATTATCTGGT CCAACCGGAG TTCCTGGTGA CAAAGGTCCA ATCGGTCCTC CTGGTCCAAT CGGAGAAAAA GGTCCTAAAG GAACTGAAGG TCCAATCGGA GAAAAAGGTC CACAAGGTCC ACAAGGTCCA GCTGGTGCTA AAGGATTAAC AGGCGTTCCT GGTCCACAAG GTGAAAAAGG AGAAAAAGGT CCAACAGGTC CGCCCGGAGA AAAAGGATTA ACAGGTCCAG CAGGTCCACC TGGAGAAATT GGTACAGTTG GTCCACAAGG TTCTCAAGGA GAACGCGGTC CAACAGGTCC ATCAGGAGAA AAAGGTCCAC AAGGTCCACA AGGAATTCAA GGTCCGCAAG GAGAACGCGG CCCAACTGGT CCAATCGGTT CTATTGGTGA AGCAGGTCCT CGTGGAGAAC AAGGCCCATT AGGTCCAGCA GGACCAAGAG GTCCACCCGG TCCACCAGGC GAAAAAGGTC CATCAGGTGG AATGTCTTCT GAACAAAAAG CATTGTTCAA AGAATTGCTA GAAATACTAA CTGAGAAAAA TATCATTAGC ACTGAAGAAC AAATCAAACT AATGAGTTAT CTTTACTAG
|
Protein sequence | MSSQSRLEIQ GQAPFKQSGV YEAQIAIVGS RPSDAQIKSK QFTSLWRGNF HLRVKDGVFS ETIGSPENPI PSSVLELETI WIVVTDLFSS LHSVFDVPLS KPKSSPKPPE SKPSETKSNL ETPKQTRSTK STQSTPGVRG SPGEKGAPGL QGPTGDKGPQ GPPGPTGPPG DKGPDGPQGP PGDKGPTGDK GPAGDKGIAG DKGITGDKGI TGDKGDKGDK GIPGPVGDKG DKGATGPIGE KGPTGLKGPI GEKGEKGPTG PPGDKGLSGL KGPPGEKGEK GPTGPPGDKG LTGPVGTPGE KGPTGQAGVQ GEKGIQGGPG PIGEKGPAGP VGDKGPIGPA GPLGDKGLSG PTGVPGDKGP IGPPGPIGEK GPKGTEGPIG EKGPQGPQGP AGAKGLTGVP GPQGEKGEKG PTGPPGEKGL TGPAGPPGEI GTVGPQGSQG ERGPTGPSGE KGPQGPQGIQ GPQGERGPTG PIGSIGEAGP RGEQGPLGPA GPRGPPGPPG EKGPSGGMSS EQKALFKELL EILTEKNIIS TEEQIKLMSY LY
|
| |