Gene Information |
Locus tag | Nmar_1526 |
Symbol | |
ID | 5774293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1386804 |
End bp | 1388972 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641317177 |
Product | thrombospondin type 3 repeat-containing protein |
Protein accession | YP_001582860 |
Protein GI | 161529034 |
COG category | |
COG ID | |
TIGRFAM ID | |
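The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 1386804
end_bp = 1388972
protein_length_aa = 722


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced, complete CDS."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(start_bp, end_bp)
print(length)                           # 2169
print(expected_protein_length(length))  # 722
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields in the record.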
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00000000258397 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACAAT ATATTTTAGG ATTTTTAATT TTACTTACCA TAACTATTGG AATGACACCA AACATTGTTT TTGCAAATAC TACAATTGAC ACTGATGGTG ATGGTGTTCC AAATGCTGTT GATTTTTGTC CTCATCTCTT AGAGGACTAT GATCCTGAAT ACGGTAACAA CATTGACGGC TGTCCTGCAG ACTTTGTTCC TTGGTATGAT GCTGATTATG ATGGTATCCA AGATCATATC GATAGATGTC CGACCGTAAA AGAAACTTAC AACAAGTTCC AAGACACTGA CGGTTGTCCT GATTTGTCCC CTGATGGTGA TACTGGAATT GCTGACTCTG ATGGTGACGG CTTTCCTGAT TATCTAGACT TGTGTCCAAA TAGACCTGAA ACATTTAATG GAATTGATGA CACTGACGGT TGTCCTGATG ATGATCATTC TCTACTGGAT CGTGATCAAG ATGGAATTTC TGACGGTAAA GATGCTTGTC CACTAGAGCC TGAAACTTAC AACTTTTATC AAGATTCTGA CGGCTGTCCT GACTCTGTTG ACACTACAAC TTCTCTTTAT CAATTTCCAG ATACTGATGG TGATGGAATA GACGATAGAT GGGATCAATG TCTAAACGAA CCTGAAAACT ATAACAATTT CCAGGATCAA GATGGTTGTA TTGATATTCC AGGTGTAGAT TCAGAAGGCT TTATCGATTC TGATTTTGAT AGTATTGGTG ATGATGTTGA TGCTTGTCCA TTAGAACGTG AAAATTACAA CAAGTTCCAA GATTCTGACG GCTGTCCAGA TGTCTTACAA TTGCCAATTT CTGGTGATGC AGATGGTGAT GGTTTGTTAA ATGCAAATGA TGCATGTCCA TACAGTCCTG AAACTTACAA CAAGCTCCAA GATTCTGACG GTTGCCCTGA TTCTCTTACA GATGGCTTTA CTGCCTATGA CTCTGATGGT GATGGCATTA TTGATAATTT AGATTGGTGT CCAAATCAAC CTGAAACATT TAATGGATTC CAAGATTCTG ACGGTTGCCC AGATTATTCT ATTTCCACAC TTGACTCTGA TAGAGATGGT GTTCCAAATG TCTCAGACTC TTGTCCACTA GAGCCTGAAA CTTACAACTT TTATCAAGAT TCTGACGGCT GTCCAGACTC TGTTGACGGT GTTTTGTTTT CTTATACATT CCCAGATGCA GATGGTGATG GAATAGATGA TAGATGGGAT GCATGTCTTG ATGAACAAGA GAACTTTAAC GGATTTTTAG ATTCTGACGG CTGTCCAGAT ACTCCAGGAA TTTCAAAATC TTCATTACTT GATACTGATT ATGATCATAT CCCTGACGTT CGTGATTCAT GTCCTACTAT TGCAGAAAAT TACAACAAAT TCCAAGATGA AGATGGATGC CCTGATACAA TAGAACATGA TTCATTTGGA GATTCTGATG GAGATGGAAT AATTGATAAA ATGGATCAAT GTCCTACCGC AAAAGAAACT TACAATAAAT TCCAAGATAC TGACGGTTGT CCTGATTCTC TTACAGATGG CTTTACTGCC TATGACTCTG ATGGTGATGG CATTATTGAT AATTTAGATC TCTGTCCAAC ACAACCTGAA ACTTACAATA AATTCCAAGA TACTGACGGT TGTCCTGATG ATTCTCGATC TACACTTGAT TCAGACATGG ATGGAATTCC AAATGTTTTA GACTCTTGCC CACTAGAGCC TGAAACTTAT AATTTTTATC AAGATACTGA CGGTTGCCCT GATTCTACTG GTACTGTGAC TTCATCCTAT 
TCTTTCCCAG ATGCTGATGG TGATGGAATA GATGATAGAT GGGATGCATG TCTTGATGAA CAAGAGAACT TTAACGGATA CTTGGATTGG GATGGCTGCC CAGATATATT GGCTGCAGCA CCAACAGCAC CAACAAAATT TGACTCTGAT GGTGACGGAT TCTATGATTC AATTGATTCT TGTCCATCAA AACCAGAAAC CTGGAATAAA TACAACGATG ATGATGGTTG TCCAGATATT GCCCCAGAAC AACAAAGATT TGTCCATGAT GATGATCTAG ATGACATCAT TAATGATGAA GACTTGTGTC CACTTGATCC TGAAGACTAT GATGGTGACA GAGACACTGA CGGTTGTCCA GATAACTGA
|
Protein sequence | MKQYILGFLI LLTITIGMTP NIVFANTTID TDGDGVPNAV DFCPHLLEDY DPEYGNNIDG CPADFVPWYD ADYDGIQDHI DRCPTVKETY NKFQDTDGCP DLSPDGDTGI ADSDGDGFPD YLDLCPNRPE TFNGIDDTDG CPDDDHSLLD RDQDGISDGK DACPLEPETY NFYQDSDGCP DSVDTTTSLY QFPDTDGDGI DDRWDQCLNE PENYNNFQDQ DGCIDIPGVD SEGFIDSDFD SIGDDVDACP LERENYNKFQ DSDGCPDVLQ LPISGDADGD GLLNANDACP YSPETYNKLQ DSDGCPDSLT DGFTAYDSDG DGIIDNLDWC PNQPETFNGF QDSDGCPDYS ISTLDSDRDG VPNVSDSCPL EPETYNFYQD SDGCPDSVDG VLFSYTFPDA DGDGIDDRWD ACLDEQENFN GFLDSDGCPD TPGISKSSLL DTDYDHIPDV RDSCPTIAEN YNKFQDEDGC PDTIEHDSFG DSDGDGIIDK MDQCPTAKET YNKFQDTDGC PDSLTDGFTA YDSDGDGIID NLDLCPTQPE TYNKFQDTDG CPDDSRSTLD SDMDGIPNVL DSCPLEPETY NFYQDTDGCP DSTGTVTSSY SFPDADGDGI DDRWDACLDE QENFNGYLDW DGCPDILAAA PTAPTKFDSD GDGFYDSIDS CPSKPETWNK YNDDDGCPDI APEQQRFVHD DDLDDIINDE DLCPLDPEDY DGDRDTDGCP DN
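The gene and protein sequences above can be spot-checked against each other by translating the first few codons. This is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and using the amino-acid assignments of translation table 11, which match the standard code. The `translate` helper is hypothetical, and only the first 30 bases from the record are used.

```python
from itertools import product

# Codon table built in the conventional TCAG order; translation table 11
# uses the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence, as printed in the record.
prefix = "ATGAAACAAT ATATTTTAGG ATTTTTAATT"
print(translate(prefix))  # MKQYILGFLI
```

The result matches the first ten residues of the listed protein sequence. Applying the same function to the full gene sequence should give all 722 residues, provided the two lines of the sequence are joined first.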