Gene Nmar_1526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1526 
Symbol 
ID5774293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1386804 
End bp1388972 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content39% 
IMG OID641317177 
Productthrombospondin type 3 repeat-containing protein 
Protein accessionYP_001582860 
Protein GI161529034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000258397 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAACAAT ATATTTTAGG ATTTTTAATT TTACTTACCA TAACTATTGG AATGACACCA 
AACATTGTTT TTGCAAATAC TACAATTGAC ACTGATGGTG ATGGTGTTCC AAATGCTGTT
GATTTTTGTC CTCATCTCTT AGAGGACTAT GATCCTGAAT ACGGTAACAA CATTGACGGC
TGTCCTGCAG ACTTTGTTCC TTGGTATGAT GCTGATTATG ATGGTATCCA AGATCATATC
GATAGATGTC CGACCGTAAA AGAAACTTAC AACAAGTTCC AAGACACTGA CGGTTGTCCT
GATTTGTCCC CTGATGGTGA TACTGGAATT GCTGACTCTG ATGGTGACGG CTTTCCTGAT
TATCTAGACT TGTGTCCAAA TAGACCTGAA ACATTTAATG GAATTGATGA CACTGACGGT
TGTCCTGATG ATGATCATTC TCTACTGGAT CGTGATCAAG ATGGAATTTC TGACGGTAAA
GATGCTTGTC CACTAGAGCC TGAAACTTAC AACTTTTATC AAGATTCTGA CGGCTGTCCT
GACTCTGTTG ACACTACAAC TTCTCTTTAT CAATTTCCAG ATACTGATGG TGATGGAATA
GACGATAGAT GGGATCAATG TCTAAACGAA CCTGAAAACT ATAACAATTT CCAGGATCAA
GATGGTTGTA TTGATATTCC AGGTGTAGAT TCAGAAGGCT TTATCGATTC TGATTTTGAT
AGTATTGGTG ATGATGTTGA TGCTTGTCCA TTAGAACGTG AAAATTACAA CAAGTTCCAA
GATTCTGACG GCTGTCCAGA TGTCTTACAA TTGCCAATTT CTGGTGATGC AGATGGTGAT
GGTTTGTTAA ATGCAAATGA TGCATGTCCA TACAGTCCTG AAACTTACAA CAAGCTCCAA
GATTCTGACG GTTGCCCTGA TTCTCTTACA GATGGCTTTA CTGCCTATGA CTCTGATGGT
GATGGCATTA TTGATAATTT AGATTGGTGT CCAAATCAAC CTGAAACATT TAATGGATTC
CAAGATTCTG ACGGTTGCCC AGATTATTCT ATTTCCACAC TTGACTCTGA TAGAGATGGT
GTTCCAAATG TCTCAGACTC TTGTCCACTA GAGCCTGAAA CTTACAACTT TTATCAAGAT
TCTGACGGCT GTCCAGACTC TGTTGACGGT GTTTTGTTTT CTTATACATT CCCAGATGCA
GATGGTGATG GAATAGATGA TAGATGGGAT GCATGTCTTG ATGAACAAGA GAACTTTAAC
GGATTTTTAG ATTCTGACGG CTGTCCAGAT ACTCCAGGAA TTTCAAAATC TTCATTACTT
GATACTGATT ATGATCATAT CCCTGACGTT CGTGATTCAT GTCCTACTAT TGCAGAAAAT
TACAACAAAT TCCAAGATGA AGATGGATGC CCTGATACAA TAGAACATGA TTCATTTGGA
GATTCTGATG GAGATGGAAT AATTGATAAA ATGGATCAAT GTCCTACCGC AAAAGAAACT
TACAATAAAT TCCAAGATAC TGACGGTTGT CCTGATTCTC TTACAGATGG CTTTACTGCC
TATGACTCTG ATGGTGATGG CATTATTGAT AATTTAGATC TCTGTCCAAC ACAACCTGAA
ACTTACAATA AATTCCAAGA TACTGACGGT TGTCCTGATG ATTCTCGATC TACACTTGAT
TCAGACATGG ATGGAATTCC AAATGTTTTA GACTCTTGCC CACTAGAGCC TGAAACTTAT
AATTTTTATC AAGATACTGA CGGTTGCCCT GATTCTACTG GTACTGTGAC TTCATCCTAT
TCTTTCCCAG ATGCTGATGG TGATGGAATA GATGATAGAT GGGATGCATG TCTTGATGAA
CAAGAGAACT TTAACGGATA CTTGGATTGG GATGGCTGCC CAGATATATT GGCTGCAGCA
CCAACAGCAC CAACAAAATT TGACTCTGAT GGTGACGGAT TCTATGATTC AATTGATTCT
TGTCCATCAA AACCAGAAAC CTGGAATAAA TACAACGATG ATGATGGTTG TCCAGATATT
GCCCCAGAAC AACAAAGATT TGTCCATGAT GATGATCTAG ATGACATCAT TAATGATGAA
GACTTGTGTC CACTTGATCC TGAAGACTAT GATGGTGACA GAGACACTGA CGGTTGTCCA
GATAACTGA
 
Protein sequence
MKQYILGFLI LLTITIGMTP NIVFANTTID TDGDGVPNAV DFCPHLLEDY DPEYGNNIDG 
CPADFVPWYD ADYDGIQDHI DRCPTVKETY NKFQDTDGCP DLSPDGDTGI ADSDGDGFPD
YLDLCPNRPE TFNGIDDTDG CPDDDHSLLD RDQDGISDGK DACPLEPETY NFYQDSDGCP
DSVDTTTSLY QFPDTDGDGI DDRWDQCLNE PENYNNFQDQ DGCIDIPGVD SEGFIDSDFD
SIGDDVDACP LERENYNKFQ DSDGCPDVLQ LPISGDADGD GLLNANDACP YSPETYNKLQ
DSDGCPDSLT DGFTAYDSDG DGIIDNLDWC PNQPETFNGF QDSDGCPDYS ISTLDSDRDG
VPNVSDSCPL EPETYNFYQD SDGCPDSVDG VLFSYTFPDA DGDGIDDRWD ACLDEQENFN
GFLDSDGCPD TPGISKSSLL DTDYDHIPDV RDSCPTIAEN YNKFQDEDGC PDTIEHDSFG
DSDGDGIIDK MDQCPTAKET YNKFQDTDGC PDSLTDGFTA YDSDGDGIID NLDLCPTQPE
TYNKFQDTDG CPDDSRSTLD SDMDGIPNVL DSCPLEPETY NFYQDTDGCP DSTGTVTSSY
SFPDADGDGI DDRWDACLDE QENFNGYLDW DGCPDILAAA PTAPTKFDSD GDGFYDSIDS
CPSKPETWNK YNDDDGCPDI APEQQRFVHD DDLDDIINDE DLCPLDPEDY DGDRDTDGCP
DN