Gene Nmar_0682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0682 
Symbol 
ID5773734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp621596 
End bp623224 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content47% 
IMG OID641316318 
Producttriple helix repeat-containing collagen 
Protein accessionYP_001582016 
Protein GI161528190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGTCTC AATCTAGATT AGAAATCCAA GGTCAGGCAC CATTCAAGCA ATCTGGCGTC 
TATGAGGCTC AAATTGCAAT CGTTGGCAGC CGACCTTCTG ATGCACAAAT TAAATCTAAA
CAATTCACAT CTCTTTGGCG TGGAAATTTC CATCTACGAG TTAAAGATGG AGTATTTTCT
GAAACAATTG GTTCTCCTGA AAACCCAATT CCATCATCTG TCTTAGAATT AGAAACAATT
TGGATAGTTG TAACTGATCT GTTTTCATCA CTACATTCTG TATTTGATGT CCCATTATCA
AAACCAAAAT CTTCCCCAAA ACCTCCTGAA TCAAAACCTT CTGAAACTAA ATCAAATCTT
GAAACTCCAA AACAAACACG TTCAACAAAA TCAACTCAAA GCACTCCTGG CGTACGAGGT
AGCCCTGGTG AAAAAGGCGC ACCTGGATTA CAAGGTCCAA CTGGTGACAA AGGTCCACAA
GGTCCTCCCG GCCCAACTGG TCCTCCCGGT GACAAAGGTC CTGACGGTCC ACAAGGTCCT
CCCGGTGACA AAGGTCCAAC TGGTGACAAA GGCCCTGCAG GTGACAAAGG TATTGCTGGA
GATAAAGGAA TTACTGGTGA CAAAGGAATT ACTGGAGATA AAGGTGATAA AGGTGACAAA
GGAATTCCTG GTCCCGTTGG TGATAAAGGT GACAAAGGAG CAACTGGTCC AATCGGAGAA
AAAGGTCCAA CTGGATTAAA AGGCCCAATC GGTGAAAAAG GAGAAAAAGG TCCAACTGGT
CCTCCCGGCG ACAAAGGATT ATCTGGATTA AAAGGTCCTC CCGGTGAAAA AGGAGAAAAA
GGTCCAACTG GTCCTCCCGG CGATAAAGGT TTAACTGGTC CTGTCGGAAC TCCTGGAGAA
AAAGGTCCAA CCGGACAAGC AGGAGTTCAA GGAGAAAAAG GAATTCAAGG TGGTCCTGGT
CCAATCGGAG AAAAAGGTCC AGCAGGTCCA GTTGGTGACA AAGGTCCAAT CGGTCCAGCA
GGTCCTCTTG GCGACAAAGG ATTATCTGGT CCAACCGGAG TTCCTGGTGA CAAAGGTCCA
ATCGGTCCTC CTGGTCCAAT CGGAGAAAAA GGTCCTAAAG GAACTGAAGG TCCAATCGGA
GAAAAAGGTC CACAAGGTCC ACAAGGTCCA GCTGGTGCTA AAGGATTAAC AGGCGTTCCT
GGTCCACAAG GTGAAAAAGG AGAAAAAGGT CCAACAGGTC CGCCCGGAGA AAAAGGATTA
ACAGGTCCAG CAGGTCCACC TGGAGAAATT GGTACAGTTG GTCCACAAGG TTCTCAAGGA
GAACGCGGTC CAACAGGTCC ATCAGGAGAA AAAGGTCCAC AAGGTCCACA AGGAATTCAA
GGTCCGCAAG GAGAACGCGG CCCAACTGGT CCAATCGGTT CTATTGGTGA AGCAGGTCCT
CGTGGAGAAC AAGGCCCATT AGGTCCAGCA GGACCAAGAG GTCCACCCGG TCCACCAGGC
GAAAAAGGTC CATCAGGTGG AATGTCTTCT GAACAAAAAG CATTGTTCAA AGAATTGCTA
GAAATACTAA CTGAGAAAAA TATCATTAGC ACTGAAGAAC AAATCAAACT AATGAGTTAT
CTTTACTAG
 
Protein sequence
MSSQSRLEIQ GQAPFKQSGV YEAQIAIVGS RPSDAQIKSK QFTSLWRGNF HLRVKDGVFS 
ETIGSPENPI PSSVLELETI WIVVTDLFSS LHSVFDVPLS KPKSSPKPPE SKPSETKSNL
ETPKQTRSTK STQSTPGVRG SPGEKGAPGL QGPTGDKGPQ GPPGPTGPPG DKGPDGPQGP
PGDKGPTGDK GPAGDKGIAG DKGITGDKGI TGDKGDKGDK GIPGPVGDKG DKGATGPIGE
KGPTGLKGPI GEKGEKGPTG PPGDKGLSGL KGPPGEKGEK GPTGPPGDKG LTGPVGTPGE
KGPTGQAGVQ GEKGIQGGPG PIGEKGPAGP VGDKGPIGPA GPLGDKGLSG PTGVPGDKGP
IGPPGPIGEK GPKGTEGPIG EKGPQGPQGP AGAKGLTGVP GPQGEKGEKG PTGPPGEKGL
TGPAGPPGEI GTVGPQGSQG ERGPTGPSGE KGPQGPQGIQ GPQGERGPTG PIGSIGEAGP
RGEQGPLGPA GPRGPPGPPG EKGPSGGMSS EQKALFKELL EILTEKNIIS TEEQIKLMSY
LY