Gene Nmar_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0402 
Symbol 
ID5774257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp361717 
End bp363147 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content42% 
IMG OID641316031 
Productpreprotein translocase subunit SecY 
Protein accessionYP_001581736 
Protein GI161527910 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.144908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGG GTACAGTTAC CACCATCATT CGAAAGATGG TCTTTAAAGC AGAACCATAT 
CTTCCTCAAG TTCCAAAACC AAAAAAGAAG ATTCCATTAT CCACTAGATT GCTCTGGTGT
GGAGTTGCAT TACTTATCTA CATGGTAATG GGCCAAACAC CATTATTTGG AGCAACAACC
CCTGAATTTG ACTTCCTAGC ATTTGCTAGA GTAATTTTTG CATCACAACA AGGAACCCTT
GTTGAATTAG GTATTGGGCC GATTGTAACA GCTGGACTCT TGATGCAGTT GTTGAGAGGT
TCAGACATTC TCAAATTTGA CTTTAAGAAA CCAGAGGAAA GAGGTGTATT CCAAACTGCA
ACAAAGATGG TAACTTATGT TGTAATTGTA GCTGAATCTA TTGTATATGG CGTAGCAGTC
TATGGTCCTG GAGTCTCTGA CCCGTCGATA CTGTATGTCA TGGTCGGGCA GCTTATGGCC
GCGTCGATTA TTATCATGTT CTTAGACGAA TTAATTCAGA AAGGCTGGGG CCTTGGTAGT
GGAATCAGTC TCTTCATTAT GGCAGGTGTT GCTCAACAAA TTCTGTGGAG TCTGTTCAGC
CCATTGCCTG CAGGAGATGG AGGAACTATT GGTATCATTC CATACATTGG ACAATCAATC
ATGGCAGGTG ACCTGTCAAA CATCATGTTC CGTTCAAACC AGTTGCCGAG CATCTTCGGT
CTGTGTCTGA CGGCAGGTGT TATACTGATA CTTGTCTTCA CACAAGGAAT GAAGATTGAG
ATTCCAATCG TATCTACAAA ATACAGAGGA TTCTCTGCAG TCTATCCAAT CAAGATGATG
TATGTATCAA ATATTCCAGT TATCTTGGCA TCTGCACTTA CTGCAAACGC CGTCTTTATC
TTCCAGATGT TGTGGGCCAA TGCGAACCCG CGTAACAATA ATTTCTTTAT GAATTTCATA
GCGCAATTCG ACCCGACGAG TCCGTCGACC CCAATTGGTG GTCTTATCTA TTACATCACA
CCACCTAGAG GTCTAGACGT TGCAGCTTTG GATCCTGGAC GTGCAGTTGG CTATGTCCTA
TTCATGATTG GAATTGTAAT TGTATTTGGT AGGTTATGGG TAGAGCTAGG TGGTCTTTCA
CCAAAGAGTG CAGCTCAGAA CTTGCTTGAT GCAGATGTAC AGATTCCTGG ATTTAGAAGA
TCAAACAAAC CCGTTGAAGC ATTGTTGAAC AAGTACATTC CATCAGTCAC CATTATTGGC
TCAGCTATTC TGGGTCTGTT AGCAGGTGCA TCTGATGTCT TGGGTGTATT TGGTTCTGGT
ATCGGAGTTT TACTTATGGT AGATATTCTC ATCAACTATT ACACACAATT AGTTAGAGAA
CAAGTCGAAG TTGTAATGCC GCGATTGGGT GCTTTACTTG GCAGAAAGTA A
 
Protein sequence
MAEGTVTTII RKMVFKAEPY LPQVPKPKKK IPLSTRLLWC GVALLIYMVM GQTPLFGATT 
PEFDFLAFAR VIFASQQGTL VELGIGPIVT AGLLMQLLRG SDILKFDFKK PEERGVFQTA
TKMVTYVVIV AESIVYGVAV YGPGVSDPSI LYVMVGQLMA ASIIIMFLDE LIQKGWGLGS
GISLFIMAGV AQQILWSLFS PLPAGDGGTI GIIPYIGQSI MAGDLSNIMF RSNQLPSIFG
LCLTAGVILI LVFTQGMKIE IPIVSTKYRG FSAVYPIKMM YVSNIPVILA SALTANAVFI
FQMLWANANP RNNNFFMNFI AQFDPTSPST PIGGLIYYIT PPRGLDVAAL DPGRAVGYVL
FMIGIVIVFG RLWVELGGLS PKSAAQNLLD ADVQIPGFRR SNKPVEALLN KYIPSVTIIG
SAILGLLAGA SDVLGVFGSG IGVLLMVDIL INYYTQLVRE QVEVVMPRLG ALLGRK