Gene Nmar_1577 details

Gene Information

Locus tag: Nmar_1577
Symbol:
ID: 5773449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1443189
End bp: 1444811
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 36%
IMG OID: 641317230
Product: thermosome
Protein accession: YP_001582911
Protein GI: 161529085
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
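
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record; the helper names are hypothetical.
start_bp = 1443189
end_bp = 1444811
gene_length_bp = 1623
protein_length_aa = 540


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1623
print(coding_codons(gene_length_bp))   # 540
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.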


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCAA TTCAACAAGG ACCAAATGGA CCTGTTTTAG TGCTCAAAGA GAGTGCATTA 
CAACAAAAAG GTAAAGACGC TCAACAAAAC AACATTGCAG CCGCAAAATT AGTTACAGAA
TTAGTCAAAA GCAGCCTTGG CCCACGTGGT CTAGATAAAA TGTTAGTTGA TTCCTTAGGT
GATGTTACTA TTACAAATGA TGGGGCAACA ATTCTCAAAG AAATTGATGT TCAGCATCCT
GCAGCCAAAA TGATGGTTGA AATTTCAAAA ACTGTTGACA ATGAAGTTGG AGATGGTACA
ACTTCTTCTG TAGTTTTTGG AGGTACTCTT TTGGCTAAAG CCGAAGACCT ACTCAAAAAA
GATGTTCATT CTTCAACAAT TATTGACGGT TATCAAGCTG CTGCAGAAAA AACCCTTGAA
ATCTATTCTG AATTATCAAA GAAAATTAAA CCAGATGACA AAGAATCACT CATTAAAATT
GCTACAACAA GTATGCAATC AAAATTAATC TCAGAAGATA GTGATACATT ATCAAAAATT
GTAGTTGATG CTATTCTTAG CATAGTTACA AAGAAAGGTG AAGATTACTT TGTTGATCTT
GAAAACATAA AAGTTGAAAA GAAATCAGGT GGATCAATTC AAGATACTCA AATTGTTAAA
GGAATTGTTT TAGATAAAGA AATTGTTCAC AGTGGAATGC CTACAAAAAT CGATAAAGCA
AACATTGCTT TGTTAAATTC CGCACTTGAA ATTGAAAAAA CTGAAATGAG TTCTGAAATT
AGAATTTCTG ATCCTACTCA AATGCAGATG TTCTTAGAAG AGGAAAACAG AATGCTCAAG
ACAATGGTTG ACAAACTACA TGATATTGGA GTTAATGTTC TAATTTGCCA AAAAGGTATT
GATGATATTG CACAACATTA TCTTGCCAAA AACGGAATTC TTGCAGTACG TCGTGTAAAA
GAAAGTGATA TGATTAAACT ATCAAAAGCA ACTGGCGGCC GTGTAATTAG TAATATTGAT
GACCTATCTG AAAAAGATCT TGGTTCTGCT AATTTGGTTC ACCAAAAGAA AGTTGAATCT
GACAAATGGG TATTCATTGA AGGATGTAAA CACCCACAAT CAGTTACAAT GTTGATTCGT
GGTGGCTCTC AAAGAGTAAT TGATGAGGTT GACCGCTCTA TTCATGATTC TCTCATGGTA
GTAAAAGACG TAATTGAAAA GCCTGAAATT GTCGCAGGTG GAGGTGCTCC AGAATCATTT
GCAGCATCAC AACTCAAAGA CTGGGCTGAC AATTTTGATG GACGAGAACA ACTTGCAATT
AAGAAATATG CTGAAGCCTT AGAGGTAATT CCATTAACAA TTGCTGAAAA TGCAGGAATG
GATCCAATTG ACACAATGGC AAACTTGAGA GCAAAACAAA ACCAAGGTCG TAAATGGACT
GGTATTGATG CTAAAAACAC AAAGATTGCA GATATGCTTT CTATTGATGT TGTAGAACCA
ATTGCTGTCA AAGAACAGAT TATCAAATCT GCAACAGAAG CTGCATGTAT GATTCTTAGA
ATTGATGATG TCATTGCAGT ATCTGGTGGT CCAGGTGGCG GTGGCATGCC TCCAATGGGA
TAA
 
Protein sequence
MASIQQGPNG PVLVLKESAL QQKGKDAQQN NIAAAKLVTE LVKSSLGPRG LDKMLVDSLG 
DVTITNDGAT ILKEIDVQHP AAKMMVEISK TVDNEVGDGT TSSVVFGGTL LAKAEDLLKK
DVHSSTIIDG YQAAAEKTLE IYSELSKKIK PDDKESLIKI ATTSMQSKLI SEDSDTLSKI
VVDAILSIVT KKGEDYFVDL ENIKVEKKSG GSIQDTQIVK GIVLDKEIVH SGMPTKIDKA
NIALLNSALE IEKTEMSSEI RISDPTQMQM FLEEENRMLK TMVDKLHDIG VNVLICQKGI
DDIAQHYLAK NGILAVRRVK ESDMIKLSKA TGGRVISNID DLSEKDLGSA NLVHQKKVES
DKWVFIEGCK HPQSVTMLIR GGSQRVIDEV DRSIHDSLMV VKDVIEKPEI VAGGGAPESF
AASQLKDWAD NFDGREQLAI KKYAEALEVI PLTIAENAGM DPIDTMANLR AKQNQGRKWT
GIDAKNTKIA DMLSIDVVEP IAVKEQIIKS ATEAACMILR IDDVIAVSGG PGGGGMPPMG
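
The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the Python below strips that whitespace, computes a GC fraction, and translates a nucleotide sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons). The function names are hypothetical, and the sample input is only the first line's first three blocks of the gene sequence.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
first_blocks = "ATGGCATCAA TTCAACAAGG ACCAAATGGA"
print(translate(first_blocks))  # MASIQQGPNG
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.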