Gene Nmar_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0477 
Symbol 
ID5774452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp429251 
End bp430828 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content36% 
IMG OID641316109 
ProductNHL repeat-containing protein 
Protein accessionYP_001581811 
Protein GI161527985 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.176502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAACAA TTTCACTTGC TGTAGCCGTA ATTACTGCAA TTTTGCTATC TGGAACTTTT 
GCTCCATCTT CATATGCATT AGGTGATTAT GATTTTCTTG CTGGATGGGG TGAGTTTGGA
ATATCTACTC CTGGACACCT TTCACACCCA CAATTTATCG CAGTTGATGA TGAAGGGAAT
GCATACATTA GTGATTTAGG AAATAAACGT GTACAAAAAT TCTCCAGCTC TGGTGAGTTT
ATTCTTAACT TTGGAGAAAG TGGCAAATCC TCTGGCCAAT TCCATCACCC TTCCGGTGTT
GCAGTTGATT CTGATTTTGT CTATGTGGCA GATCAAAATT TGCATAAAAT TCAAAAATTT
ACCCTTGATG GAGTGTTTGT AGATGAATGG GGAAAATACG GTAACCAAGA TGGTCAATTC
AAGTCCCCAA AAGATATTGC AGTAGACTCT GATTTTCTTT ATGTTGTAGA TGCTGATAAC
TATCGAATCC AGAAATTTAC TACTGATGGA GAATTTGTAT TATCTTTTGG TTCTGGTGGT
ATGAATCATG ATCAATTTCT TATTTTATCT GGAATAGCAG TTGATGATGA TGGAAATATC
TACATCACAG ACAAAGGAAA CCGTAAAATT GAAAAATTTA CTTCTGATGG AATTTTGATT
AAATCTTATC CACTATTTGG TACAAACTAT GTATTTGCTC CTACGGGAAT CACTGTTGAT
TCTGATGGGA AAATATTTGT CATAAACTCT GCAGAAAACA GGATTTTGTA TCTTGAACTT
GATGATAATT TACGTCTAAG TGTATTTGAG CAACTTGGAC CATTTGGTAA CTCCTTTATT
GCTCCAACTG ATCTTACCTT TGGATTTCAA GGTAACTTGT TGATAGTAGA TTCTGCTGCT
CATAAAGTAA AATCCTTTGA AACCCCATTT TATGATGAAA CAAAAGTTTT TCAAACAACT
GAAATTATTG CACCAGAAAT AACTGAAGGT TATGAATCTG ATGATATTGA TCCAACAATT
ATGGCTCCAA GTGACATAAA ATTAGAAGCA ACTGATCTAT TTACTCCTGT ACCTATTGGT
GATGCTGTTG CAAATGACTT GCAGAGTGGA ATCAAAACTA TTTTGAATAA TGCTCCTGAG
GCATTCTCTT TGGGTGTAAA CAAGGTAACT TGGGTAGCAT TTGATAATGC TGGAAACACT
GCTGAAGACT ATCAAACAGT TACAGTCTTT GCATGTGGTC ATGTCTATTC TGATTACAAC
ATGATAGTTG GAACTGATGA GAATGATGTT CTTCTGGGAA CCTCTGGTGA TGATCTAATC
TTTGGGTTAG AAGGAAATGA TATCATTAGT GGGTTAGAAG GAAATGACTG TATCTTTGGC
GGTGATGGTG ATGATATTGT CTATGGCGAT GATGGATATG ACACCATTAG TGGAAACGGT
GGACATGATG TCCTCAAAGG AGATTCTGGA TCTGATGTAA TCTATGGTGG ATCAGGCTCT
GATGTACTTG ATGGCGGTTC TGAGAATGAC AACTGCTATG ATTCACTAGA AAATGTTGTT
CTTAATTGCA ACGAATAG
 
Protein sequence
MKTISLAVAV ITAILLSGTF APSSYALGDY DFLAGWGEFG ISTPGHLSHP QFIAVDDEGN 
AYISDLGNKR VQKFSSSGEF ILNFGESGKS SGQFHHPSGV AVDSDFVYVA DQNLHKIQKF
TLDGVFVDEW GKYGNQDGQF KSPKDIAVDS DFLYVVDADN YRIQKFTTDG EFVLSFGSGG
MNHDQFLILS GIAVDDDGNI YITDKGNRKI EKFTSDGILI KSYPLFGTNY VFAPTGITVD
SDGKIFVINS AENRILYLEL DDNLRLSVFE QLGPFGNSFI APTDLTFGFQ GNLLIVDSAA
HKVKSFETPF YDETKVFQTT EIIAPEITEG YESDDIDPTI MAPSDIKLEA TDLFTPVPIG
DAVANDLQSG IKTILNNAPE AFSLGVNKVT WVAFDNAGNT AEDYQTVTVF ACGHVYSDYN
MIVGTDENDV LLGTSGDDLI FGLEGNDIIS GLEGNDCIFG GDGDDIVYGD DGYDTISGNG
GHDVLKGDSG SDVIYGGSGS DVLDGGSEND NCYDSLENVV LNCNE