Gene Nmar_1040 details

Gene Information

Locus tag: Nmar_1040
Symbol:
ID: 5773932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 914012
End bp: 916327
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 35%
IMG OID: 641316682
Product: fibronectin type III domain-containing protein
Protein accession: YP_001582374
Protein GI: 161528548
COG category: [C] Energy production and conversion
COG ID: [COG3794] Plastocyanin
TIGRFAM ID:
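
The start/end coordinates, gene length, and protein length listed above can be cross-checked with a short sketch. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(914012, 916327)
print(length, protein_length(length))  # prints: 2316 771
```

Under these assumptions the computed values agree with the 2316 bp and 771 aa reported in the record.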


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTCAT TTTTCACTAT GAATATGAAT AAAAAAATAT TTTTTTTATT TCTTTCATTA 
TCTGTATTAT TTTCATTTAT GTTAAGTGAA GATGCATTTG CTCAAACTAA ACCCGATAGA
GTGAGGGGAC TTTCTGCAAC TGCAATCAGT ACGACCCAGA TTGACTTGTC TTGGAATGAA
CCTTCTGACG GAGGATTGCC AATAACAGGA TACCAAATTG AACGCAAAAA AGCATCAGAC
CCTTGGGAAA TCTACATTGC TGATACTGGC AATACAAACA CAACATATTC TGATCAAGGT
TTAGATCCTG ATACAAGATA TCGTTACAAA GTTGCTGCAA TTAACGCAAT TGATATTGGT
CGTTCATCCA CTGCCAAAGC TGCTACAACT TTTGCAATAA CAGAACCAAA TAGAGTAACA
GGACTCTCGG CCACTACAAT CAGTCCAACA CAAATTGACT TGTCTTGGAA TGAACCATAC
GATGGTGAGT CTCCAATAAC AGGATACCAA ATTGAACGCA AAAAAGCATC AGACCCTTGG
GAAATCTACA TTGCTGATAC TGGCAATACA AACACAACAT ATTCTGATCA AGGTTTAGAT
CCTGATACAA GATATCGTTA CAAAGTTGCT GCAATTAATG CAATTGGGAT TGGAACTGCA
TCCACTGCCA AAGCTGCTAC AACTACTGAG ATAACAGAAC CCGATAGAGT AACAGGACTT
ACTGCGACTG CAATTAGCCA TACTCAGATA AACTTGTCTT GGAATGAACC ATACGATGGT
GAGTCTCCAA TAACAGGATA CCAAATTGAA CGCAAAAAAG CATCAGACCC TTGGGAAATC
TACATTGCTG ATACTGGCAA TACAAACACA ACATATTCTG ATCAAGGTTT AGATCCTGAT
ACAAGATATC GTTACAAAGT TGCTGCAATT AACGCAATTG ATATTGGTCG TTCATCAACT
ATTGTAACTG AGACTACGTT ACTACCTGGT GTTTATTTGC CCCCTGTTGA TACCTCTACA
GGCACGTCAA AAATTAGACC TCCTCCACAA ATTCAAGGAA TTGGTCTTTA CAAATTCACA
ACACATGTTG GTGATGATGG TGATACCAAA GAATATGATG CTCCTCTTAA TCTCCCATTT
GATCAATATT TCCCATACAG TAAATTTTCA GATGAAACTG ATTTTAAGAA TTACAAAGAA
ATGGGAGCGT ATAAAAAATT AGGACTATAT CATGATTTTT CTAAAAAAAC ACTAGCTCCT
AAATTCTTTG CAGAAACAAA TCAACCAGTT CAACTTCAAA TTCGTTTATG GGATATGCTT
TCAAGTTCAA AAATTGAACA TCTTTCATTA TACACTTATT CTACATCCTC TACAACTGTA
GAAAACAGTG ACGTTGAAAT TATTTTTGAT AAAGGAAAAC CTCTTGATGT TATAGATCCT
AATGATATTT TCAAGTATGT TGAAGTATAT CCTTCTTATG AAGATGAATG GTTATGGATT
AATCTTGATT TGATGTTTCA AAAGCCTATG ACTTCATCTA ATATTCTACT GCAATCCTGG
CATGAATCTA GAATCCCTTC ATTTGTTCAA GTTGATGATA TTTGGGAAAT CTCTAATCCT
CAATCAAATA CTGATCATGT TGATGAGGTT AATCTTACTG AAGAAGTTGA AATTACTCAT
GGTACATCAA ATCCCACATG TAAAATGGAT GATTCATGTT TTACTCCCTC CGATGCAAAA
ATCTTAGAAG GGGGAATTAT AACTTGGTTT AACACCGATT CTTTTACACA CACTGTAACT
AGTGGTTCTG TTAATAATAA TGATAATAGA TTTGGATACA TTTTGTTTCC TGGTCAAACA
GTACAACATG AATTTCCTTA CAAAGGAATC TATGATTATT ATTGTGCACT TCATCCTTGG
GCTAATGGCT CTGTCATTGT ATATGGTGCA GATTTTGAAA AACCAGAGAC TAGTTTTGAT
GAATCACAAC CAACATTACT TGTAAAATCT ACCTCTGGGG GCTCGTTAAT AATTGAAAAT
AATGATGTGT ATGTTACTTC GTCAAGGGAT CTCCACATGA ATATCTCTGG ACATATTCAA
GAACTATCTA CATCAAATAC TGTTAAAATC ATAATTATTC ATCCAGATAA AATCACAAAA
CACATGACTG CTATTGTAAA TAGTGATGGA TTTTATTCCA TGCCTGTAGT TCTTAACAAA
CACTGGATGG AAGGAACCTA CGAAATAATT ACTGAATATC GTGGTGAACA AGTGGCACAA
TTATCTTTTC TAGTATCTGA CAAACCTGTA AGATGA
 
Protein sequence
MRSFFTMNMN KKIFFLFLSL SVLFSFMLSE DAFAQTKPDR VRGLSATAIS TTQIDLSWNE 
PSDGGLPITG YQIERKKASD PWEIYIADTG NTNTTYSDQG LDPDTRYRYK VAAINAIDIG
RSSTAKAATT FAITEPNRVT GLSATTISPT QIDLSWNEPY DGESPITGYQ IERKKASDPW
EIYIADTGNT NTTYSDQGLD PDTRYRYKVA AINAIGIGTA STAKAATTTE ITEPDRVTGL
TATAISHTQI NLSWNEPYDG ESPITGYQIE RKKASDPWEI YIADTGNTNT TYSDQGLDPD
TRYRYKVAAI NAIDIGRSST IVTETTLLPG VYLPPVDTST GTSKIRPPPQ IQGIGLYKFT
THVGDDGDTK EYDAPLNLPF DQYFPYSKFS DETDFKNYKE MGAYKKLGLY HDFSKKTLAP
KFFAETNQPV QLQIRLWDML SSSKIEHLSL YTYSTSSTTV ENSDVEIIFD KGKPLDVIDP
NDIFKYVEVY PSYEDEWLWI NLDLMFQKPM TSSNILLQSW HESRIPSFVQ VDDIWEISNP
QSNTDHVDEV NLTEEVEITH GTSNPTCKMD DSCFTPSDAK ILEGGIITWF NTDSFTHTVT
SGSVNNNDNR FGYILFPGQT VQHEFPYKGI YDYYCALHPW ANGSVIVYGA DFEKPETSFD
ESQPTLLVKS TSGGSLIIEN NDVYVTSSRD LHMNISGHIQ ELSTSNTVKI IIIHPDKITK
HMTAIVNSDG FYSMPVVLNK HWMEGTYEII TEYRGEQVAQ LSFLVSDKPV R
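
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a block back into a plain string and computing its GC fraction follows; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGCTCAT TTTTCACTAT GAATATGAAT AAAAAAATAT TTTTTTTATT TCTTTCATTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the same helpers give values that can be compared with the gene length and GC content reported in the record.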