Gene Nmar_1129 details


Gene Information

Locus tag: Nmar_1129
Symbol:
ID: 5773436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1031846
End bp: 1033210
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 38%
IMG OID: 641316772
Product: hypothetical protein
Protein accession: YP_001582463
Protein GI: 161528637
COG category: [C] Energy production and conversion
COG ID: [COG3794] Plastocyanin
TIGRFAM ID:
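
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values listed in this record, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1031846, 1033210)
print(length)                  # 1365
print(protein_length(length))  # 454
```

Both results match the Gene Length and Protein Length fields of the record.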


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.32866
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCGT TATTTGTAGA GGAGAAATCC TCTGCAAATT TTACAAATGT AAAGTTGCTT 
GAAAAAAATT TAAAGTTTTT ATGTTGTAAT AATTTACAAC ACCTTGTGAA AACGCTAGCA
ATCTTTTCAG TTTTAATGTT GTTTAGTATC ATTTCTTTAT CTCCTGCTTT TGCAGATCAT
TCAGAAGTTA CAGTTGTACC TGCAGATGGT TCTGGCTCAC CTGGTTGTGA AAAAACTGCA
GATGGATGTT ACATTCCAAG TACTGCAACA GTTGATGTTG GCGGTGTTGT AATCATGTCA
AACACTGACA CTGCAGGACA CACATACACT TCAGGAAATC CTGAAGATGG ACCTGATGGT
ATCTTTGATA GTAGTTTGTT GATGACTGGA AATTCATTTG AATGGAGTCC AGATGAAGTT
GGTGAATATG ATTATTATTG TATGGTTCAT CCTTGGATGT TGGGAACAAT AATTGTTCAA
GAAGTATCTG CAGAGGAGGA TGATGTGATG GAGCAACCCA TGAAGGCTGA AATGTTTGGT
TGGGACAGAT TTGAATCAAT GCAAGATCCT GGCGTTGGCC ATGAAGAGCA TCAACTAGCA
ATTTTGTTGG CTCCAAGCGA GAATACCTAT GCTGGAACAT TAAGATATGA TGCATCTGAA
CCTATTCAAC TTGTAAGTTT GAGGGGTCCA CTTGGATCTG ATGAAACTGC TGGAAAAATT
TGGACTCCTG ACGGTAAGAC TAAATTTGAG TTGACCCTAG TTGATCAAGA ATCTTCTTCT
GGCGAATGGG ATTTTTCTGG AAATGCTCTA GCCGTTCATA CTTTTAATAC AAATCAGTTC
GTAGTTGACG TTCAAATTGA TTATGAAGAA ATCCCTCCAC AAAAATCTAT GATGGAAGAA
GACATGATGA AAGATGACTC TATGATGGAA CAAGAAACTA TGATGGCAGA TGATTCTGTG
ATGGAAACAA CATCTGATGA AGCAGAGTCT AGTGGTGGTG GGTGTTTGAT TGCAACAGCA
GCATATGGAA CAGAACTGGC ACCACAAGTT CAATTCTTAA GAGAAATTCG AGATAACACT
GTAATGAGTA CGGCATCTGG TGCATCCTTT ATGACTGGTT TTAACCAATT GTATTATTCA
TTCTCACCAA CAATTGCTGA TCTGGAACGA GAAAACCCAA TGTTTAAAGA ATCTGTACGA
GCATTCATCA CACCAATGAT TTCAACATTG TCTATTATGA CATTGGCTGA AGATGGTTCA
GAAGCAGAAG TTTTAGGATT GGGAATATCT GTTATTGCAC TTAACTTGGC AATGTATATT
GCAGCACCTG CTGTTGTTGT ATGGCAAATC AAAAAGAGAA TTTAG
 
Protein sequence
MSSLFVEEKS SANFTNVKLL EKNLKFLCCN NLQHLVKTLA IFSVLMLFSI ISLSPAFADH 
SEVTVVPADG SGSPGCEKTA DGCYIPSTAT VDVGGVVIMS NTDTAGHTYT SGNPEDGPDG
IFDSSLLMTG NSFEWSPDEV GEYDYYCMVH PWMLGTIIVQ EVSAEEDDVM EQPMKAEMFG
WDRFESMQDP GVGHEEHQLA ILLAPSENTY AGTLRYDASE PIQLVSLRGP LGSDETAGKI
WTPDGKTKFE LTLVDQESSS GEWDFSGNAL AVHTFNTNQF VVDVQIDYEE IPPQKSMMEE
DMMKDDSMME QETMMADDSV METTSDEAES SGGGCLIATA AYGTELAPQV QFLREIRDNT
VMSTASGASF MTGFNQLYYS FSPTIADLER ENPMFKESVR AFITPMISTL SIMTLAEDGS
EAEVLGLGIS VIALNLAMYI AAPAVVVWQI KKRI
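
The sequences above are printed in blocks of 10 residues separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how such a block could be cleaned and how a GC fraction comparable to the GC content field could be computed. It is shown on the first line of the gene sequence only, not on the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGTCTTCGT TATTTGTAGA GGAGAAATCC TCTGCAAATT TTACAAATGT AAAGTTGCTT"
)
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.2f}")
```

To apply this to the whole gene, pass the full multi-line gene sequence block to clean_sequence; the same function works for the protein sequence, although gc_fraction is only meaningful for nucleotides.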