Gene Nmar_1678 details


Gene Information

Locus tag: Nmar_1678
Symbol:
ID: 5774298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1537367
End bp: 1538377
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 34%
IMG OID: 641317332
Product: blue (type1) copper domain-containing protein
Protein accession: YP_001583012
Protein GI: 161529186
COG category: [C] Energy production and conversion
COG ID: [COG3794] Plastocyanin
TIGRFAM ID:
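
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are chosen for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1537367
end_bp = 1538377
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks come out true: the span is 1011 bp, which is 337 codons, or 336 residues plus one stop codon.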


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAA AATTTTTCAT TTTGTTTGTT GTTGCAACAA GTTTTCTATT TACAGGAAAC 
ATACAAAATT CCTTTGCTGA GGATTTTGTT GTAAGTATTC CTTTTGGTGC ATTTAATCCT
GAACTAAATA CCCCTGCTAA AGTTTGGTAT GATCCACCTG AATTATCTAT TGTTGAAGGG
GATACTGTAA CATGGGTAAA TGATGATAGA GAAGGGCATA CAGTTACCAG TGGCAGTGGT
GCAGGAAGAT TTGATTGGAT GGATGCCAAG AATCTTGGTG AGCCTGATGG ACTGTTTGAT
AGTTTAAGAT TTATGCCAGA CGAGTCTTGG TCATATACAT TTGAAAAAGC TGGAGATTAC
AATTACTTTT GTGTAATTCA TCCCTGGATG GAAGGAATTA TTTTTGTGAA ACCATTTATT
CCTGATTATC CCCATGATGC TACAGGAAAA AAGTATGAAC AGTTCCCTAC ATTTCTTATA
ACTCCTGATG GTTCAATTGA AATTAATTTT TCATGGGAAC CTCGAGTCAT TAAAACCCAT
GAAAAAACAA ACTTCATCTA TCGTTTTTAT GATGCAATAT ATGATCAGCC ATTAAGAAAG
TTAGAATATG ATATAGCTAT TTTACAAAAT AATCAAGTGT TATACAAAGA TGAAGGCGCA
GTATCAGGTG CGGGTGGGGA TTACCGACAA TGGATATTTG AAGAACCAGG CCCTATCATT
GTTAAAATAT CAAATATCAA ACCTTATGGT TCTGTAGCAG AAACACAAAT CAATCTTGGA
CCTGATGCCA CTGCTAGATT AGGAGATTTT ACAGCTATGG TTTATGAAAA TTATGAAAAG
AAAACTACTA CTGAAAAAAT TGTACAACCT CGAGATACTT TGCAATTTTA TTATGAGATT
GCCGTAGCAA TGATTATAGT TCCTGCAATT ATGTTGGCAG TTATAGTGCT ATATATGAAA
GGAAAAAAAC CTACTTATAA TTATCCTGAA AGAAAAGCGA GTCCTGTGTA A
 
Protein sequence
MQKKFFILFV VATSFLFTGN IQNSFAEDFV VSIPFGAFNP ELNTPAKVWY DPPELSIVEG 
DTVTWVNDDR EGHTVTSGSG AGRFDWMDAK NLGEPDGLFD SLRFMPDESW SYTFEKAGDY
NYFCVIHPWM EGIIFVKPFI PDYPHDATGK KYEQFPTFLI TPDGSIEINF SWEPRVIKTH
EKTNFIYRFY DAIYDQPLRK LEYDIAILQN NQVLYKDEGA VSGAGGDYRQ WIFEEPGPII
VKISNIKPYG SVAETQINLG PDATARLGDF TAMVYENYEK KTTTEKIVQP RDTLQFYYEI
AVAMIIVPAI MLAVIVLYMK GKKPTYNYPE RKASPV
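
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it uses only the first 30 and last 12 nucleotides of the gene sequence to keep the example short. The helper names `translate` and `gc_fraction` are hypothetical, written for this illustration.

```python
# Codon table in the conventional T, C, A, G base order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the gene sequence above (spaces are ignored).
gene_start = "ATGCAAAAAA AATTTTTCAT TTTGTTTGTT"
gene_end = "AGTCCTGTGT AA"

print(translate(gene_start))  # MQKKFFILFV
print(translate(gene_end))    # SPV*
```

The two excerpts translate to the first ten residues (MQKKFFILFV) and the last three residues (SPV) of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.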