Gene Nmar_0043 details


Gene Information

Locus tag: Nmar_0043
Symbol:
ID: 5774145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 33030
End bp: 34121
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 36%
IMG OID: 641315660
Product: hypothetical protein
Protein accession: YP_001581381
Protein GI: 161527555
COG category: [S] Function unknown
COG ID: [COG0392] Predicted integral membrane protein
TIGRFAM ID: [TIGR00374] conserved hypothetical protein
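
The start, end, and length fields listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 33030
end_bp = 34121

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the listed gene length of 1092 bp and protein length of 363 aa.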


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTTAG GCGTGAAAAC TCTACTAAAC CGATTTTGTC AACTTTCATA TTTGATCCTT 
CAAAGAAATC ATCAAATGAA CTGGAGACTT GCAGCTATTC CTGCTACATT AATTCCAATT
ATCATTATAG CTATTCAATT TGATATCAAA CCTGAAGACG TTCTTGCAAT CGGTTTCTTT
CCATTTGTCG GTGCAGTTGT AGCAATGATG ATAAAACTAG GACTTCAAGG AGTAAAGTTT
GCCTACATTG CAAGGAGATA TCTTGGCAAT TTTGATTCTG TTTTGAAATT AACTGGAGTT
CGTGTCGGTA GTGAGTTTAT CAAATTTACA ACTCCGATGT TTATTGGAGC AGAATTCATC
GTAATCTATT ATTTGCACAA AAAGGGAGTA AAGCCCTCAA AATCAACATG GATTGCAATA
ATGGATATTG TAACTGAAGT GTTTGCAGCT GGATTGTTAT CTATAATGGC AGGAATAATT
GCACTGCTAA ATGGAGCATA TGTTGTTGCC GCAGTAGTTT TGGGAACCAG CATTACTGTC
ACCACATTGT GGATGGTACT ATTCTTCTTG TCTTCTAAAC GCACATTCCA AGTTCCTAAA
GTTTTGGGAA AACTTGCACA AAGATTTGGA AAAGAGAAAG GTACCAAGTA TATAGAACAA
ACAAACTCTT GGATGGAAGA AGTGTGTACT ATGAGTAGAG AAAATCTCAA AACTTCTGAA
TCAAAAAAGG TCTTTACAAT ATCATTCTTG TTTTCAATAG CATCTTGGTC ATTTTATGGA
ATTTCATTTA TGATCATTGC AATGGGAACT GGATATGTTA TCAACGCATT TGATTCTATT
ATGGCTGTAA TGGGGGCAAA TGCAATTGGA AATCTTCCAA TCACTATTGG TGGTTCTGGC
CTTGCTGAAT TTGGAATTGT TGCATATCTT AACAATCTAA ATCCATTTGA CTTTGATGCT
TCCCAAGGTG GTTTAGCTTG GGATGCAGTA ATAGGCTGGA GAATTGCAAC ATACTATGTA
CCAATTGTGA TTACTTGGTT GCTTTTAGTA AAACTAGCCT TGAGTAGAAT CTCAAAACCT
CAAGCCACAT AG
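
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGGTTTTAG GCGTGAAAAC TCTACTAAAC CGATTTTGTC AACTTTCATA TTTGATCCTT"

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases after cleaning
print(round(gc_fraction(seq), 2))    # GC fraction of this fragment only
```

Pasting the full sequence block into `clean_sequence` in place of `first_line` would give values to compare against the listed gene length and GC content.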
 
Protein sequence
MVLGVKTLLN RFCQLSYLIL QRNHQMNWRL AAIPATLIPI IIIAIQFDIK PEDVLAIGFF 
PFVGAVVAMM IKLGLQGVKF AYIARRYLGN FDSVLKLTGV RVGSEFIKFT TPMFIGAEFI
VIYYLHKKGV KPSKSTWIAI MDIVTEVFAA GLLSIMAGII ALLNGAYVVA AVVLGTSITV
TTLWMVLFFL SSKRTFQVPK VLGKLAQRFG KEKGTKYIEQ TNSWMEEVCT MSRENLKTSE
SKKVFTISFL FSIASWSFYG ISFMIIAMGT GYVINAFDSI MAVMGANAIG NLPITIGGSG
LAEFGIVAYL NNLNPFDFDA SQGGLAWDAV IGWRIATYYV PIVITWLLLV KLALSRISKP
QAT