Gene Nmar_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1543 
Symbol 
ID5773851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1404590 
End bp1406185 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content38% 
IMG OID641317195 
Productcytochrome b/b6 domain-containing protein 
Protein accessionYP_001582877 
Protein GI161529051 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000370205 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGTTT CGCTTGAGCC AAGAAGAAAT GGTGTAGTTG AATTTCTCTA TTGGTTATGG 
GAAGGTGTAG ATAGAACTAT CTTTACTGCA ATCAAGTTTT CATTTCCTGC AAGATTTGTA
AGTCCATTTG GATTTTTGGG AATGTTAACA TTCATTGTGT TTATCATTCT AGGAATTTCA
GGAGCTCTGC TCATGTTTTA CTATCAACCG ATATTGGATA GAGCATGGGA TAGTGTTCAA
TTCATTAACG ATGAAGTTCC ATTTGGATTC CACATTAGAA ACATACACTA TCATGGTTCT
AATGCAATGG TTCTCTTAGC TGTTCTTCAC ATGTATTATC AATACTTTAG TGGAAGATAC
AAAATTAGAA ATGAAGTTTT ATGGATGACT GGTGTTATCT TAGGCGTTGT TACTATCTTA
GAAGCATTTA CTGGATATGA TGTTATATTC TCTGAAAGAG CGCAACTTGC AATTAGTATT
GCAGCGTCGT TAACCACTTC AATTCCAGTG GCCGGATCTG TCATACGTGA CGCGGCGTTG
GGTAGTGGGT TCTCGGACTT TGTATTGAGA TTCTATGCTC AACATGTGTT CTTGTTGCCA
ATAGTTATGC TTGGATTGAT GGCAGTTCAC TTCCCAAGAT TCTTGGTATT TGATGTACCA
ATGGTTATGG CAATTGCTGG TGCAATTTTG ATTACTGGTG GTGTTTTCCC AATTGATTTG
GGATTCAAGT TTGAGCCGAC CGTACCGCCT GGTGTAACCG TACCTGAATG GTATTTGACT
GGAATTTATG CATTCATGAG AACTCAGTAT GATAAGTTTG TAACAGGGTT GCTGTGGCCT
TTGATATTCA TTATATCGTT TGTATTGATT CCATTTATTG ATAGATACAA GAAATTCTCT
TGGAGAGATA GACCAATTAT TACTGCATTT GGAATTACCA GCCTTGCACA GATTATGGTA
ACAACTTATT GGGGATTCTA TATCTCACCT GACATCTCAA TTCCATTAGT TGAGCGTTTG
GTAATTGATC CTGTGTTCTT TTACAGTGTG ATGATATTGT TAGTTCCTAT GAGTTTTGGA
TTTACATACA TGATGATCAA ACTTGCAAAT GAAGCAGAGA GAAAATCAAA ACTAGCAAAA
AATACTGGTC CGCAGAAAGT TGCAACCTTA GATCTATCTG AAAAATGGAT TAACTGGTTA
CTTGTTGCAT TACTAGCATT CCAAGTATTC CTTAACATTG CAGCATACAA TGCAGCTTTG
ACTGGCATGA ACAACATGTC TTTGTTCTTT GTTGGATTGA TACTGATGGT ATTTGCTGGT
TTCTTCCATA TCTACAGATA TGGTATGAGT CAGCAAAAGA ATGCTCCTCC AGCTCCTCCA
GCTCCTGTAT CTGATGAGAA ACCAAAACTA GCTGAACCAG AAGAATCTTC ACAACCCGAA
GAATCTGCAA AACTTCCTGA AGGTGAAACT GCAGCAGAAA AACCTGAAGA GAAACCTATT
GCTCCAGAAG TTTCTGCACC AAAAACTCAA GCTGACTTGG GAGTTGGTGC AGACAATAAT
CCAAATCTTG GTACCGGTGA TCTCAACAAA CCATGA
 
Protein sequence
MAVSLEPRRN GVVEFLYWLW EGVDRTIFTA IKFSFPARFV SPFGFLGMLT FIVFIILGIS 
GALLMFYYQP ILDRAWDSVQ FINDEVPFGF HIRNIHYHGS NAMVLLAVLH MYYQYFSGRY
KIRNEVLWMT GVILGVVTIL EAFTGYDVIF SERAQLAISI AASLTTSIPV AGSVIRDAAL
GSGFSDFVLR FYAQHVFLLP IVMLGLMAVH FPRFLVFDVP MVMAIAGAIL ITGGVFPIDL
GFKFEPTVPP GVTVPEWYLT GIYAFMRTQY DKFVTGLLWP LIFIISFVLI PFIDRYKKFS
WRDRPIITAF GITSLAQIMV TTYWGFYISP DISIPLVERL VIDPVFFYSV MILLVPMSFG
FTYMMIKLAN EAERKSKLAK NTGPQKVATL DLSEKWINWL LVALLAFQVF LNIAAYNAAL
TGMNNMSLFF VGLILMVFAG FFHIYRYGMS QQKNAPPAPP APVSDEKPKL AEPEESSQPE
ESAKLPEGET AAEKPEEKPI APEVSAPKTQ ADLGVGADNN PNLGTGDLNK P