Gene Nmar_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1714 
Symbol 
ID5773311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1571629 
End bp1572816 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content35% 
IMG OID641317368 
Producthypothetical protein 
Protein accessionYP_001583048 
Protein GI161529222 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0633161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAATACC GATTACGATT AGCCAAAAGA ATGCTATTCA ACAAAAAAGG AAGCTTGATT 
GGTGCAGTTC TAGCTGTTAC TATCGGAATT TTAGTAATTC ATGTAAATTT TGTAATTTTC
CAAGGATTGT TTGATGCAAT TGTCAGAGAT ATCAGTGATT ACAGAAATGG TGACATTCTT
GTAACTGATG AGGCAGACTA TATAGATAAA TCAGATTTAT CTGTTGTAAA TTGGTTTGAA
AGAATTCCAT ATGTAGAGGC GGCAACACCG CGGCTTTCAT CAACTGCCGA AATGAATATG
ACAAAAAATG GAAAACTAGT TGAAGAAACA AGAGTTCCTA TAGTTGGAAT TGATCCATTT
AGAGATATTA GAGCATCAAC TGTTCATGAA ACTGTTTCAG AAGGAAGTTA TGTTTTTTCA
AGAAACTCTA TTGTATTAGG TTCTAATGTT GCAAGAGATC TTGGAGGAGC TGAAGTTGGT
GACAGTGTCA AAGTTCTAGT AGTAGACAGA TATGGACAAG ATGAAATAAG AAGATTTACT
GTTTCAGGAA TTGCAAAATC TCCTGGAGGT CAAGGATTTG ACTATACTGC AGTTATTCAT
ATTGATACAT TACGTGACAT GATGAATAGA CAAGGTGATA CAGGATCATT TATGGTAAAA
CTAAATGATC CCACAAAAGC ATTTGAAGTA AAAGAATTTT TCTTACGTTC ATTTCCAAAT
GATGATTTTA AAGCTGAAAC AATAGAAGAG TCAGCTGAAG AACAACTAGC TGGATTTAGA
TCTGGTATTG CAATGATTAA CATGATTGGT TATTTTGGAA TGATGTCTTC AGCTTTTGCT
ATTGTAACAA TTCAAATGAT GTTGGTAAAT GGAAAAACCC GTGAAATCGG TGTAATGAGA
TCTATTGGAG CAAAAAGAAA GGATATTTTG ATTATCTTTA TTTTCCAAGG AATGATTATT
GGAGCTATTG GTGCTGGTGT AGGTACGGCT GCAGGCTTGG GATACACATT CTATGCAAAG
GAAACTAAAA TGTCATTTAA CAATAGTTTG CCTCTTGAAG TTACCTATAA CTGGGAGAAG
ATCATCCAAA CTGCTTTAAC TTCATTTATT TTGGCAATTA TTGCGTCACT CTATCCGTCG
TATAGGGCTA CAAAGCTATT ACCAGTGGAG GCGATGAGAA CTGTCTAA
 
Protein sequence
MEYRLRLAKR MLFNKKGSLI GAVLAVTIGI LVIHVNFVIF QGLFDAIVRD ISDYRNGDIL 
VTDEADYIDK SDLSVVNWFE RIPYVEAATP RLSSTAEMNM TKNGKLVEET RVPIVGIDPF
RDIRASTVHE TVSEGSYVFS RNSIVLGSNV ARDLGGAEVG DSVKVLVVDR YGQDEIRRFT
VSGIAKSPGG QGFDYTAVIH IDTLRDMMNR QGDTGSFMVK LNDPTKAFEV KEFFLRSFPN
DDFKAETIEE SAEEQLAGFR SGIAMINMIG YFGMMSSAFA IVTIQMMLVN GKTREIGVMR
SIGAKRKDIL IIFIFQGMII GAIGAGVGTA AGLGYTFYAK ETKMSFNNSL PLEVTYNWEK
IIQTALTSFI LAIIASLYPS YRATKLLPVE AMRTV