Gene Nmar_1133 details

Gene Information

Locus tag: Nmar_1133
Symbol:
ID: 5774197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1036866
End bp: 1037891
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 41%
IMG OID: 641316776
Product: kelch repeat-containing protein
Protein accession: YP_001582467
Protein GI: 161528641
COG category: [S] Function unknown
COG ID: [COG3055] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
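
The start and end positions, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1036866
end_bp = 1037891

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1026
print(protein_length)  # 341
```

Both results agree with the Gene Length and Protein Length fields of the record.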


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0981995
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TATCATTTTT CTTAATTCTG TTAATTTTTC CTGTTTCAGA TATTTTTGCA 
GAAGAAGATT CAGAAGGGTG GAAAAGATTG GCAGACATGC CAGAAGTAAG GTCAGAGATG
GAATCAGCTG CAATTGATGA AAAGATCTAT GTGGTGGGAG GCATAGCCAA TACAAATCAA
GTATCAAATT CTGTTTTTGT TTTTGATACC AAAGATGAAT CATGGAGTAC TGGAACCCCA
ATGCCAATAG AATTACATCA TGCTGGAACT GCAGCCCATG ATGGGAAGCT GTATGTTGTT
GGAGGATACA TGAAAGGGTG GAGTCCATCA AACGCATTAC TAATTTATGA TTCTGTCAAA
GATTCTTGGA GTCAGGGCAA GGATATGCCA ACTGCTCGCG GTGCACTGAC TGCAGAATTT
GTAGACGGCA AGCTGTATGC AGTTGGAGGA TTCAATGAGA ATTCCCGTAC TGAAAATGAA
GTGTATGATC CTGCAGACGA CTCTTGGGAG AAAATGGCCC CAATGCCTAC AGCCAGGGAA
CACCTAGCAT CAGCAGTTCT AGACGGACAG TTGTTTGTCA TTGGTGGCAG GGCAGGACAG
GTAAATTCTG ATGCAAACGA AATGTACGAC TATACCTCAG ATACTTGGAA AATATTAGAA
CCACTTCCAA CTGCAAGAAG TGGATTGGCT GCATCTGTTA TTAGCGGAGC AGTTTTTGTT
TTTGGGGGAG AAAGCTCACT AAGGACATTT GAAGAAAATG AAGCATACAT TCCTGAAGAA
GGATGGTTTG CACAGCAACC AATGCCAATA CCAAGACATG GCTTAGCATC ATCAACTGTA
GGGGACAACA TCTATCTTAT TGGCGGGGGA GTAGTTCCAG GTTTTAGCTT TAGTGGAATT
ACTGAAAAAT ATCACAACAC AGTCGTTCCA GAATTCGGCG TCTTGTCAAT TGTGATTTTG
GGAATCTCAA CTGTCATGAT AATTTTGTTT ACAAAGCCTA AATTTCAACA CATCATTCAG
CAATGA
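
The gene sequence above is printed in blocks of ten bases, so the grouping has to be stripped before the length or GC content can be computed. The following is a rough Python sketch of that step. The helper names `clean` and `gc_percent` are hypothetical, and the sample is only the first line of the sequence, so its GC percentage is not expected to match the 41% reported for the whole gene.

```python
def clean(seq_text):
    """Drop the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste every line to check the full gene.
sample = clean("ATGAAAAAAA TATCATTTTT CTTAATTCTG TTAATTTTTC CTGTTTCAGA TATTTTTGCA")
print(len(sample), round(gc_percent(sample), 1))
```

Running the same two functions on all eighteen lines of the gene sequence should give a length equal to the Gene Length field and a GC percentage close to the GC content field.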
 
Protein sequence
MKKISFFLIL LIFPVSDIFA EEDSEGWKRL ADMPEVRSEM ESAAIDEKIY VVGGIANTNQ 
VSNSVFVFDT KDESWSTGTP MPIELHHAGT AAHDGKLYVV GGYMKGWSPS NALLIYDSVK
DSWSQGKDMP TARGALTAEF VDGKLYAVGG FNENSRTENE VYDPADDSWE KMAPMPTARE
HLASAVLDGQ LFVIGGRAGQ VNSDANEMYD YTSDTWKILE PLPTARSGLA ASVISGAVFV
FGGESSLRTF EENEAYIPEE GWFAQQPMPI PRHGLASSTV GDNIYLIGGG VVPGFSFSGI
TEKYHNTVVP EFGVLSIVIL GISTVMIILF TKPKFQHIIQ Q
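
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a small Python sketch of that translation, built on the assumption that table 11 assigns codons to amino acids exactly as the standard genetic code does and differs only in which start codons it allows. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and final block of the gene sequence.
print(translate("ATGAAAAAAA TATCATTTTT CTTAATTCTG"))  # MKKISFFLIL
print(translate("CAATGA"))                            # Q
```

The first ten residues and the final residue match the start and end of the protein sequence listed above.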