Gene Nmar_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0231 
Symbol 
ID5774485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp203816 
End bp205486 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content34% 
IMG OID641315852 
ProductIg family protein 
Protein accessionYP_001581565 
Protein GI161527739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATACAA AATTAGGAAT ATTTGCAGTA TTTTCATTAT CTTTACTTAT GCTAACGCCT 
GCTTATGCTA GTGTTACATC ATTTTCATTA GATAGGAGTT TTTACACAAT AGATGAGACT
TTCACTTTCA GTGGAGAGCA AGAAGGAAAA GAAACAGTAT ACATTATAAT TCGCGATTCA
AGTGGGAATT TTAAAGGAAT GCTATCTGAT CCAGCACCTG GTCAGGGTGA ATTTTCTGTG
ATACCAAGGC CAGTAGAAAA TTTCTTTTCT AGTCAAGGAA TATACAATGC AACAGCATTC
ACAGATAGTC AAAAAGAAGA AGAAGGTCTC ACAATAAAAA TTGAATATGA TGGAAAGAAA
ATTTTTGAGA TGCCTGATTT TGTTTTGGAA TTAAAACCTA TTTCGGACAA AGAAATTGAT
GAATTAAAGA CTGTTTCATT TACTGTTAGT ATTACAGACA GTTCTGTTGA AGATGAAGTA
TATAGTTTAG AAAAAAATCC ACCTAGTGGA GCAACAATTG ATTCAAGTAC AGGGAAATTT
GTATGGACTC CATCAGGATC TCATGGGAAT AATCCAGGAG CAGAATACAC TTTTGATATT
GTTGTAACAA GAGGCAGTCA AACAGATAGA CAAACAGTTA CGATTACAGT CAATGAACCT
GTTGCAGTAA ATCCTGAACC AAAAGAAACA ACAGAAACTG TACCTGAGCC AAAAGAAGCT
GTACCTGAGC CAAAAGAAGC TGTACCTGAG CCAAAAGAAT TAGAGATTCC TGCACCATTT
GTAGATGAAA CAAAAGATCC TCAAAGTTAC GTAGACAGAT ACAATGATGA AGAAGGTTAC
AAAAAGTGGT TTGATGACAA TTATTCAGAA TATTCTTCAA TTTATCAGGC TGTAGGACTA
GAAGAACCTC TAGAGATTCC TGCACCATTT GTAGATGAAA CAAAAGATCC TCAAAGTTAC
GTAGACAGAT ACAATGATGA AGAAGGTTAC AAAAAGTGGT TTGATGACAA TTATTCAGAA
TATTCTTCAA TTTATCAGGC TGTAGGACTA GAAGAACCAA AGGTCTTGGC ACCATTTGTT
GATCCTAATC TAGATCCACA GTACTATGTA GAGAGATACA ATAATGAAAT CACATACAAA
GATTGGTTTG ATAAAACATA TCCTGAAATG ACAATTTTTG AAGCAGTGGG ATTAGGTGAA
CCGAAAATTG TTGAAAAAGA ATTTGGAGAA TGTGGTGTAG GAACAAATCT AGTAAATGGA
GAATGTACAG TTATTCCAAT AGAAAGTAAT GATGGGGGCG GTTGCCTAAT TGCAACTGCA
GCATATGGTT CTGAGATGGC ACCACAAGTT CAACTCTTAA GAGAAATTCG TGATAATCAA
TTAATGAATA CTGAGTCAGG AATGTCATTT ATGACTGGAT TTAACCAAAT TTACTATTCA
TTCTCACCAT ACATTGCAGA TATGCAAAGA GAAAATCCAA TGTTCAAAGA AGCAATAAAG
ATTGGAATTA CACCATTATT ATCATCATTG TCTGTAATGA AATATGCAGA ATCAGAATCA
CAAGTTCTTG GATATGGAGT TGGAGTGATA TTGATGAATA TTGGAATTTA TTTTGCAGTA
CCAGCAATGT TGTTTTTTGG AATAAAAAAA GTAAGACGAG TTAGGTTTTA A
 
Protein sequence
MNTKLGIFAV FSLSLLMLTP AYASVTSFSL DRSFYTIDET FTFSGEQEGK ETVYIIIRDS 
SGNFKGMLSD PAPGQGEFSV IPRPVENFFS SQGIYNATAF TDSQKEEEGL TIKIEYDGKK
IFEMPDFVLE LKPISDKEID ELKTVSFTVS ITDSSVEDEV YSLEKNPPSG ATIDSSTGKF
VWTPSGSHGN NPGAEYTFDI VVTRGSQTDR QTVTITVNEP VAVNPEPKET TETVPEPKEA
VPEPKEAVPE PKELEIPAPF VDETKDPQSY VDRYNDEEGY KKWFDDNYSE YSSIYQAVGL
EEPLEIPAPF VDETKDPQSY VDRYNDEEGY KKWFDDNYSE YSSIYQAVGL EEPKVLAPFV
DPNLDPQYYV ERYNNEITYK DWFDKTYPEM TIFEAVGLGE PKIVEKEFGE CGVGTNLVNG
ECTVIPIESN DGGGCLIATA AYGSEMAPQV QLLREIRDNQ LMNTESGMSF MTGFNQIYYS
FSPYIADMQR ENPMFKEAIK IGITPLLSSL SVMKYAESES QVLGYGVGVI LMNIGIYFAV
PAMLFFGIKK VRRVRF