Gene Nmar_0333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0333 
Symbol 
ID5773747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp294247 
End bp297375 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content34% 
IMG OID641315961 
Producthypothetical protein 
Protein accessionYP_001581667 
Protein GI161527841 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTG ATGCATCTAG TAATCCAAAT CTTTCAGTAT CTGCTGAAAA TTCAGAATTT 
GGTAATCTTT TTGCAGGATC TATGGTAATT GAAGTAGTTA TTCGTGATTC CAATATCTCT
GACACTGATG AAGGCAAAGG CGAACCTGAT GTCACCATTA ATGGTAAAAC ACTACGTATG
GTTCAAGCTA CAGATGGAAA TTGGTATGCA TATTTTGCAA ATGTTGACAA GGCAAAAATT
GCTGATGCTA CTGTTGGTTT AACTGGTAAA GGATTAGACT TTGGAGAATT CTGTAGTCGT
GATACTACAA CATCTGTTTT TGGAATATCT CTAAGTGAAA CTGATGGTTT TGCAATTCCT
AGGCCTGATT CTGTTACTGG ATCTACAAAC GGCAACTCAT CTTTTTCTGA ATGCACTTCA
AGTCCTACAA CCACTTCTAA TCATAACAAT GTCGTTAGAA ATGCAAAATC AATCAATACA
AATTCTAACA TATCTGTTGG TCAAATTGGT TTAGATGCTG ATGCATGGCC ATTAATTCAA
CTTTATTCTT TTGATGACGT AACAATACAA TACAATCCTG GCGGACCTTC CCAACAAGTG
TATCTTGATT ATGATGAAAT TGAGAATATC AGTTTAGGAC TTGATAGGGA ACTGTATCCT
GAAAATTCTG AAGTATTTCT AACTATTAAT GACATTCAGC TAAACCAAGA TCCAACTGAT
GAAGACTCTT GGACATTTAA TGTTGATTCT ACTGTAAGTA CATTTTATCA AGCATTTGAT
AATTCTGGAA ATGATGATGC AAACGGAAAT GCTGGATTGG TAAATCTTGT TCCACATCTT
TCAAATTTAG GATTTGACGA TAATGGAAAA TTATCTCTCA ACCTTGGAAA TATTATGGAA
TTAAAAACAA ACAATGACCA ACCTGATTCT TCTATTGATG CTGATGGAGT GTCAAATACG
TTTTCTAAAA TTGTAACCCT AGTTGAACAA GGACCCAACT CTGGAATCTT CTCTAGTTCT
GATTCTAATG ATCAATCAGT AATCGGTATC AAAGATGATG CACCTAGAGG ACAATCTGGA
CAAATAACAT ACAACCAAAA ATCTATTTCG GTAGTCTCTG GTTCTTCTAC AGCTTCTGTT
TCTTTTGATG ATAAACCTGT TTTAACAATA GGAAATGGTG ACTCACTTAG GCCTGGTACT
GAATATCCTG TATTGTTAGT TGATCCTGAT CAAAATCTGA ATACTGGTGC TAGAGATGAT
CTTGATGTCT TTAGAGACTC TGCAATTATT CCAACACTCA CAATAGGGAA TCCCATAACT
TTGGAGAATG CAAACAGTGT TGAATTCCAT TCTACTTCTC CTACTATCTC TGGAGGTGAT
GATTCTAATT CATCTGTTCC TGATTCAAAT TCTGCTATCT TACTAATTGA CACCTCAAAT
GTCTCTAATG CTTCTTATGA GATGATATCG ATTAATCTGG GTATTTCTGC CTTGTCTTTG
GCATCCTCAC TCATAGATTC TTCAGAATCT AATACTGATG GAACAAACTG GATAAATTAT
GATCTAAGAT CTCTTGAAAA TGAACTAGGA CTTTCTGATT TTAGCAGTAC TACATTTGCC
CTTGCATTTG GAACCCGTGA TTCTACTCAG ATTGTAATTG TAGATGATGG TGATGTGTCG
TCTTCTCAAG GATTTATTCA AATTGATAAT GATGATGTTG AAAACATTAA AACAAAAACT
GGCAATGTTT TTCTAATCAT TGATTTTGAT TCTTCTGACA CAGTTACAAT TTCTAATGAA
TCAAACAAAC TTCCAATGGT TTTTGACTTT TTTTCATTTG GATTAAAAAA TGATAATAGT
ATAAACAATT CTATTTATCG TTTTGAACTA GAAGAGACTC GAGATAATTC ATCTACATTT
GAAGGTAGTC TTGAATATGC AGTTACAAAT CAACTAAACA TAGTTGATCC AAATTTTATT
CAAACTATTC AAACAATCGA TGATGAAATA AAAATTATCA TTACTGATAG ATTAATTGAT
GAAGAAGGAA TTGCAATCTC CTATGCTGAT TTAGATTCAT CTGGCGTGAC AACAACGACT
ACAAGTAAAT CTGATGCTGC AACGCATTCA GGAAATGTTT CTCTAGATTC AACAACATTT
CGATTTGGGC AACCAGTAAC AATTACACTT AGCGACTCTG ATCTTAATTT GAAAAGTGAT
ACCGTTGAAA TTTATTCAGT GATCAATGAT TCAAACTCTG CCAACGTTGA TACTGTAGGT
AAAGACGGGG AAATCTTACT AGAGATTAAA ATCAAAGATA TTCGTTACAA ACGATGTACT
GTCAATGGAG TTGAGCATGG TGGTTTAGCC TCTACTGGAT TTACACTTGT AGAAACAGGA
CCTAGCACTG GAATATTTGC AGGAGTGTTC AAAATGCCAT CTCAGATTTG TGACAAAACA
GGCTCAAAAC TAATCTATAC CTCTGGTGGA AGTATTGATG TTCGTTATCA TGACTCTCGT
GATGCTTCTG GAAATGCAAA TATTTTCAGT TTGCTTGATT CAAAGTCATC TGTATCATTT
TACACTCCTG CCAAATTAAG CCTAGAAAAA ATGGTACTTC CTATGAGTGA TTCAAAAGAA
ATAGTTCTGA CAGGAAGTAT TGAAAATCAT AAACGAGGAA TTCCATTATC TATTGAACTT
ACAAATCCTG ATTACACAAA ACAAAACTTT GGCGCGTCTC TAAGTAATAG CGGAGGTTAT
CGTTCTGTAT TCACTATTAA TCCAAATACT TTGCTTGGAA CATATTTTGT ATCCCTTTCA
TATGATGGTA AAAACCTTGG AACTCTCTCA TTTGATGTAG TTTCTGAAAA TGTTCCTGAT
TGGGTAAAAA ATAATGCTCG TTGGTGGTCT TCAAACAATA TATCTGATGA TGAATTTATT
GGTGGAATAG AACATCTAAT TGAGACAGGA ATAATTTCTA TTGATTCCTC TGAACAAAAT
TCGACAGAAC AAGAAATTCC CGATTGGATA AAAAATACTG CAAGATGGTG GGCAGATGAT
CAGATTCCAG AAGATGAGTT CTTAAAATCG ATTCAATATT TGGTCAAAAA AGGTATAATT
CAGGTATGA
 
Protein sequence
MNADASSNPN LSVSAENSEF GNLFAGSMVI EVVIRDSNIS DTDEGKGEPD VTINGKTLRM 
VQATDGNWYA YFANVDKAKI ADATVGLTGK GLDFGEFCSR DTTTSVFGIS LSETDGFAIP
RPDSVTGSTN GNSSFSECTS SPTTTSNHNN VVRNAKSINT NSNISVGQIG LDADAWPLIQ
LYSFDDVTIQ YNPGGPSQQV YLDYDEIENI SLGLDRELYP ENSEVFLTIN DIQLNQDPTD
EDSWTFNVDS TVSTFYQAFD NSGNDDANGN AGLVNLVPHL SNLGFDDNGK LSLNLGNIME
LKTNNDQPDS SIDADGVSNT FSKIVTLVEQ GPNSGIFSSS DSNDQSVIGI KDDAPRGQSG
QITYNQKSIS VVSGSSTASV SFDDKPVLTI GNGDSLRPGT EYPVLLVDPD QNLNTGARDD
LDVFRDSAII PTLTIGNPIT LENANSVEFH STSPTISGGD DSNSSVPDSN SAILLIDTSN
VSNASYEMIS INLGISALSL ASSLIDSSES NTDGTNWINY DLRSLENELG LSDFSSTTFA
LAFGTRDSTQ IVIVDDGDVS SSQGFIQIDN DDVENIKTKT GNVFLIIDFD SSDTVTISNE
SNKLPMVFDF FSFGLKNDNS INNSIYRFEL EETRDNSSTF EGSLEYAVTN QLNIVDPNFI
QTIQTIDDEI KIIITDRLID EEGIAISYAD LDSSGVTTTT TSKSDAATHS GNVSLDSTTF
RFGQPVTITL SDSDLNLKSD TVEIYSVIND SNSANVDTVG KDGEILLEIK IKDIRYKRCT
VNGVEHGGLA STGFTLVETG PSTGIFAGVF KMPSQICDKT GSKLIYTSGG SIDVRYHDSR
DASGNANIFS LLDSKSSVSF YTPAKLSLEK MVLPMSDSKE IVLTGSIENH KRGIPLSIEL
TNPDYTKQNF GASLSNSGGY RSVFTINPNT LLGTYFVSLS YDGKNLGTLS FDVVSENVPD
WVKNNARWWS SNNISDDEFI GGIEHLIETG IISIDSSEQN STEQEIPDWI KNTARWWADD
QIPEDEFLKS IQYLVKKGII QV