Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0333 |
Symbol | |
ID | 5773747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 294247 |
End bp | 297375 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641315961 |
Product | hypothetical protein |
Protein accession | YP_001581667 |
Protein GI | 161527841 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCTG ATGCATCTAG TAATCCAAAT CTTTCAGTAT CTGCTGAAAA TTCAGAATTT GGTAATCTTT TTGCAGGATC TATGGTAATT GAAGTAGTTA TTCGTGATTC CAATATCTCT GACACTGATG AAGGCAAAGG CGAACCTGAT GTCACCATTA ATGGTAAAAC ACTACGTATG GTTCAAGCTA CAGATGGAAA TTGGTATGCA TATTTTGCAA ATGTTGACAA GGCAAAAATT GCTGATGCTA CTGTTGGTTT AACTGGTAAA GGATTAGACT TTGGAGAATT CTGTAGTCGT GATACTACAA CATCTGTTTT TGGAATATCT CTAAGTGAAA CTGATGGTTT TGCAATTCCT AGGCCTGATT CTGTTACTGG ATCTACAAAC GGCAACTCAT CTTTTTCTGA ATGCACTTCA AGTCCTACAA CCACTTCTAA TCATAACAAT GTCGTTAGAA ATGCAAAATC AATCAATACA AATTCTAACA TATCTGTTGG TCAAATTGGT TTAGATGCTG ATGCATGGCC ATTAATTCAA CTTTATTCTT TTGATGACGT AACAATACAA TACAATCCTG GCGGACCTTC CCAACAAGTG TATCTTGATT ATGATGAAAT TGAGAATATC AGTTTAGGAC TTGATAGGGA ACTGTATCCT GAAAATTCTG AAGTATTTCT AACTATTAAT GACATTCAGC TAAACCAAGA TCCAACTGAT GAAGACTCTT GGACATTTAA TGTTGATTCT ACTGTAAGTA CATTTTATCA AGCATTTGAT AATTCTGGAA ATGATGATGC AAACGGAAAT GCTGGATTGG TAAATCTTGT TCCACATCTT TCAAATTTAG GATTTGACGA TAATGGAAAA TTATCTCTCA ACCTTGGAAA TATTATGGAA TTAAAAACAA ACAATGACCA ACCTGATTCT TCTATTGATG CTGATGGAGT GTCAAATACG TTTTCTAAAA TTGTAACCCT AGTTGAACAA GGACCCAACT CTGGAATCTT CTCTAGTTCT GATTCTAATG ATCAATCAGT AATCGGTATC AAAGATGATG CACCTAGAGG ACAATCTGGA CAAATAACAT ACAACCAAAA ATCTATTTCG GTAGTCTCTG GTTCTTCTAC AGCTTCTGTT TCTTTTGATG ATAAACCTGT TTTAACAATA GGAAATGGTG ACTCACTTAG GCCTGGTACT GAATATCCTG TATTGTTAGT TGATCCTGAT CAAAATCTGA ATACTGGTGC TAGAGATGAT CTTGATGTCT TTAGAGACTC TGCAATTATT CCAACACTCA CAATAGGGAA TCCCATAACT TTGGAGAATG CAAACAGTGT TGAATTCCAT TCTACTTCTC CTACTATCTC TGGAGGTGAT GATTCTAATT CATCTGTTCC TGATTCAAAT TCTGCTATCT TACTAATTGA CACCTCAAAT GTCTCTAATG CTTCTTATGA GATGATATCG ATTAATCTGG GTATTTCTGC CTTGTCTTTG GCATCCTCAC TCATAGATTC TTCAGAATCT AATACTGATG GAACAAACTG GATAAATTAT GATCTAAGAT CTCTTGAAAA TGAACTAGGA CTTTCTGATT TTAGCAGTAC TACATTTGCC CTTGCATTTG GAACCCGTGA TTCTACTCAG ATTGTAATTG TAGATGATGG TGATGTGTCG TCTTCTCAAG GATTTATTCA AATTGATAAT GATGATGTTG AAAACATTAA AACAAAAACT GGCAATGTTT TTCTAATCAT TGATTTTGAT TCTTCTGACA CAGTTACAAT TTCTAATGAA TCAAACAAAC TTCCAATGGT TTTTGACTTT TTTTCATTTG GATTAAAAAA TGATAATAGT ATAAACAATT CTATTTATCG TTTTGAACTA GAAGAGACTC GAGATAATTC ATCTACATTT GAAGGTAGTC TTGAATATGC AGTTACAAAT CAACTAAACA TAGTTGATCC AAATTTTATT CAAACTATTC AAACAATCGA TGATGAAATA AAAATTATCA TTACTGATAG ATTAATTGAT GAAGAAGGAA TTGCAATCTC CTATGCTGAT TTAGATTCAT CTGGCGTGAC AACAACGACT ACAAGTAAAT CTGATGCTGC AACGCATTCA GGAAATGTTT CTCTAGATTC AACAACATTT CGATTTGGGC AACCAGTAAC AATTACACTT AGCGACTCTG ATCTTAATTT GAAAAGTGAT ACCGTTGAAA TTTATTCAGT GATCAATGAT TCAAACTCTG CCAACGTTGA TACTGTAGGT AAAGACGGGG AAATCTTACT AGAGATTAAA ATCAAAGATA TTCGTTACAA ACGATGTACT GTCAATGGAG TTGAGCATGG TGGTTTAGCC TCTACTGGAT TTACACTTGT AGAAACAGGA CCTAGCACTG GAATATTTGC AGGAGTGTTC AAAATGCCAT CTCAGATTTG TGACAAAACA GGCTCAAAAC TAATCTATAC CTCTGGTGGA AGTATTGATG TTCGTTATCA TGACTCTCGT GATGCTTCTG GAAATGCAAA TATTTTCAGT TTGCTTGATT CAAAGTCATC TGTATCATTT TACACTCCTG CCAAATTAAG CCTAGAAAAA ATGGTACTTC CTATGAGTGA TTCAAAAGAA ATAGTTCTGA CAGGAAGTAT TGAAAATCAT AAACGAGGAA TTCCATTATC TATTGAACTT ACAAATCCTG ATTACACAAA ACAAAACTTT GGCGCGTCTC TAAGTAATAG CGGAGGTTAT CGTTCTGTAT TCACTATTAA TCCAAATACT TTGCTTGGAA CATATTTTGT ATCCCTTTCA TATGATGGTA AAAACCTTGG AACTCTCTCA TTTGATGTAG TTTCTGAAAA TGTTCCTGAT TGGGTAAAAA ATAATGCTCG TTGGTGGTCT TCAAACAATA TATCTGATGA TGAATTTATT GGTGGAATAG AACATCTAAT TGAGACAGGA ATAATTTCTA TTGATTCCTC TGAACAAAAT TCGACAGAAC AAGAAATTCC CGATTGGATA AAAAATACTG CAAGATGGTG GGCAGATGAT CAGATTCCAG AAGATGAGTT CTTAAAATCG ATTCAATATT TGGTCAAAAA AGGTATAATT CAGGTATGA
|
Protein sequence | MNADASSNPN LSVSAENSEF GNLFAGSMVI EVVIRDSNIS DTDEGKGEPD VTINGKTLRM VQATDGNWYA YFANVDKAKI ADATVGLTGK GLDFGEFCSR DTTTSVFGIS LSETDGFAIP RPDSVTGSTN GNSSFSECTS SPTTTSNHNN VVRNAKSINT NSNISVGQIG LDADAWPLIQ LYSFDDVTIQ YNPGGPSQQV YLDYDEIENI SLGLDRELYP ENSEVFLTIN DIQLNQDPTD EDSWTFNVDS TVSTFYQAFD NSGNDDANGN AGLVNLVPHL SNLGFDDNGK LSLNLGNIME LKTNNDQPDS SIDADGVSNT FSKIVTLVEQ GPNSGIFSSS DSNDQSVIGI KDDAPRGQSG QITYNQKSIS VVSGSSTASV SFDDKPVLTI GNGDSLRPGT EYPVLLVDPD QNLNTGARDD LDVFRDSAII PTLTIGNPIT LENANSVEFH STSPTISGGD DSNSSVPDSN SAILLIDTSN VSNASYEMIS INLGISALSL ASSLIDSSES NTDGTNWINY DLRSLENELG LSDFSSTTFA LAFGTRDSTQ IVIVDDGDVS SSQGFIQIDN DDVENIKTKT GNVFLIIDFD SSDTVTISNE SNKLPMVFDF FSFGLKNDNS INNSIYRFEL EETRDNSSTF EGSLEYAVTN QLNIVDPNFI QTIQTIDDEI KIIITDRLID EEGIAISYAD LDSSGVTTTT TSKSDAATHS GNVSLDSTTF RFGQPVTITL SDSDLNLKSD TVEIYSVIND SNSANVDTVG KDGEILLEIK IKDIRYKRCT VNGVEHGGLA STGFTLVETG PSTGIFAGVF KMPSQICDKT GSKLIYTSGG SIDVRYHDSR DASGNANIFS LLDSKSSVSF YTPAKLSLEK MVLPMSDSKE IVLTGSIENH KRGIPLSIEL TNPDYTKQNF GASLSNSGGY RSVFTINPNT LLGTYFVSLS YDGKNLGTLS FDVVSENVPD WVKNNARWWS SNNISDDEFI GGIEHLIETG IISIDSSEQN STEQEIPDWI KNTARWWADD QIPEDEFLKS IQYLVKKGII QV
|
| |