Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1343 |
Symbol | |
ID | 5773792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1230831 |
End bp | 1231745 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316988 |
Product | ectoine hydroxylase |
Protein accession | YP_001582677 |
Protein GI | 161528851 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG5285] Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin |
TIGRFAM ID | [TIGR02408] ectoine hydroxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00503536 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTATCAAA ATAACACTGA AATGGATTTT TATCCAACAA GAATGAATTC TGAATCAAAA ATTATCCCAA GGACTGATCC TGTGGTATAT CCAAGTTCTT TAACCTCTTT GACTAGGAAA CAAGAGGATT TTTTTGAGAA AAACGGATAT CTTGTATTTG AGAATCTGTT TTCTATAGAT GAGGTGACTG CATTACTTGA TGAACTAAAG GCATTGTCAA AAGATGACTC TAAAAAAGAA CTGCCTCAAT TCATCTTGGA GGAAACAAAA AAAGATGTAC GTTCCATATT TGAAATACAT AAACTAAGTG AGATTTTTAG CAGACTTTGC AAAGACAGTA GACTTGTAGA TGTGGCTCAG CAGCTTTTAG GAAGCAAAGT GTACGTACAT CAATCCCGAG TTAATCTGAA ACCTGGATTT GATGGAAAAG AGTTCTACTG GCATTCGGAC TTTGAGACGT GGCACTCTGA GGATGGCATG CCAAACATGC GTGCTGTCTC GTGCTCTGTC AGTCTTACAA AAAACTATGA GTTTAACGGT CCTCTGATGG TCATTCCAGG ATCTCACAAA GAATTTGTCT CGTGTTCTGG TACAACTCCT GACAAACACT ACAAACAATC CCTAAAAAGA CAAGAGATAG GTACTCCTGA CAAGAAAATC CTTGAGGATA TGGTAGAAAA AGGAGGAATT GTATCTGCCA AAGGTGATGC AGGCTCTGCA ATATTCTTTG ACTGCAATAT CATGCATGGC TCTAATGGAA ATATTTCCCC TTATCCGCGA AGCAACGCGT TTATTGTATT CAATAGTATT CACAATAAAC TAATCACACC ATTTTGTGGA TTGGAACCTC GCCCCAACTA CATTGGATCC AGAGAGTTTT CTGTATTGGA TCCTATAGAA AACTTTCTAA ACTAA
|
Protein sequence | MYQNNTEMDF YPTRMNSESK IIPRTDPVVY PSSLTSLTRK QEDFFEKNGY LVFENLFSID EVTALLDELK ALSKDDSKKE LPQFILEETK KDVRSIFEIH KLSEIFSRLC KDSRLVDVAQ QLLGSKVYVH QSRVNLKPGF DGKEFYWHSD FETWHSEDGM PNMRAVSCSV SLTKNYEFNG PLMVIPGSHK EFVSCSGTTP DKHYKQSLKR QEIGTPDKKI LEDMVEKGGI VSAKGDAGSA IFFDCNIMHG SNGNISPYPR SNAFIVFNSI HNKLITPFCG LEPRPNYIGS REFSVLDPIE NFLN
|
| |