Gene Information |
Locus tag | Nmag_1074 |
Symbol | |
ID | 8823905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1096949 |
End bp | 1099954 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | protein of unknown function DUF1508 |
Protein accession | YP_003479220 |
Protein GI | 289580754 |
COG category | |
COG ID | |
TIGRFAM ID | |
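The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1096949
END_BP = 1099954
GENE_LENGTH_BP = 3006
PROTEIN_LENGTH_AA = 1001


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same check can be used to spot records where the coordinates, gene length, and protein length have drifted out of sync; it would not apply as written to spliced genes or pseudogenes.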
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTGTTC TGCCAGGTGG CCGGAGGTTT ATTATGATGG GTTGCAGTTC TCCGCTCACA TTTCGAATGT CTTCACCAAG TGACGTTCAC CACAAACTGT ACCGGCTGTA CGAACGCTAC GTCGGCGAAC CCGACTCGAC GAAGGATGTC TACGGCTACT GGCTGTTCAT CGTCGGCTAC GTCGTTGCTG CGGCGGCCGT ATTCACGTTC GTTGCCGGCT ATGCGGGGGA TGCAGATACG TACGGGCTAA TCAGGGCCTC GGGTGTGACC GCGGCGACAG GGCTTGCACT CTGCCTGTTC GGGATCGTTC TCATGCTTCC AGTTCGAAGA CGAGGAATTC AGGCGAGTGT GTTGGGACTG CTGATTTCGT TTGGTGGCGT TGCGTTCTTC GCGTGGGCGT ATCCGTACAA TTGGCGAGAA CTCGGCACGG ACTACAGCGT TCACGTCATC CTCGTCTACA CGGTCGGGAT TGGGATCATC GCGGGTGTTA CTGCGCTCGT TCCCGTTCTG ACCGGCCGGA AAGGGATGTT CGTCGAGGAG GAAGGAGAAA CGGAAGATCC GCCGATTCTC ACCGGAGATG CACTGGAGGG TGCACAGTTC GCTGTCTTCC GTGACGACAA CGGCGACTGG AAGTGGAACG TTCTCCATCT CGAAGCGCTG GCGACGAGCA ACGAGAGCGC CGTGACTCGA CCGAAGGCCA CCGAGGGAAT TGAACGCGTC CAGTCCCAGA TCAGTTCAGC TGGACTGATG GAGCTCACCA CGTCCGCGTT CCGGCTCTAC GAGGACAGAG ATGGGAGCTG GCAGTGGACG CTCGCCCGAG ACGACGGCAG CGTCGTCGGC ACCTGTGCTG GCGAGTTTAG CGAGCGCGAT GGGGCTGAGG AGTCCGTGAG CTTCCTCAAA GATCGTGGAC CAACGGCAGA CGTGATCGAA ATCGACGGCG CGGCGTTCAC GTACGCCGAA GAACGCGACC AGTGGCACTG GCAACTGGTG GACGACGAGC GGTTGCCGCT GGCTTCGGGT GCGAACGGCC ACGGCACCCA GGAGAACGCC GAGACGGCCG CACGCACGTT CGCCGAGCGG TTCGACCAGG CACGCGTACT CGACCTCGAA CACGTTGGTG CCGAACTCTA CGACCGAACG GACGACAGCG GCGCGAACGG CTGGTCCTGG CGCTTCGTCG ACGAACAGGA TTCACCGCTT GCCGCCGCAA CCGACGCGTA CGACGCCCGG CGCGACGCAG AGGAAGCTGC GGATGCACTG CTTTCGGAAC TCGGCAGTGC GTCGGTGACG GTGGCTGGCG AACCAACCTA CGAACGCTAC CAGACCGGCG ACCAGTGGCG CTGGCGGCTG GTCGGCGAGT CCGAACACGT TGTCGCCCAA AGTCCAAGCG ACGCCGAAAC CGAGGCCGAC GCGACTCACG AGACCGACAC CTTCGGAGCA CACGCCCGCG ACGCCGACGT CGTCGAAATC GAGGACGCGG AGTACGAGGT CTATCCGACC GACAGCCAGG AACTAACCTA CGAGGAGGGC GACGCACTGC CTGCAACGTC CGACGAGCAG CAGATGGTGT CGACCGACGG CGGCACGGCG ACGGCCGAGG GGGAGGACGG CGCAGACGAC GGCCGCTCCT GGCACTGGCG TCTCGTCACC GAAGACCGCG ACGTGATCGC CGGAAGCACC GAACCCCACT ACGACGCCGA GACGGCGACC GAAGCGATCC AGCGCGTTCG CGAGCAAGCG AGCGAAGCCG AACTCATCGA GTTCGAGGAG GCTGCCTTCC AGGTCTACGA AGCCGATGAC 
GGCGAGTGGC GCTGGCGGCT CATCGACGAG GACGGCAACG TCCTCGCAGA CAGCGGTGCA GAACACACCT CCCGCGGCGA GGCCGCAGAA GCGATGATGA CGCTCAAAGA GCAGGCGCCG GACGCCGAAC TGCTCGAAAT CGAAACGGCA GCCTTCGAGC TCTTCGTCAA CGAGGACAAC GAATGGGGCT GGCGACTTAT CGACGAAGCC GGTCAGCTCG TCGCCGAAGA TCCGTCGACG CACCCAACCC GCGGTGCCGC GCGCAAGGCG ATGAACCGAC TCCTCGAGTA CCTCGACTCT GACGTGCGGA CCATGGAAGA TGCGATCTTC CAGCCGTACG CAGCGGACGA CTGGCACTGG CGGTTCGTCC TGCCAACCGG GGAAACGGTC GCCGTTGCCG GTGACACCTA CGCGACACGC GACGAACTCG TCGATGCCAT CCCTGCCGTT CGCGACGCAG CCGAATCCGC ACAGGACTAC ACGATCGGCA ACGTCACGAT CCAGCTCTAC CGCAGCGGTG ATTGGAGCTT CCGACTCCTC GACCGCGATC GCAAGGAGAT TGCCGACGCG ACTGACACCT ACGCGGAACG CGACGCCGCA CTCGAGATCG TCGAAGATCT CAAAGCACAC GCCGACGATG CCCCGATCTT CACGATCGAG GACGCCGCGA TCCGCGTCAC TGACGCTGAC ACGGACGACG GCTGGACATG GGACCTCGTC GACCGCGAGC GCACCGTCCT CGCAAGCGCC GTCGACACGG TGGCGAGCCG CGAGGAACTT CACGAGGAGA TCGAAACTGT CCGCCAGCTC GCACCGATGG CCGGCCGTGT CGACTTCGAC GTTGCCTCGT TCGAACTCGT CGCCGACGAG GACGACCGCT GGCAGTGGCG GCTCATCGAC GAGGACGGCC ACACGGTCGC CACCGGCTCC GAATCACACG AATCGAGCGA GGCCGCTCGT GAGGCACTCG AGAACGTCCG CGAACTGATC GACGCAGCGA GCATCCTCGA GATCGACAGC GTCTCCTTCG AACTCCATAC CGCGGAGGAC GAGAACGAGG ATGGCTGGGT CTGGCGGCTG GTCGACGAGT ACGGCTCGAC GATGGCCCAG AGCACGCAGG TTTACGAGTC CCGGACGGAC GCCCGTGAGG CGATGAACAA CGTGAAAGCG GAAGCCCCAG AGGGCTGGAT CACGTTCACG GAGTAA
Protein sequence | MSVLPGGRRF IMMGCSSPLT FRMSSPSDVH HKLYRLYERY VGEPDSTKDV YGYWLFIVGY VVAAAAVFTF VAGYAGDADT YGLIRASGVT AATGLALCLF GIVLMLPVRR RGIQASVLGL LISFGGVAFF AWAYPYNWRE LGTDYSVHVI LVYTVGIGII AGVTALVPVL TGRKGMFVEE EGETEDPPIL TGDALEGAQF AVFRDDNGDW KWNVLHLEAL ATSNESAVTR PKATEGIERV QSQISSAGLM ELTTSAFRLY EDRDGSWQWT LARDDGSVVG TCAGEFSERD GAEESVSFLK DRGPTADVIE IDGAAFTYAE ERDQWHWQLV DDERLPLASG ANGHGTQENA ETAARTFAER FDQARVLDLE HVGAELYDRT DDSGANGWSW RFVDEQDSPL AAATDAYDAR RDAEEAADAL LSELGSASVT VAGEPTYERY QTGDQWRWRL VGESEHVVAQ SPSDAETEAD ATHETDTFGA HARDADVVEI EDAEYEVYPT DSQELTYEEG DALPATSDEQ QMVSTDGGTA TAEGEDGADD GRSWHWRLVT EDRDVIAGST EPHYDAETAT EAIQRVREQA SEAELIEFEE AAFQVYEADD GEWRWRLIDE DGNVLADSGA EHTSRGEAAE AMMTLKEQAP DAELLEIETA AFELFVNEDN EWGWRLIDEA GQLVAEDPST HPTRGAARKA MNRLLEYLDS DVRTMEDAIF QPYAADDWHW RFVLPTGETV AVAGDTYATR DELVDAIPAV RDAAESAQDY TIGNVTIQLY RSGDWSFRLL DRDRKEIADA TDTYAERDAA LEIVEDLKAH ADDAPIFTIE DAAIRVTDAD TDDGWTWDLV DRERTVLASA VDTVASREEL HEEIETVRQL APMAGRVDFD VASFELVADE DDRWQWRLID EDGHTVATGS ESHESSEAAR EALENVRELI DAASILEIDS VSFELHTAED ENEDGWVWRL VDEYGSTMAQ STQVYESRTD AREAMNNVKA EAPEGWITFT E
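The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch in Python of that cleanup together with a simple GC-percentage calculation. The helper names are hypothetical, and how the database derived its reported GC content is not stated in the record, so treating it as the plain fraction of G and C bases is an assumption.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGTCTGTTC TGCCAGGTGG CCGGAGGTTT")
print(len(prefix))  # 30 bases once the spaces are removed
print(round(gc_percent(prefix), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` should give a value comparable to the GC content field, provided the database computed it the same way.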