Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3272 |
Symbol | |
ID | 8826136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3393606 |
End bp | 3395246 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_003481384 |
Protein GI | 289582918 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.509561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAA GCATTGCAAG CGGTATCTTC TCGGTCGTCA GTACGAAGGT GATCGTTCTC ATTGTTACTG CCCTCTCGAC GCCGCTGCTG TACCGGTTCC TCGGTGCGTC GGGGTTCGGC GATTACTCCT TCCTGATGTC GGTGTTCGCC ATCTACATGA TCTTCGTGAG TTCTGGCATC ACGGATGGCG TCCGGAAGTT CCTCGCAGAG GATCGAAGCG CAGCCAACTG GAGCGAGCAC GTCGTCGGCT ACTACTTCCG GCTCGCGGTG ATCCTCGCCG TCGTGGGTGC AGCCTTGTTG CTCGCAAGCG TCCAGTTCGG ACTCGTCAGC CTCGCCTTCG GCGACGAGTT GGCGATTTAC TTCTACGCAC TCGCGCTGCT CGTTATCACC GCGCAGTTTC GCGACTACGC GCGCAAGACG CTCATGGGGT TCGGACTTGA GCGCTACTCC GAGCCGCTGA AAGTGCTTGA CAAGGTCGGC TTCGTCGTCG TCGCGATTCC GCTCGTCTAC GCTGGCGCTG GCGTCCTCGG CGCGCTCGCG GGCCACCTCT TTGCGAGCCT GCTGGTTGCG ACGGTCGGGC TGCTCATCGT CCACCGTCGG ATCTCGCTCT CCTGCGTGTT CAGCACGCCC AGCTCTGACT TCCCACGAAA GAAGATGCTC ACGTTCAACT CGATGAGCAT CGCGCTCGTT TTCCTGTTGA TGTCGCTCTA TCACATCGAC ATCGTGATGC TCCAGCAGTT CCGCGAGAGC GCCGACGTCG GAAACTACCG GGCTGCACTC ACACTCGCGG AGTTTCTCTG GTTCGTCCCG CTCGCCCTCC AGACGGTGTA CGTCCACTCG ACATCCGAAC TCTGGTCGCA GAACCGTCAC CGGAAGATCA CCGAACTCGC CTCGCGAACG ACCCGATACA CGTTGCTGTT GACGGTGATC ATGGCAGTCG GTCTCGCCGC ACTCGCAGAC GTCGCCGTCC CGATCTACTT CGGCGAGGAG GCAGTGCCCG CGATCGAGCC GCTCTTGCTC TTGCTCCCCG GCGCGCTTGG CTTCGCGCTC GCACGGCCGG TGCTCGCTAT CTCGCAGGGC AACGGCACGC TCAGATACCC CGTCGCCGCA ACCGGTGTCG CGGCGCTCAT CAACGTGATC CTCAACGCGC TGTTGATCCC GCGCTATGGA ATGCACGGTG CGGCGGTCGC GACGAGCGTC GGCTACGGCT CGATGTTCGT CTTCCACTGC GTGAGCGCGC GTCAGATCGG CTTCGATCCG CTCGCGGACG CCCGAGTCGG GCGCGGCCTT CTCGCCGCGT TGCTCTCCGG TGGGCCGATC TTCGCGCTCT CGGCGGCGAT CACACACCCG ATTCTCGCGC TCGTACTCGT CCCGCCGGTC GGCTTCCTGC TGTTCGTCGG TTTCGCCGTC CTCGTGGGCG CACTCGATCC GACGGAACCG TTCGAAATCC TCGGCCTCTT CCCCGATCCG ATCGGCTCGA AGGCGGACGC GATTCACGAC CGACTCGAAC GCTCGGCCGC AAGTCATGGC GAAACCTCAC GGGGCTGGCT CCAGCGACTG CTGTTCGTCG TCGGCCTCTC GCTACTCGCC TCGGGACTCG CGCTTAGCTT CCTCGGCCCA GCAGTCGACG CACTGCTCTG A
|
Protein sequence | MNRSIASGIF SVVSTKVIVL IVTALSTPLL YRFLGASGFG DYSFLMSVFA IYMIFVSSGI TDGVRKFLAE DRSAANWSEH VVGYYFRLAV ILAVVGAALL LASVQFGLVS LAFGDELAIY FYALALLVIT AQFRDYARKT LMGFGLERYS EPLKVLDKVG FVVVAIPLVY AGAGVLGALA GHLFASLLVA TVGLLIVHRR ISLSCVFSTP SSDFPRKKML TFNSMSIALV FLLMSLYHID IVMLQQFRES ADVGNYRAAL TLAEFLWFVP LALQTVYVHS TSELWSQNRH RKITELASRT TRYTLLLTVI MAVGLAALAD VAVPIYFGEE AVPAIEPLLL LLPGALGFAL ARPVLAISQG NGTLRYPVAA TGVAALINVI LNALLIPRYG MHGAAVATSV GYGSMFVFHC VSARQIGFDP LADARVGRGL LAALLSGGPI FALSAAITHP ILALVLVPPV GFLLFVGFAV LVGALDPTEP FEILGLFPDP IGSKADAIHD RLERSAASHG ETSRGWLQRL LFVVGLSLLA SGLALSFLGP AVDALL
|
| |