Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_1689 |
Symbol | |
ID | 8824529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1714998 |
End bp | 1717718 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003479827 |
Protein GI | 289581361 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.268874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTCAG GATCGGCAAC CGAGACGGGC GGGGTAGCGA ATCGCGAGTC AGAACCATCA GCCGGCAGGG TTGGGGACGT GGGCGAGAGT AGGGATGGTG ACAAGGGTGA GGATGGAGAC CCAGGGCCGG CCGAACGGCC GGACTGGGGG CAGTTTCTTC TGGTCTCTCT CGCCGTGCTA GCACTTACAC TCGCCGCACT CACCGTCCCC GCACTCGGTG GCGTCGCGGG GACGCCGGGC GATGGGAGCG GCGATCTGTT CCCCGATGAA GTGGCAGAAC TCGAAGACGG GGACGGAGAG CCGGACTCGG AAGAAGATTC GAATGGGGAT GCAGAGACAG AGACGGCTAC TGAAGAGCGA GAGGAACAAG ATCAAGACCA GGAGCAGGGG CAGGAGTCGG AACACGGGCA GGAGGGACAG GAACAGGAGT CGGAACACGG GCAGGAGCAA GACGATGCGC CCGACGCCGA GAGCGACCCT GAGAGCGGAG AGATCGACGA GGACAACGAT AGAGACGGCG AGCAGAACGA CGATGCGCCC GACGAGATCG AAGATCACGA ACAGGTCGAC CCTGACGAAG AGTCGACTGA CGAGTCTGGT GCGAACGAGC CAGAACACGG TGACGAGGAG CCGGATTCAG AAATCGACGC TGACGATGCG ATCGCCGAAG AGTCGGATGC GGACAGCGAA ATCGAGGGAG GCGATACCGA GGCCGAACAG GTCGAGGAGG GCGAAGAGGG CGAAGAGGGC GAAGAGGGCG AAGAGGGCGA GCAAGACGAG CAAGACGAGT CAGACGAAGA AACGGACGAG CACACCGAAA CCGACAGCGG CTACGACATC GCGTTCAACG AGTCGCCGAC GCCCGGCAGC GTTGTCGAGG TAACCGTCAC CGAAAACGGT GAGACGCGCG AGAACGCGGG CGTCACCTTC AACGGGGTGC CGGTCGGCGA AACAGACGCC GACGGAACGG TAACTGGAAC GGTCCCATTT ACGGACACGC TGACGGTCAC AATCGATCCG GCGGACGCAG CCGCTGAATC GACGACCGAG CCGTCGATCC GTGGCCCAGT CGGCGGCCCA GTCGGGGCAC CGGTCCACTT CTACGGCGGG AGTGGAGCAG TTCCGGCGTC CGATCACGAC GAGACAGGTT CTGCCAGCGA TGACGACAAC GACGGTGAGG GAGACAGCGA AACCGCCGAG CGAGAGCCGG AAACCGAACG GACCGTCGAG ATGGACGCCG AAACCGAACT CACGCTCGAC CGTGAAACGC CGATGGACCC CGAGGACGCC GTGCTTGCCG GGACGAACGC CACGGTGACG GCGACTGTTG CGGAGAATCC GATTCCGGAC GGTACGGTGC TCATCGACGG CGAGGAAGTC GCGACGACAG ATGCGACCGG AACTGCACAG CTCACCGTTC CCGAGACGAC CGGAGAGATG GCGATCGCCG TCGAGCGTGA CGAGATTCGT GCCGAACGAG AGTTTGCGGT CTACGGTCTC GCCGTTGACG TGACAGAGCG GATCCCGCTA CCCGGCCGAA CCGTGACCGC CGACGTCTCC TACACGGGGC CACCGGTAGA AGACGCTACC AACGACGATT CAGCGGCGAA CGAGAGCACG AGTGAAACAC CCGACGCAAC CGCTAACGCG ACGGTGTTCC TCGATGGAAC CGCCGTCAGC GAGACCGGAG CGGACGGAAC TAGCTCGGTC AGACTACCCC TCGCGAACGA GGCCACGCTC GGCGCAGCGG TCGGCGAATC CACGGCAGAA ACGACCGTCA GCGGACTCTA TCGAAACGCC GCGCTCGTCG CACTCGGTGT GCTCACCCTC GTGACCGCAC TCGGCTGGCT CCTCGTTCGC CGGTTCGGAC TCTCGAGAAA GAGCGTTCGA TCGATCCCGT CGCTCATCCG GCGTGCGGTC CGCCGTGCGG GCACGCTGGC ACAGTTGATC GGTCGAAAGG GAGTCGAAGC GGTTGTCCGA CTCGCGCGCA GCCTCGAGCG GGCGGGTACC TGGCTCGCCG AGCGCGGTCG GGAGGCGATA GCGGCAACGC GACGCGCGGG GCGCTGGCTG CTCGCACTGC CCGGCGAACT CGCAAAGCGA GGGTTCGCCG CACTGGCGGC GATACACCCC ACACGGCTCG TCAGCACTCT CCTCGCATTG CTGCGGTCGT TCGGAAAGTC GTCCCGTGAA CGGGTATCGT CGATGACCGG ATCGGCACAG ACGGATACCG CAGCGGCCGG CTCAACGAGT GACCCCGATC AGTCCGTTCG CACACTGCGA ACACTCTGGC ACGAGTTCAT CCGCGCGGTT CGTCCGCCAC GGATTCGAAC GAAGACGCCC GGCGAAATTG GGCGCTACGC GGTCGACAAA GGGTTCCCAG AGCCACCGGT TCGAACTGTC GTCGACGCGT TCCGGGATGC AGAGTATGGC GAATCCTCGC CAACGGAGAC GCGACTCGAG TCGGTCGAGC GTGCGGTCGG GTCGGTGACC GAGCCGGAGT CGGAAGCGAG AGTGGATGGA CAGTCAGCAG ACAACACCGA AGTCTCGGAC GCAGACCGTG TGAAATCCCG CGACGACACA GCAGTTGACG AGGATCTACT CCCTGGAAAC CAGGGAGACT CCACAACGGA GAACGAGACG AACCAGTCCG ACGGGCCATC GCCGGCGAAC AACCCCACGT CAGCGGACGA ATCAGCTAAC GGACACGGAG GGCCACAATG A
|
Protein sequence | MGSGSATETG GVANRESEPS AGRVGDVGES RDGDKGEDGD PGPAERPDWG QFLLVSLAVL ALTLAALTVP ALGGVAGTPG DGSGDLFPDE VAELEDGDGE PDSEEDSNGD AETETATEER EEQDQDQEQG QESEHGQEGQ EQESEHGQEQ DDAPDAESDP ESGEIDEDND RDGEQNDDAP DEIEDHEQVD PDEESTDESG ANEPEHGDEE PDSEIDADDA IAEESDADSE IEGGDTEAEQ VEEGEEGEEG EEGEEGEQDE QDESDEETDE HTETDSGYDI AFNESPTPGS VVEVTVTENG ETRENAGVTF NGVPVGETDA DGTVTGTVPF TDTLTVTIDP ADAAAESTTE PSIRGPVGGP VGAPVHFYGG SGAVPASDHD ETGSASDDDN DGEGDSETAE REPETERTVE MDAETELTLD RETPMDPEDA VLAGTNATVT ATVAENPIPD GTVLIDGEEV ATTDATGTAQ LTVPETTGEM AIAVERDEIR AEREFAVYGL AVDVTERIPL PGRTVTADVS YTGPPVEDAT NDDSAANEST SETPDATANA TVFLDGTAVS ETGADGTSSV RLPLANEATL GAAVGESTAE TTVSGLYRNA ALVALGVLTL VTALGWLLVR RFGLSRKSVR SIPSLIRRAV RRAGTLAQLI GRKGVEAVVR LARSLERAGT WLAERGREAI AATRRAGRWL LALPGELAKR GFAALAAIHP TRLVSTLLAL LRSFGKSSRE RVSSMTGSAQ TDTAAAGSTS DPDQSVRTLR TLWHEFIRAV RPPRIRTKTP GEIGRYAVDK GFPEPPVRTV VDAFRDAEYG ESSPTETRLE SVERAVGSVT EPESEARVDG QSADNTEVSD ADRVKSRDDT AVDEDLLPGN QGDSTTENET NQSDGPSPAN NPTSADESAN GHGGPQ
|
| |