Gene Information |
Locus tag | Nmag_1469 |
Symbol | |
ID | 8824302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 1498557 |
End bp | 1500221 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | thermosome |
Protein accession | YP_003479609 |
Protein GI | 289581143 |
COG category | |
COG ID | |
TIGRFAM ID | |
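The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No"). The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmag_1469.
length_bp = gene_length(1498557, 1500221)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both results match the "Gene Length" and "Protein Length" rows of the record.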
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC GAATGCAGCA GGGACAGCCG ATGATCGTGA TGAGCGAGGA CTCCCAGCGC GTCAAGGACA AGGACGCGCA GGACTACAAC ATCAGCGCCG CCCGTGCGGT CGCTGAGTCC GTCAAGTCCA CGCTCGGCCC GAAAGGGATG GACAAAATGC TCGTCGACTC GATGGGATCG GTAACGATCA CCAACGACGG CGTCACCATC CTCCAGGAGA TGGACATCGA CAACCCGACG GCCGAAATGA TCATCGAGGT CGCCGAGACC CAGGAGGACG AGGCTGGCGA CGGCACCACG ACCGCCGTCT CCATCGCCGG TGAACTCCTC AAGAACGCCG AGGATCTCCT CGAGCAGGAC ATCCACCCGA CGGCGATCAT CAAGGGCTTC CACATGGCGA GCGAGCAGGC TCGCGAAGAG ATCAACGACA TCGCCGTTGA CGTCGACACC GAGGACGAAG ACCTCCTGCG CTCGGTCGCC GAAACCTCGA TGACTGGCAA GGGTACCGAG GTCAACAAGG AGCACCTCGC CGAGCTCATC GTCGAGGCCG TCCGCCAGGT CACCGTCGAG GACGACGAGG GCAACAACGT TGTCGACCTC GAGTTCCTCA ACATCGAGAC CCAGACCGGC CGCGGCGTTT CCGAATCCGA CCTCCTCGAG GGCGGCATCA TCGACAAGGA CCCGGTCCAC GACAACATGC CGACCTCGGC CGAGGACGCC GACATTCTGC TGCTGAACGA GCCGATCGAA GTCGAAGAGA CCGACATCGA CACCGAGGTC TCCGTCACGG ACCCAGATCA GCTCCAGCAG TTCCTCGACC GCGAGGAAGA GCAGCTTAAG GAGAAGGTTC AGCAGATCGC TGACCTCGAC GCTGACGTCG TCTTCTGCCA GAAGGGCATC GACGACCTCG CACAGCACTA CCTTGCCAAG GAAGGCATCC TCGCCGTCCG CCGCGCCAAG AAGTCCGACC TCGAGTTCCT CTCGGAGGTC GTCAACGCGG CCATCGTCTC CGACCTCGAC AGCGTGAGCG ACGAGGAACT CGGCCACGGC GACATCATCC GCGACGAGGA GGACGAACTG TTCTACGTCG AGGGTGAGGA CGCCCACGGC GTCACCCTCC TGCTCCGTGG CTCCACCGAC CACGTCGTCG ACGAACTCGA GCGCGGTGTC AACGACGCAC TCGACGTCGT CGCGCAGACC GTCTCCGACG GCCGCGCCCT CGCTGGCGGC GGTGCGATCG AGGTCGAACT CGCCTCGCGC CTGCGTGATT ACGCCGACTC CGTCTCCGGT CGCGAGCAGC TGGCCGTCGA GGCCTTCGCC GACTCGCTCG AGCTCGTCCC ACGCGTGCTC GCCGAGAACG CTGGACTCGA CTCCATCGAC ACGCTCGTCG ACCTCCGCGC CGCACACGAC GACGGCGACG TCGAGGCCGG CCTGAACGTC TTCACGGGCA ACGTTGAGGA CACCTACGAC GCCGGTGTCG TCGAGCCAGC CCACGCCAAG GAGCAGGCCG TGACCTCTGC CGCAGAGGCC GCGAACCTCG TGCTCAAGAT CGACGACATC ATCTCCGCCG GTGACCTCTC CACCGACAAG GGCGACGACG AAGGCGGTGC CCCAGGTGCC GGCGGCATGG GCGGTATGGG CGGCGGCATG GGCGGCATGA TGTAA
Protein sequence | MSQRMQQGQP MIVMSEDSQR VKDKDAQDYN ISAARAVAES VKSTLGPKGM DKMLVDSMGS VTITNDGVTI LQEMDIDNPT AEMIIEVAET QEDEAGDGTT TAVSIAGELL KNAEDLLEQD IHPTAIIKGF HMASEQAREE INDIAVDVDT EDEDLLRSVA ETSMTGKGTE VNKEHLAELI VEAVRQVTVE DDEGNNVVDL EFLNIETQTG RGVSESDLLE GGIIDKDPVH DNMPTSAEDA DILLLNEPIE VEETDIDTEV SVTDPDQLQQ FLDREEEQLK EKVQQIADLD ADVVFCQKGI DDLAQHYLAK EGILAVRRAK KSDLEFLSEV VNAAIVSDLD SVSDEELGHG DIIRDEEDEL FYVEGEDAHG VTLLLRGSTD HVVDELERGV NDALDVVAQT VSDGRALAGG GAIEVELASR LRDYADSVSG REQLAVEAFA DSLELVPRVL AENAGLDSID TLVDLRAAHD DGDVEAGLNV FTGNVEDTYD AGVVEPAHAK EQAVTSAAEA ANLVLKIDDI ISAGDLSTDK GDDEGGAPGA GGMGGMGGGM GGMM
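The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction such as the "GC content" field might be computed. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence, so it does not reproduce the record's 65% figure.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from the record, used as a small example.
first_blocks = "ATGAGCCAGC GAATGCAGCA"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Passing the full "Gene sequence" value through the same two functions would give the GC fraction for the whole gene.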