Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0945 |
Symbol | |
ID | 8823775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 972009 |
End bp | 973265 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003479091 |
Protein GI | 289580625 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000913985 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCCT CCACAGATCG ACGACGATTC GTGCAGGCAA TGGGCGGCGG ATTTGCGGTT GCACTCGCCG GTTGTTTGAG TGACGACGAG GAAAACGGTG GGGAGAACGG TGGTGACGGG ACTGCCACCG AGAACGGTGA CGATGATGGA ACCGGTGACG CCGACACCGA GGGCAGCGAG GGCCTCGTCT ACGCCTTCGC ACCGGATCAA ATCGCAATCA TCGACCCCGA AGAGGGCGAA CTCGTCGACG AGATTACTGA CGGGATCGAC GACGAAGGAT GGGGTGACCC GCGAATTACT GCGGACTACA GCGAGATTTA CGTCATTCGA GAGTCGCCGT CGCAGGTGCT CGTCATCGAC ACCGACACCC GCGAAGTGGT CGACGAAGTC GACGTTGGCC CGGGGCCGAC GCACATGTAT CACCCGAACG ACGACGAGAT GTGGGTTCAC TCGGACGACG AAGGCACGTT CTACGTTATC GACACTGACT CCCACGAGGT CACGGAGATC ATCGAGTCGG GCCGCGAAAA CGAGGGCCAC GGCAAACTGC TCTACCACGA GGACTTTGGC TCGATGGGGT ACGCGACGAA CGTGAACGAC CCTGGCGCGC CGGTGATCGA CCTCGAAAAC TACGAGCGAA GCGATTTCAT CGAATTCGAC GATGTCGACG ATCAGGGCAC CCATTACAAG GCGTACAGCC CCGAAACGGG GCTCGCGTAC TTCGAATTCG GGGACGAGAC CGTCGTTGTG GACACCGAGG ACGACGAAAT CGTCGACACA CTCGACTTTG CGGGCGGTAT GTACCTCTCG CCCGACGAGC AGGTGCTCGG CTTCCTCGAC GGTGACAGTA TCCGGTTCAT CGACGCGACC AATGAGGACA GCGAAGAACT CGGCGTCGTC GACGTCGGTG AGGGTCCTGA CGCGCTCCGC TATCACGAAG GTGAGGACGG TGCGCTGTAC GCGGCGACCG CCCACACTCA CACCGACGAG GCGTCGATTA TCGACGTCGA CGAACTCGAG GTCGTCGAGA CGGTCGACGT CGGCGACATC GTCCGTCCCG AGGGCGCCCA CCACTTCCAC CGCTCCGGCG TCGCCGGCGG CGACTACTTC ATTACTCCCG CCGACGAGGA CGGAATCGTC GCGATCGTGG ACATGGAAGC ACAAGAGGTC GTCGACCACG TCGAGGTGGC GGAGGGTGTC GACACGGTCC AGTACGTCGG GGACTCGGGC GTCGGCTACT CCAGTCGACT CCGCTAA
|
Protein sequence | MDPSTDRRRF VQAMGGGFAV ALAGCLSDDE ENGGENGGDG TATENGDDDG TGDADTEGSE GLVYAFAPDQ IAIIDPEEGE LVDEITDGID DEGWGDPRIT ADYSEIYVIR ESPSQVLVID TDTREVVDEV DVGPGPTHMY HPNDDEMWVH SDDEGTFYVI DTDSHEVTEI IESGRENEGH GKLLYHEDFG SMGYATNVND PGAPVIDLEN YERSDFIEFD DVDDQGTHYK AYSPETGLAY FEFGDETVVV DTEDDEIVDT LDFAGGMYLS PDEQVLGFLD GDSIRFIDAT NEDSEELGVV DVGEGPDALR YHEGEDGALY AATAHTHTDE ASIIDVDELE VVETVDVGDI VRPEGAHHFH RSGVAGGDYF ITPADEDGIV AIVDMEAQEV VDHVEVAEGV DTVQYVGDSG VGYSSRLR
|
| |