Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3039 |
Symbol | |
ID | 8825899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3134861 |
End bp | 3136810 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003481153 |
Protein GI | 289582687 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGACG AACGTTCGTC AGCGACACGA AGCGGCGAGC ATCCCCCCGA CGACTCCGAC GACACCCCGT CTGACACACC TTCCGACGAC GAGGACACCG GCGAGTCGAC CTCAGCCAGT GGGACGACCA GCCACTCTCT CTCACTCGAG TCCTTCCACG ACGCCTGCCA GCAGGCTGGC CGTCCCGTCC TCACCGCCGG CGCCGTCGCC CGTGCCCTGG ACCGGCCCCA CGAGGCGATC AGCGAGGACC TCGACGCACT CACAGAACGA GGGGCACTCG AGCGCCTCTC CGTCTCGACC GACCCCGTCG TCTGGTATCC GAGCGAACTC GAGGACCTGA CCGATCGCGA GCGCGTGGTC GTCTTCCCGA AGCGCCGGGA GATCATCGTC GACCGCCCCG ACCAGTTCAC CCGCGCCCAG CTCGCGCAGT TCGCTCACCT CGCGGACGGC AACGGCGAGG AGGGCTACCG CTACGTCGTC AGACCGGAGG ACATCTGGCA GGCCCCCCAC GACTCCTTCG ACGAACTCGC CCGAACCATG CGCCAGGCAC TCGGCCAGCG CTCGCGGGCA CTCGAGGACT GGGTCGAGAG CCAGTGGGAT CGCGCCCATC AGTTCCGCCT CACGACGCAC GAGGAGGGCT ACACCGTGCT GGAGGCAAAG AGTCCCGAAG TGATGGGCAA CGTCGCGCGC CAGAAACTCG ACGAGGAGCA CGTCCACGCC CCCATCTCTG AGACCGAAGA CTGGGTTCGC GAGGGAGCGG AGGCTGCGAT CAAGCGGATT CTCTACGAAG CGGGCTATCC CGTGCAGGAC CACCGCGAGC TGGAGGCCGG CGAGCAGCTA GACATCGAGC TCGGTGTCTC GCTGCGCGAC TACCAGCAGA CGTGGGTCGA CCGCTTCGCC GAGGCCGGCG AGGGCGTCTT CGTTGGCCCG CCGGGTAGCG GAAAGACGGT CGCTGCGATG GGTGCGATGG CCCACGTTGG CGGCGAGACA CTGGTGCTGG TTCCAAGCCG TGATCTCGCG CGCCAGTGGG CCGAGGCCAT CGAGGAGTAC ACCTCGCTCG AACCGGACCA GATCGGCCAG TACCACGGTG GCCAGAAGAA CGTCCGGCCC GTTACAATTG CGACCTACCA GATCGCGGGG ATGGATCGCC ACCGGTCGCT GTTCGACGAC CGCGAGTGGG GGCTCGTCGT GTTCGACGAG TGCCAACACG TGCCCTCGGA CGTCTACCGG CGGAGTACAC ACCTGCAGTC CCGGCACCGC TTGGGACTCA GCGCAAGTCC AATCCGCGAG GACGACCGTC AGACCGAGAT CTTCACACTC GTTGGTCCGC CAATCGGCAC CGACTGGCAG GCGCTGTTCG AAGCCGGCTT CGTCGCCGAA CCGGAACTCG AGATCCGCTA CGTGCCGTGG GGCGACGACG AGCAGTCAAA CGCCTACGCC TCCGCCGATG GCCGCGAGCG GTACCGCATC GCCGCCCGCA ACCGGGGGAA GATCGACGAG GTCCGATACC TGCTCTCCGC ACACCCCGGC TCGAAGTCGC TCGTCTTCGT CGACTACCTC GAACAGGGAC GTGACCTCTC CGACGCGCTC GACGTCCCCT TCCTCAGCGG GGAGACCCCT CATCACGAGC GCCGGCGGCT ACTCGAGGAG TTCCGTCGCG ACGAGCGCGA CCTGCTGATC GTTTCGCGCG TGGGTGACGA GGGAATCGAC CTTCCGACGG CGGATCTGGC GATCGTCGCC TCCGGACTCG GTGGCTCGCG CCGGCAGGGG ACCCAGCGCG CGGGGCGGAC AATGCGGCCG GCAGGTGGCG CGCTCGTCTA CGTGCTCGCG ACGCGCGGGA CGCGCGAGGA GGACTTCGCT CGGCGGCAGT TGCAGCATCT CGGCCGAAAG GGGATGACGG TGCGTGAGGA GACTATCGAG CGGGAAGACG ACGCGAGTGA CGGACAATAG
|
Protein sequence | MTDERSSATR SGEHPPDDSD DTPSDTPSDD EDTGESTSAS GTTSHSLSLE SFHDACQQAG RPVLTAGAVA RALDRPHEAI SEDLDALTER GALERLSVST DPVVWYPSEL EDLTDRERVV VFPKRREIIV DRPDQFTRAQ LAQFAHLADG NGEEGYRYVV RPEDIWQAPH DSFDELARTM RQALGQRSRA LEDWVESQWD RAHQFRLTTH EEGYTVLEAK SPEVMGNVAR QKLDEEHVHA PISETEDWVR EGAEAAIKRI LYEAGYPVQD HRELEAGEQL DIELGVSLRD YQQTWVDRFA EAGEGVFVGP PGSGKTVAAM GAMAHVGGET LVLVPSRDLA RQWAEAIEEY TSLEPDQIGQ YHGGQKNVRP VTIATYQIAG MDRHRSLFDD REWGLVVFDE CQHVPSDVYR RSTHLQSRHR LGLSASPIRE DDRQTEIFTL VGPPIGTDWQ ALFEAGFVAE PELEIRYVPW GDDEQSNAYA SADGRERYRI AARNRGKIDE VRYLLSAHPG SKSLVFVDYL EQGRDLSDAL DVPFLSGETP HHERRRLLEE FRRDERDLLI VSRVGDEGID LPTADLAIVA SGLGGSRRQG TQRAGRTMRP AGGALVYVLA TRGTREEDFA RRQLQHLGRK GMTVREETIE REDDASDGQ
|
| |