Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_1115 |
Symbol | |
ID | 8823946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1135341 |
End bp | 1137041 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003479261 |
Protein GI | 289580795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCGA CGGGTGCTGA TCTCTTCATC GACAGTCTCG AATCGTACGG TGTCACGAAA TTGTTCGGTA ATCCCGGCAC CACCGAACTC CCGCTCATGC AATCGCTCGT CGAGAGCGAA CTCGAGTACG TGCTCGGCTT GCACGAGGAC GTAGCGGTGG GAGCGGCGGC GGGGTATGCA ATGCGGCGGC GGCATCACGC GACCGAAGCG GCAACTGGTA CTGGTGGTCG CGATGCCGAC GACGTCCTCC CGCTCGGCGT CGCGAACCTC CACCTCGCCG GTGGACTGGC ACACGGGCTG GGGAACCTCT ACAACGCAGA CGTTTCGGGC GCGCCGCTGC TGGTCACGTC GGGAACCCAC AGCCGGGACT ACCAGCAGGA GGAACCGATT CTCAGCGGCG ACCTCGTCGA GATGGCCGAG CCGTTCACGA AGTGGAGCGC CGAGGTAAAA CACGTCGATG CGCTGCCGAC GATGGTTCGT CGGGCCGTCC GGACCGCGCT GACGCCGCCG ACGGGACCGG TCTTCCTCTC GATTCCGGTC GACGTCCAGA CGGAGGAGAC CACGGCTGAA CCGGAGCCAC TCGGTCGGAT TCCGACGGCG GGCCGGGGCG ACGAGGCGGC GATTCAGGAG GCTGCCACGA TGCTCGCCGA CGCCGACGAA CCCGTCTTCG TCCTCGGCGA CGAGGTTGGG CGCAGCGGTC CGGCAGCGGT TGAGGCTGCG GTCGACCTCG CTGAAGCGAC GGGCGCACGC GTCCACAACG AAATCCTCGC CTACGAGGCG AACTTCCCGA CGGACCACGG CCAGTGGCAG GGCGCGCTCT CGACGAAAGC GCCCGGCTCG GCCGCCGCGA TGGACACGGA CACGCTCGTC TTCGTCGGCT GCTCGACGAA CACGACGGTA ACGCGGCCGA CGACTCAACT CGTCCCCGAT GAGGCGACAC GAGTTCACAT CTCGCCCGAC GCGTGGGAAC TCGGCAAGCA CGCTCCCGCA GAGACTGCGG TGCTTGGCGA CCCCGCGACC GTCCTTGCGG ATCTCGCCGA TCGTGTCAGC AACGCTGTCG ACGACGGCGA ACGCGAACGG CGACTCGAGT CCGTCCGCGA CTGGGCGAAC GCACACGACA CCGATCCTGC CCCCGAAACG ATCGACGGGA CGCTGACGAA AGCGGGGCTC GCTCGGGCGT TCGACAGCGT TGCGCCGGAT GCGCTCGTGG TCAGCGAGGC AATCACCGCG TCGCCGCCGC TGTTCGACGA GTTCGAGTTC GAGGCCAACC AGCTCCTGGG AACGAAAGGT GGCGGCCTCG GCTACGGACT GCCGGCCAGC GTGGGCGCTG CGGTCGCAGA GCAGGAAGCC GGCGGCGACC GCTCGGTGCT TGGCTACGTC GGCGACGGCT CGTACCTCTA CTACCCGCAG ACGCTGTACA CGGCGGTTCG AAACGACCTC GATCTCACCG TCGTCGTCCC CGACAACCGC AACTACCGCA TCCTGAAGGA GAACACGGCG AACCTGCTCG GCGGCGACCC CGACGAGTAC GAGTACGACG GCTTCGACTT CGAGCCGCCC GTCGACATTG CAGCGAGTGC CGCCGCCCAC GGTGCGACGG GAGTGACTGT CGAGGAGCCA GGGGAACTCG AGTCGGTACT CGAGGACGCA CTGACGACGG CGGGGCCTGC GGTTGTCGAC GTACCGGTGA CTGACGAATA A
|
Protein sequence | MVATGADLFI DSLESYGVTK LFGNPGTTEL PLMQSLVESE LEYVLGLHED VAVGAAAGYA MRRRHHATEA ATGTGGRDAD DVLPLGVANL HLAGGLAHGL GNLYNADVSG APLLVTSGTH SRDYQQEEPI LSGDLVEMAE PFTKWSAEVK HVDALPTMVR RAVRTALTPP TGPVFLSIPV DVQTEETTAE PEPLGRIPTA GRGDEAAIQE AATMLADADE PVFVLGDEVG RSGPAAVEAA VDLAEATGAR VHNEILAYEA NFPTDHGQWQ GALSTKAPGS AAAMDTDTLV FVGCSTNTTV TRPTTQLVPD EATRVHISPD AWELGKHAPA ETAVLGDPAT VLADLADRVS NAVDDGERER RLESVRDWAN AHDTDPAPET IDGTLTKAGL ARAFDSVAPD ALVVSEAITA SPPLFDEFEF EANQLLGTKG GGLGYGLPAS VGAAVAEQEA GGDRSVLGYV GDGSYLYYPQ TLYTAVRNDL DLTVVVPDNR NYRILKENTA NLLGGDPDEY EYDGFDFEPP VDIAASAAAH GATGVTVEEP GELESVLEDA LTTAGPAVVD VPVTDE
|
| |