Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3837 |
Symbol | |
ID | 8826707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | + |
Start bp | 224641 |
End bp | 226251 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | histidine ammonia-lyase |
Protein accession | YP_003481940 |
Protein GI | 289583530 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.783362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTCA CCCTCGACGG CGACTCGCTG ACGATCGAGG AGGTCGTTGC CGTCGCTCGC CACGGGGAGG CTGTCGAACT CGCGAGCGAG GCCCGCGAGC GCGTTCGAAC CTCTCGAGAG CGAGTCGAGG ATATCGTCGA GGCCGGCGAC CCGGTGTACG GCGTAAACAC CGGTTTCGGC GAACTCGTCG ACACGCAGGT CCCACGAACC GAGGTCGCGG AACTGCAGAC CAATCTCGTC CGGAGTCACG CCGCCGGCGT CGGCCGCGAA CTCAGCCGCG AGGAGGTTCG GGCGATAATG GTCACGCGGC TCAACGCGCT CCTCGCGGGC TACTCTGGTA TCCGCCTTCG TGTGGTCGAA CTGCTCGCGT CCCTGCTCAA CGAGCAGGTC CATCCGGTCG TCCCCTCGCG TGGGAGCCTC GGCGCGTCGG GCGATCTCGC ACCGCTCGCA CACCTTGCAC TCGTCCTGAT CGGCGAGGGC GAGGCCGATG TGGCGGTCGA TTCGTCGGCT GACGGCGGAT CCGGCGGCAG TTCCGCTGCA CCCGAGTCCG AGCGCCTTCC CGGTGACGAG GCGCTCACAG CAGTGGGACT CGAGTCGGTC GGCCTCAAAG CGAAGGAGGG ACTCGCGCTC ATCAACGGCA CCCAGTTGAC GCTCGGACTC GCGGCCCTGC TCGTCGCCGA CGCCGAGCGA CTCTGTCGCG CGGCAGACGC CGCTGGCGCG CTCACGACGG AGGTGACGAT GGGGACGACG GCTGCTTGCG CCGAGCCGCT CCACGAGGTG CGACCGCACG CGGGTCAGGC GACGAGCGCG CGCACCGTGC GACGACTAAC CGCTGACTCG GACGTCGTCG AATCACACCG CAACTGCGAC CGGGTGCAGG ATGCCTACTC GATCCGCTGT CTGCCACAGA TCCACGGCGC GGTTCGAGAC GCCGTGGCTC ACCTCCGCGA GGCCGTCGAA ATCGAACTCA ACAGCGTCAC CGACAATCCG CTCGTCTTCC CGCGGGCGGC CTTCGACGAC CGCGCCTCTG GGACGGACCA GGGTGCGGTC GTCTCCGGCG GGAACTTCCA CGGTGCGCCG CTCGCTCATC GACTCGACTA CCTCATTGCG GCGCTGACCG ACCTCGCCGC CGTCGCCGAA CGACGGACCG ATCGACTGGT GAACCCGAAC CTCCAGGAAT CGCACCTGCC GCCGTTCCTC GCCCAGCGCT CCGGCGTCGA ATCCGGGCTG ATGATCGCCC AGTACACCGC CGCTTCGCTG GTCAACGAGT GTCGAAGCGT TGGCCGTGCG GCGACCGACA ACACGCCCGT CTCGGGCGGT CAAGAGGATC ACGTCAGTAT GAGCGCGACA GCGGCGGTCA ACGCGCGGCG AGTGCTCGAC CGTGCTCGCC AGGTCGTCGC CACCGAACTC CTCTGTGCAG CCGAAGCCGC CGAGTACGTC GACGAATCGC TCGGATCTGG AACCAGTGAG GCCTACGAGA CGGTCCGTGA AGTCGTCCCG CCGTTGACCG GCGACCGTCG ACCGGACGCC GACATGAAGG CAGTCGACTC GCTGATCGAG ACCGGCGTGC TCGACGACGC GGTCGATGAA CTCGGTGCTG ACACCCAGTG A
|
Protein sequence | MTVTLDGDSL TIEEVVAVAR HGEAVELASE ARERVRTSRE RVEDIVEAGD PVYGVNTGFG ELVDTQVPRT EVAELQTNLV RSHAAGVGRE LSREEVRAIM VTRLNALLAG YSGIRLRVVE LLASLLNEQV HPVVPSRGSL GASGDLAPLA HLALVLIGEG EADVAVDSSA DGGSGGSSAA PESERLPGDE ALTAVGLESV GLKAKEGLAL INGTQLTLGL AALLVADAER LCRAADAAGA LTTEVTMGTT AACAEPLHEV RPHAGQATSA RTVRRLTADS DVVESHRNCD RVQDAYSIRC LPQIHGAVRD AVAHLREAVE IELNSVTDNP LVFPRAAFDD RASGTDQGAV VSGGNFHGAP LAHRLDYLIA ALTDLAAVAE RRTDRLVNPN LQESHLPPFL AQRSGVESGL MIAQYTAASL VNECRSVGRA ATDNTPVSGG QEDHVSMSAT AAVNARRVLD RARQVVATEL LCAAEAAEYV DESLGSGTSE AYETVREVVP PLTGDRRPDA DMKAVDSLIE TGVLDDAVDE LGADTQ
|
| |