Gene Nmag_3837 details


Gene Information

Locus tag: Nmag_3837
Symbol:
ID: 8826707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 224641
End bp: 226251
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: histidine ammonia-lyase
Protein accession: YP_003481940
Protein GI: 289583530
COG category:
COG ID:
TIGRFAM ID:
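
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but they are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(224641, 226251)
    print(length, protein_length_aa(length))  # prints "1611 536"
```

With the values from this record, 226251 - 224641 + 1 gives 1611 bp, and 1611 / 3 - 1 gives 536 aa, matching the Gene Length and Protein Length fields.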


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.783362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGTCA CCCTCGACGG CGACTCGCTG ACGATCGAGG AGGTCGTTGC CGTCGCTCGC 
CACGGGGAGG CTGTCGAACT CGCGAGCGAG GCCCGCGAGC GCGTTCGAAC CTCTCGAGAG
CGAGTCGAGG ATATCGTCGA GGCCGGCGAC CCGGTGTACG GCGTAAACAC CGGTTTCGGC
GAACTCGTCG ACACGCAGGT CCCACGAACC GAGGTCGCGG AACTGCAGAC CAATCTCGTC
CGGAGTCACG CCGCCGGCGT CGGCCGCGAA CTCAGCCGCG AGGAGGTTCG GGCGATAATG
GTCACGCGGC TCAACGCGCT CCTCGCGGGC TACTCTGGTA TCCGCCTTCG TGTGGTCGAA
CTGCTCGCGT CCCTGCTCAA CGAGCAGGTC CATCCGGTCG TCCCCTCGCG TGGGAGCCTC
GGCGCGTCGG GCGATCTCGC ACCGCTCGCA CACCTTGCAC TCGTCCTGAT CGGCGAGGGC
GAGGCCGATG TGGCGGTCGA TTCGTCGGCT GACGGCGGAT CCGGCGGCAG TTCCGCTGCA
CCCGAGTCCG AGCGCCTTCC CGGTGACGAG GCGCTCACAG CAGTGGGACT CGAGTCGGTC
GGCCTCAAAG CGAAGGAGGG ACTCGCGCTC ATCAACGGCA CCCAGTTGAC GCTCGGACTC
GCGGCCCTGC TCGTCGCCGA CGCCGAGCGA CTCTGTCGCG CGGCAGACGC CGCTGGCGCG
CTCACGACGG AGGTGACGAT GGGGACGACG GCTGCTTGCG CCGAGCCGCT CCACGAGGTG
CGACCGCACG CGGGTCAGGC GACGAGCGCG CGCACCGTGC GACGACTAAC CGCTGACTCG
GACGTCGTCG AATCACACCG CAACTGCGAC CGGGTGCAGG ATGCCTACTC GATCCGCTGT
CTGCCACAGA TCCACGGCGC GGTTCGAGAC GCCGTGGCTC ACCTCCGCGA GGCCGTCGAA
ATCGAACTCA ACAGCGTCAC CGACAATCCG CTCGTCTTCC CGCGGGCGGC CTTCGACGAC
CGCGCCTCTG GGACGGACCA GGGTGCGGTC GTCTCCGGCG GGAACTTCCA CGGTGCGCCG
CTCGCTCATC GACTCGACTA CCTCATTGCG GCGCTGACCG ACCTCGCCGC CGTCGCCGAA
CGACGGACCG ATCGACTGGT GAACCCGAAC CTCCAGGAAT CGCACCTGCC GCCGTTCCTC
GCCCAGCGCT CCGGCGTCGA ATCCGGGCTG ATGATCGCCC AGTACACCGC CGCTTCGCTG
GTCAACGAGT GTCGAAGCGT TGGCCGTGCG GCGACCGACA ACACGCCCGT CTCGGGCGGT
CAAGAGGATC ACGTCAGTAT GAGCGCGACA GCGGCGGTCA ACGCGCGGCG AGTGCTCGAC
CGTGCTCGCC AGGTCGTCGC CACCGAACTC CTCTGTGCAG CCGAAGCCGC CGAGTACGTC
GACGAATCGC TCGGATCTGG AACCAGTGAG GCCTACGAGA CGGTCCGTGA AGTCGTCCCG
CCGTTGACCG GCGACCGTCG ACCGGACGCC GACATGAAGG CAGTCGACTC GCTGATCGAG
ACCGGCGTGC TCGACGACGC GGTCGATGAA CTCGGTGCTG ACACCCAGTG A
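
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before it can be analyzed. The following is a minimal sketch of how the blocks might be joined and a GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and how the record rounds its reported percentage is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    first_blocks = "ATGACCGTCA CCCTCGACGG"
    print(len(clean_sequence(first_blocks)))
    print(f"{gc_fraction(first_blocks):.2%}")
```

Passing the full gene sequence text to `gc_fraction` gives a value to compare against the reported percentage.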
 
Protein sequence
MTVTLDGDSL TIEEVVAVAR HGEAVELASE ARERVRTSRE RVEDIVEAGD PVYGVNTGFG 
ELVDTQVPRT EVAELQTNLV RSHAAGVGRE LSREEVRAIM VTRLNALLAG YSGIRLRVVE
LLASLLNEQV HPVVPSRGSL GASGDLAPLA HLALVLIGEG EADVAVDSSA DGGSGGSSAA
PESERLPGDE ALTAVGLESV GLKAKEGLAL INGTQLTLGL AALLVADAER LCRAADAAGA
LTTEVTMGTT AACAEPLHEV RPHAGQATSA RTVRRLTADS DVVESHRNCD RVQDAYSIRC
LPQIHGAVRD AVAHLREAVE IELNSVTDNP LVFPRAAFDD RASGTDQGAV VSGGNFHGAP
LAHRLDYLIA ALTDLAAVAE RRTDRLVNPN LQESHLPPFL AQRSGVESGL MIAQYTAASL
VNECRSVGRA ATDNTPVSGG QEDHVSMSAT AAVNARRVLD RARQVVATEL LCAAEAAEYV
DESLGSGTSE AYETVREVVP PLTGDRRPDA DMKAVDSLIE TGVLDDAVDE LGADTQ