Gene Nmag_1915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1915 
Symbol 
ID8824756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1944089 
End bp1945324 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content64% 
IMG OID 
Productthreonine dehydratase 
Protein accessionYP_003480048 
Protein GI289581582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAT ATTCAGGATC CGTCGACGTT TCCGATATCG AGGCCGCTCG CGACCGGTTC 
GACGACGAGT CGGTCGTCAA GCGAACGCCC ATCGAGCGGA GCACGTCGCT CGACGAACTG
ACCGGTGGCG AGGTGTTCCT CAAGATGGAG CACCTCCAGT GGACGGGCTC GTTCAAAACT
CGCGGCGCGT ACAACAAAAT CGCCCAGTGT GTCACCGAGG ACGAAACCGA GCGCGTCGTC
GCCGCGAGCG CGGGCAACCA CGCCCAGGGT GTCGCACTCG CCGCGACGAA CCTCGGTATC
GACTCGACCA TCGTCATGCC CCGCACCGCA CCACAGGCGA AGGTCGACGC GACGAGAGGA
TACGGCGCGG ATGTCGAACT CGTCGGCACC GACTTCCGCG AGGCCATGGA CTTCGCGGAA
GACCTCGTTT CCGGCACCGA CGCCGAGTTC GTCCACGCCT ACGACGACCC GGCGATCGTC
GCCGGCCAGG GCACCCTTGG TCTCGAGATG TACGAGGATC TGCCGGAGGT GGACACCGTC
GTCGTCCCGA TCGGTGGCGG CGGCCTCATC GCTGGCATCG CAACCGCGTT CGCCGAGCGG
TCGCCCGAGA CGCGCGTCGT GGGTGTGCAG GCGACCGACG CGGCCACTGT TCCCGATAGC
CTCCAGAAGG GAACGCCGAT CTCACTCGAG TCGGTGAACA CGATCGCGGA CGGTATCGCC
ACCGGCGGCG TCTCGGAGTT GACGCTCTCG CTCATCGAGG AGCACGTCGA CGACGTAGTA
ACGGTGACCG ACGGCGAAAT TGCACGAGCG ATTTTGCTCC TGTTAGAGCG TGCAAAGCAG
GTCGTCGAGG GGGCTGGCGC GGCCTCGGTT GCAGCGATTA TCAGCGACGA ACTCGACGTG
GAGGGCGAGA CGGTGATGGC CCTACTCTGT GGTGGGAACC TCGATATGAC GATGCTCCAG
ACGGTGCTCG TCCACGCGCT CTCCGATCGG GAACAACTGC TGCGGCTTCG CGTGCGGATC
GACGATCAGT CCGGCAAGAT GGAGGAAATT TCGGGCGTGA TCGCCGACCA TAACGCCAAC
ATTCAGACGG TTCGCCACGA CCGCTCGGCA CCGGAACTCG ACGTCGGCGA GGCACACCTC
GACTTCCAGA TCGAGACCAG CGGCGCGGGA CAGGCACGAG CAATCATCCG ATCGATTCGT
GATCACGGCT ACGAGGTGAC GCACGTCAAC GCTTGA
 
Protein sequence
MAQYSGSVDV SDIEAARDRF DDESVVKRTP IERSTSLDEL TGGEVFLKME HLQWTGSFKT 
RGAYNKIAQC VTEDETERVV AASAGNHAQG VALAATNLGI DSTIVMPRTA PQAKVDATRG
YGADVELVGT DFREAMDFAE DLVSGTDAEF VHAYDDPAIV AGQGTLGLEM YEDLPEVDTV
VVPIGGGGLI AGIATAFAER SPETRVVGVQ ATDAATVPDS LQKGTPISLE SVNTIADGIA
TGGVSELTLS LIEEHVDDVV TVTDGEIARA ILLLLERAKQ VVEGAGAASV AAIISDELDV
EGETVMALLC GGNLDMTMLQ TVLVHALSDR EQLLRLRVRI DDQSGKMEEI SGVIADHNAN
IQTVRHDRSA PELDVGEAHL DFQIETSGAG QARAIIRSIR DHGYEVTHVN A