Gene Nmag_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0830 
Symbol 
ID8823659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp841596 
End bp842897 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID 
Productthreonine synthase 
Protein accessionYP_003478976 
Protein GI289580510 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCGA GTTCGCAGGC CCGACTCCGA TGTTACGACT GTGGCTGCAC GTATGACCAC 
GGCGACCGCA CGCGATGTTC CTGCGGGGAA CCGCTCTGGT TCGATCTCGA TGTGGACGAA
TTCGAGTGGC CCACAGAGAC AGCCCAAGCA ACGGAACAAG ATCCGGCGAC GACCGGTATC
TGGCGCTACG ACGCCGTCCT CCCGGTTTCC GCACCCGAGA CGACTACACT GCCGCCCGGT
TCGACACCGC TCGTCCGCGC AGGTGCACTG GACTCGTTCG CCGGCTGCGA GCTACACCTC
AAGCCAGAGG GACAGAACCC CACCGGCAGC TTCAAAGACC GCGGCACCGC CGTCGGCGTC
GCCCACGCCC TCGACACCGA CACAGACTGG ATCGGCACCG TCTCCCACGG GAACATGGCC
CTGAGCACCA GCGCCTACGC CGCGGCGGCC GGACTCGAGT GCACCGTCTT CGTCCCGGCG
GATATTCCGC CGGAACGGCT CGATTTGCTC GCTCGGCACG ATCCGCATAT CTTCCGCGTC
GAGGGAGAGT ATGGCAGGCT CTACGAGGAG ACGCTCGCGC TCGATACTGA CGGCGACGCC
GGTATCACGT TCGTCAACTC CGATACGCCG CTGCGGGTGG CCGGCCAGAA GACGATTGCG
TACGAGTTAC TCGAGCAGTT TCGACTGGAG TCCCCGAACG CGCCGTCTTC GCCGGGCGTG
CCGTCCTCGT CGACGGCGTC GACCGCGTCG ACCGCACCGG ACGCGATCGT CCTCCCGGTC
AGCAGCGGCG GCCAGGCCAG CGGCGTCTGG AAAGCACTCC GAGAACTGAC CCGGAGTGGC
GTGCTCGCAG CCGACGACGT TCCGCGGCTC TACTTCGTGC AGGCAGCGCC GTGCGATCCA
ATCGCGACGG CCTTCCGGGA GGGCCGCGAG GAGGTAATGC CGATCGACGC CGATGAGACG
ATTGCTGTCT CCATCGCGAA CAGCGACCCG CCGAGTGGCA CCCGCGCGCT CACGGCCGCT
CGCGATACCG ATGGCGCGGT CGTTGCCGTC CCCGACGAGG CCACGCGCGA GGCGATGGAC
CGGCTGGCGA CGGACGCCGG TCTCGCCGTC GAGCCGTCCA GCGCAGTCGC GCTCGCGGGA
GTTCGAGAGC TCTCGGACCG CGGCGAGATC GCAGCAGACG AGTTGGTCGT TACGATTCTA
ACCGGCTCCG GGTACAAGGA GTCAGTCGAG ACGGAGCCGC GAAGCCGCCA CATCGACCTC
GCGGACCTCG AACACGAACT CGCGTCTGTC GTCGAGTCCT GA
 
Protein sequence
MTASSQARLR CYDCGCTYDH GDRTRCSCGE PLWFDLDVDE FEWPTETAQA TEQDPATTGI 
WRYDAVLPVS APETTTLPPG STPLVRAGAL DSFAGCELHL KPEGQNPTGS FKDRGTAVGV
AHALDTDTDW IGTVSHGNMA LSTSAYAAAA GLECTVFVPA DIPPERLDLL ARHDPHIFRV
EGEYGRLYEE TLALDTDGDA GITFVNSDTP LRVAGQKTIA YELLEQFRLE SPNAPSSPGV
PSSSTASTAS TAPDAIVLPV SSGGQASGVW KALRELTRSG VLAADDVPRL YFVQAAPCDP
IATAFREGRE EVMPIDADET IAVSIANSDP PSGTRALTAA RDTDGAVVAV PDEATREAMD
RLATDAGLAV EPSSAVALAG VRELSDRGEI AADELVVTIL TGSGYKESVE TEPRSRHIDL
ADLEHELASV VES