Gene Nmag_3767 details


Gene Information

Locus tag: Nmag_3767
Symbol:
ID: 8826637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 147515
End bp: 148795
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: threonine dehydratase
Protein accession: YP_003481871
Protein GI: 289583461
COG category:
COG ID:
TIGRFAM ID:
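The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 147515
end_bp = 148795

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1281 426
```

Both results agree with the Gene Length and Protein Length fields of this record.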


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.826311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGACG ACACCAAGAA CGACGACACC GGCCTCGTTA CCCGTGCCGA CATCGAGGCA 
GCGCGCGAGC GCATCGACGA CGTTGTCCAC CGAACACCGC TCGACACGTC GCGCACGTTC
GCCGATCTGA GCGGCGCGGC GTCCGTCGGG CTCAAACTCG AGAACGTCCA GCGAACGGGT
TCGTTCAAGA TCCGTGGCGC GTACAACAAG ATGGCCCAGC TCTCGGCCGA CGAGCGGGAG
GCTGGCGTCA TCTCCTCGAG TGCGGGTAAT CACGCCCAGG GCGTCGCGCT GGCAGGGCAG
GTGCTCGACA CCGACACGAC GATCGTCGTT CCCGATGTCA CCCCTGCAGC GAAAATCGAG
GCCACCCGCG GCTACGGCGC CGAGGTCGTC GTCGAGGGCG ACATCTACGA ACGATCGTAC
GAGTTCGCAC TCGAGCGGGC CGCCGAAACC GGTGAGACGT TCGTCCACCC CTTCGACGAC
GAGGATATCA TCGCCGGCCA GGGAACGATC GGCCTCGAAC TCCGCGAGCA GTACCCCGAC
CTCGACACCG TCCTCGTGGC GATCGGCGGC GGCGGGCTGA TCTCGGGGAT CGGCACGGTG
TTGAAGGCCC ACGATCCGAC GACGCGCGTG ATCGGCGTCC AGCCCGAGGG GGCCGCCCAC
GCGAAGCCGA CACTCGAGTC CGACCCGGGA GAGATCCACG AACTGCCGGA CGTTGACACC
GTCGCGGAAG GGATCGCAGA TACCAGGCTC CTCGAGACGA CAGCGGCGAA CGTTCGTGAG
GTGGTCGACG ACGTAGTGAG TGTCAGCGAC CGCGACATTG TGACTGCCGT CACGTTACTG
GCCGAGCGCG CAAAGACCGT CGTCGAGGGG GCCGGTGCTG CCCCGCTCGC GGCTGCGCTC
TCGGATGCAG TCGACGTGGC AGATAAGCAC GTCGCCGTCG TAATTTCTGG CGGGAACGTC
AACCTCACCG ACCATGCCGA ACTGACTCGC ACCGGCCTGC ACGAATTAGG GCGCTACGCC
GAAGCGAGAC TAGCCGTCGA CGGCTGGCCG ACCGCGGTCA GCGACGTGGT CGAAACCGTC
GAAGCCGAGG GTGCAGAACT GGACGTACTC GAGCGCGCCC GCCGTGGGTC GGGGCTCGGA
ACGGACGCGG TGGACCACCC GAACCGCGTT CCCGTGACGG TCGGACTCGA GGGCAGCGGG
CCGGACCATC TCGTGGGCGT GCTTGATGCG ATTGCGGAAC TCGACAGCGT CGATGTACTT
TCTTCGTTGC CGGAAGAGTG A
 
Protein sequence
MTDDTKNDDT GLVTRADIEA ARERIDDVVH RTPLDTSRTF ADLSGAASVG LKLENVQRTG 
SFKIRGAYNK MAQLSADERE AGVISSSAGN HAQGVALAGQ VLDTDTTIVV PDVTPAAKIE
ATRGYGAEVV VEGDIYERSY EFALERAAET GETFVHPFDD EDIIAGQGTI GLELREQYPD
LDTVLVAIGG GGLISGIGTV LKAHDPTTRV IGVQPEGAAH AKPTLESDPG EIHELPDVDT
VAEGIADTRL LETTAANVRE VVDDVVSVSD RDIVTAVTLL AERAKTVVEG AGAAPLAAAL
SDAVDVADKH VAVVISGGNV NLTDHAELTR TGLHELGRYA EARLAVDGWP TAVSDVVETV
EAEGAELDVL ERARRGSGLG TDAVDHPNRV PVTVGLEGSG PDHLVGVLDA IAELDSVDVL
SSLPEE
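
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, translate the DNA, and compute a GC fraction. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (differing only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACTGACG ACACCAAGAA CGACGACACC GGCCTCGTTA CCCGTGCCGA CATCGAGGCA"
last_line = "TCTTCGTTGC CGGAAGAGTG A"

print(translate(first_line))  # MTDDTKNDDTGLVTRADIEA
print(translate(last_line))   # SSLPEE
```

The two translated fragments match the start and end of the protein sequence listed above. The last line is in frame because the 1260 bases before it are a multiple of three. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.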