Gene Nmag_3337 details

Gene Information

Locus tag: Nmag_3337
Symbol:
ID: 8826202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 3468641
End bp: 3469795
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: MaoC domain protein dehydratase
Protein accession: YP_003481449
Protein GI: 289582983
COG category:
COG ID:
TIGRFAM ID:
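
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends in one untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3468641
end_bp = 3469795
gene_length_bp = 1155
protein_length_aa = 384

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1155 0 384
```

Under these assumptions the span matches the reported gene length, divides evenly into codons, and yields the reported protein length.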


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.191953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAAAG AGCCACAACG GGATTCCGAC ACCGACACCG ACACCGGGAG CGGAGAGCAA 
CCCGAACCGC GACCGATCGA CTGGACCGAT CCCGACACGT TTGCACAGGC ACTCGAACAG
GTCGAGACGA AGGAGAAGGG CAACTACTTC GAGGACTTCT CGGAAGGCGA CCTCCTCGAA
CACGACCCCG GGCTCACGCT CACCCGCTGG GGGAACGAGT CATGGATGAG CCAGACGCTC
AACCACGACC CGGCCTACTG GCGCGCCGAC GCCGCCGCGG AGCGCGGCTT CGACGAACCG
CCGATCCATC CGGACTATCT CACCGCTGCC ACGCTCGGCA TCACTGTCGA GGACCTGAGC
GAGAAAGGAG GCTACTTCCT CGGCCGGACA GACGTTCGGT TCCCTGGCAC GCCGGTCTAC
GCCGGTACCG AACTGCACGT CGAAAGCGAG GTCGTCTCGA CGGCGACCTC GAGTTCCCGT
CCCGAGTTCG GCATCGTGAC GTGGCGAACG CGCGGCACCG ACGCCGAGAC TGGTGACGTG
CTCTGCTCGT ACGAGCGGAC GAACATGATT CCGCGGCGAG AGCCGGTTGC GACGGACGGC
GGCGGGAGTG CTGCAACGGC CGACGCCGAC GCAAACGGCG ACAACACCCC TGCGCTCCCC
GAAACGTTCG TCACCCCCGA CGGCGGCTAC TTCGAGGATT TCGTGGCTGC ACTCGAGACG
GCCGAGGGAG ACGACGAGAA CGCCGCAGTT GCCTATCGCC ACGAGCGCGG CCGTACGCAG
GACGACGTAA CCGTCGCCTC GCTCCCGCTC GCGACGCTGA ACACGGCCAA ACAGCACCAC
AACATCGACG TGATGGCCGA CTCGCCGTCG GGCGATATCG TCACCTACGG CGACGTGACC
CGATCGACCG CGCTTGGCCA CGCGCGCTCG GACGAACAGA CCTGGCGCGA GGTCGGCTTC
GACGACGAGC AGTTCCACAC GTTCGTCGCG GCCGGCGACA CCGTCTACGC GTTCACGCGC
GTCCTCGACG CCGAAGACGA TGCGTCCACC GACGCAGCGG GAACGGTCCG GTTCGAACAC
ATCGCGTTCA ACCAGGACGA CGAACCCGTC TACTCGGGAA CCAGAACAGC GGAAATCCAG
AAGCGCACAG CCTAA
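
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks might be joined and the GC fraction computed; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as input here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGACTAAAG AGCCACAACG GGATTCCGAC ACCGACACCG ACACCGGGAG CGGAGAGCAA"
seq = join_blocks(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the whole gene sequence to the same functions should give a length of 1155 and a GC fraction that can be compared with the 66% reported in the record.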
 
Protein sequence
MTKEPQRDSD TDTDTGSGEQ PEPRPIDWTD PDTFAQALEQ VETKEKGNYF EDFSEGDLLE 
HDPGLTLTRW GNESWMSQTL NHDPAYWRAD AAAERGFDEP PIHPDYLTAA TLGITVEDLS
EKGGYFLGRT DVRFPGTPVY AGTELHVESE VVSTATSSSR PEFGIVTWRT RGTDAETGDV
LCSYERTNMI PRREPVATDG GGSAATADAD ANGDNTPALP ETFVTPDGGY FEDFVAALET
AEGDDENAAV AYRHERGRTQ DDVTVASLPL ATLNTAKQHH NIDVMADSPS GDIVTYGDVT
RSTALGHARS DEQTWREVGF DDEQFHTFVA AGDTVYAFTR VLDAEDDAST DAAGTVRFEH
IAFNQDDEPV YSGTRTAEIQ KRTA
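
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the standard codon table by hand. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so that difference is ignored here. `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(text: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACTAAAG AGCCACAACG GGATTCCGAC"))  # MTKEPQRDSD
print(translate("AAGCGCACAG CCTAA"))                  # KRTA
```

Both outputs agree with the first and last residues of the protein sequence listed above.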