Gene Nmag_3841 details


Gene Information

Locus tag: Nmag_3841
Symbol:
ID: 8826711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 230592
End bp: 232103
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003481944
Protein GI: 289583534
COG category:
COG ID:
TIGRFAM ID:
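
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both assumptions fit the listed values but are not stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(230592, 232103)
print(length, protein_length(length))  # 1512 503
```

Both results agree with the Gene Length and Protein Length fields of this record.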


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.260308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGT TCGACCCGTT CGGCCGTCAC ATCTTCGCCG ACGGCACCTT CCGCGAGAGC 
GAGTCACTCG AGTCCATGGA CGTAATCGAC CCTGCGACCG AGGAACCCGT CGGCTCCGTT
GCGGTCTGTG ATCCCGACGA GGTCGAGGCC GTCATCGAGG GAGCCGTCGA GGCACAGTCA
GCGTGGGGCG ACGAACCCGC AGGGACACGT GCAGCAGCGC TCCACGAGGT TGCAGATTCG
ATCGAGGCGG ACGATTTCGA GCGCATCGCG ACGCTGATGA CGAGAGAACA CGGCAAGCCC
TTCCCCGAAT CGGAGGGCGA ACTCGCAAAC GTCGCGGGCA TCTTCCGCTA CTACGCCGAA
CTGGCGCGCG ACGACCAGGG GAACGTCCCC GGCTCGACGC AGGCGGAGTC GTTCCAGTTC
GACCGGGCGT TTCCCTACGG CGTCACCGTT CACATCGTCC CCTCGAACTT CCCCGTCCTG
CTAACGGCCT GGACAGTCGC TGCCTCGCTC GCGGCCGGCA ACGCTGTGAT CGTCAAGCCG
TCCGAGCAGA CGCCGCTCTC GACGCTCCAG TTCATAGAGC ACTTCAAAGG GCTTCCCGAC
GGCCTCGTCT CGTGTCTCAC CGGCCGCGGC GAAACCGCAC AGGCGATGAT CCAGTCGGAC
GGGACGGACG CCGTCGCGTT CACCGGCGGC GTCGAAACCG GACAGCAGGT GAGCACAGCG
GCCGGCAAGC AGCTGATGCC CGCCGTCATC GAAGCCGGCG GCAACGACCC GCTCATCGTC
ACCGAGCACG CGCCCATGGA GGTCGCAATC GCCGGCTCGA CCACCGCAGC GTTCCATCTC
TCCGGACAGG TCTGTACCGC CGCCGAGCGG TTCTACGTCC ACGACGCCGT CCACGACGAG
TTCGTCGACG GCCTCGTCGA GATGACCGAG GCACTCCGCG TCGGCAACGG CTTCGAATCC
AGCGAGATCG GCCCGCTCGT CAGCGAGGCC GCCCGCGACA ACGTCGAGCG ACTGGTGGAG
GATGCCCTCG AGAAGGGCGC GACACTCGAG TGCGGCGGGC AGGTGCCACC GGAGCAGGAA
ACGGGCTGGT TCTACGAGCC GACAGTGTTG ACAGACGTAA CGCCGGAGAT GGCCATCGTC
CGCGAGGAGG TGTTCGGCCC GGTTGCGCCG ATCTGTCGCG TCGAGAGCTT CGAGGAGGCG
CTCACGGAGG CGAACAACTC CGAGTTCGGA CTGGGCGCGT CGGTCTTCAC GACGGATCTC
GAGGAGGCGA TGCGAGCCTA CGAGACGCTG GAGGCGGGCA TGGTCTGGAT CAACAATCCG
ATGATCGACA ACGACGCGAT TCCGTTCGGC GGCTGGAAAC ACTCCGGCAT TGGCCGCGAA
CTCGGCCGGC AGGGGCTGGA TGCGTTCCGC CAGACGAAGA TGGGGGTCAT CGACTGGAAC
CCGCAGGTTC ACGACTGGTG GTATCCCTAC CCCGAGGAGT GGTTCTACGA CACCGAGGAG
AAGCGGTTCT GA
 
Protein sequence
MTEFDPFGRH IFADGTFRES ESLESMDVID PATEEPVGSV AVCDPDEVEA VIEGAVEAQS 
AWGDEPAGTR AAALHEVADS IEADDFERIA TLMTREHGKP FPESEGELAN VAGIFRYYAE
LARDDQGNVP GSTQAESFQF DRAFPYGVTV HIVPSNFPVL LTAWTVAASL AAGNAVIVKP
SEQTPLSTLQ FIEHFKGLPD GLVSCLTGRG ETAQAMIQSD GTDAVAFTGG VETGQQVSTA
AGKQLMPAVI EAGGNDPLIV TEHAPMEVAI AGSTTAAFHL SGQVCTAAER FYVHDAVHDE
FVDGLVEMTE ALRVGNGFES SEIGPLVSEA ARDNVERLVE DALEKGATLE CGGQVPPEQE
TGWFYEPTVL TDVTPEMAIV REEVFGPVAP ICRVESFEEA LTEANNSEFG LGASVFTTDL
EEAMRAYETL EAGMVWINNP MIDNDAIPFG GWKHSGIGRE LGRQGLDAFR QTKMGVIDWN
PQVHDWWYPY PEEWFYDTEE KRF
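
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate nucleotide blocks into protein, for comparison against the listed protein sequence. It is an illustration only: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for all codons other than alternative starts.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence in this record.
first_line = "ATGACCGAGT TCGACCCGTT CGGCCGTCAC ATCTTCGCCG ACGGCACCTT CCGCGAGAGC"
last_line = "AAGCGGTTCT GA"

print(translate(first_line))  # MTEFDPFGRHIFADGTFRES
print(translate(last_line))   # KRF*
```

The two outputs match the first twenty and last three residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.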