Gene Nmag_3000 details

Gene Information

Locus tag: Nmag_3000
Symbol:
ID: 8825860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 3089784
End bp: 3090950
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_003481114
Protein GI: 289582648
COG category:
COG ID:
TIGRFAM ID:
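
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3089784
end_bp = 3090950

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1167 bp) and Protein Length (388 aa).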


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTCGC CGGCGATTCG CGACAGGACA GTACTCGTCA CCGGCGGTGG CGGCTTCATC 
GGGAGCCACC TCGTCGAGGC GCTGGCACCG TACAACGATG TCCGCGTACT GGATAACTTC
TCGACCGGTT CGCGGGACAA TCTCTCGTCG GTGACCAGTC CGCAGTGGAC CAACGATGCG
CCGACAAGTG CGGATGGTGG GTTCGACGAC GCTGGAGGCG GCGGAGACGC TGGAGACGCC
GGAGACGCTG GAGACGCCAG GTACGACGGC TCGCCCACGA TCATCGACGG AGACATTACC
GATCCGATGG CCCTCCAGCG CGCCGCTCGC GGCGTCGACC TCATTTTCCA CCAGGCCGCG
CTCGTTAGCG TCGCCAAGAG CGTCGACGCG CCACGCCGGA GCAACGAGAC CAACCTCGAC
GCCAGCCTAC TCGTCCTCGA CCAGGCCCGC CAGGAGGACG CCCGCGTCGT CCTCGCCTCG
AGTGCGGCCG TCTACGGTCA CCCCGACGAA TTACCCGTCT CCGAGACGGC AAGGACGGAG
CCGACCTCGC CCTACGGCAT TCAGAAGCTC GCACTCGACC AGTACGCTCG CCGCTACCAC
GAACTATATG ACCTCCCAAC CGTTGCGCTA CGCTATTTTA ACGCGTACGG ACCACGCCAG
CAGGGCCCCT ACAGCGGCGT CATCTCGACG TTCCTCGAGC AGGCCCGTTC CGACGATCCG
ATCACGATCG AAGGTGACGG CGAGCAGACG CGAGACTTCG TCCACGTTTC AGATGTCGTC
CGTGCAAACA TCCGCGCTGC GACGACTGAC GCCGTCGGCG AGGCCTACAA CGTCGGTACC
GGAGACCGGA CCTCGATCCG GGACCTCGCC GAACTCGTTC GCGACGCCGT TGGTTCGTCC
TCGCCAATCG TCCACCGTGA GCCTCGTCCG GGCGATATCA GACACAGTCG TGCAGATGTT
TCGAAAGCGA GTCGCGAACT CGGCTTCGAG ACCCGCGTCG GTCTCGAGTC CGGGATTCGA
TCGCTTGTCG CTGAGACTGG GAGTGAACAG GGGCGTTCGA CTTCGCCTCC GCAGGAACAA
GGACAGGGAC AGGGACAGGG ACAGCAACAG CAACAGCAAC AGCAACAGCC ACTGACCGCC
AGACCAGAGC GACAGTCACA GGACTAG
 
Protein sequence
MTSPAIRDRT VLVTGGGGFI GSHLVEALAP YNDVRVLDNF STGSRDNLSS VTSPQWTNDA 
PTSADGGFDD AGGGGDAGDA GDAGDARYDG SPTIIDGDIT DPMALQRAAR GVDLIFHQAA
LVSVAKSVDA PRRSNETNLD ASLLVLDQAR QEDARVVLAS SAAVYGHPDE LPVSETARTE
PTSPYGIQKL ALDQYARRYH ELYDLPTVAL RYFNAYGPRQ QGPYSGVIST FLEQARSDDP
ITIEGDGEQT RDFVHVSDVV RANIRAATTD AVGEAYNVGT GDRTSIRDLA ELVRDAVGSS
SPIVHREPRP GDIRHSRADV SKASRELGFE TRVGLESGIR SLVAETGSEQ GRSTSPPQEQ
GQGQGQGQQQ QQQQQQPLTA RPERQSQD
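
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline used to build this record: `translate` and `CODON_TABLE` are hypothetical names, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). Only the first and last stretches of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: translation table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGACCTCGC CGGCGATTCG CGACAGGACA"))
print(translate("AGACCAGAGC GACAGTCACA GGACTAG"))
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed above. The final fragment is in frame because it begins at base 1141 of the gene.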