Gene Nmag_1933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1933 
Symbol 
ID8824774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1966101 
End bp1967645 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content62% 
IMG OID 
ProductDNA-directed DNA polymerase 
Protein accessionYP_003480066 
Protein GI289581600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCACTCG AGGGGCCGGC CCGAATCGTC AGCGAACTTA CGAGCCGCGG CTACAACGCT 
GAGCGCGAGG CAGTGACCCA CATCGCTTCG GCGGACCATC CCGCGGCCGT TCTCGAGCAA
GTTGTCGAGG AGATGCCTGA CGACGCGCTG GTCGTTCGAA CGGATCACGT CGAAGCGGTG
CTAGCGGCTG ACGAAGACCC CTCCGTTTCA ACTGGAACTC GTCCCGCAGA TCAGCACGAC
TCGGTGGAGC AAGTTCCAGC TGAAACAGGG GGGTCTGGAC CGACCGCAGT CGCCGAGTCG
GAGGCAAACG AGCGAATCGA GCGCAACAAC GATCCCTCGC TTCGCTCACT CGAGATCGAC
GGCGACATGA CCGGCAACAG TACGGGAACC GGCGAGTACG AGGACTTTGT CGCCGTCTTC
CGGGACCGAC TCGAGCGCCT CGGCGGAAAG CTCCGCGGGC GAGTCAATCA CCGACCTGCA
ACGGCCATCC AGTCGATGCC GGGCGGTAGC GAGGCAGGAA TGGTCGGGCT GGTCAACGAC
ATTCGATCGA CGGCGAGCGG CCACTGGCTG ATCGAACTCG AGGACGCCAC GGGTACGTTC
CCGTGGCTGG TGATGAAAGA TCGCGAGTTC GCCGATATGG TTGACCAACT GCTCTGTGAT
GAGGTGCTGG CGATGGAGGG GACCCTCTCG GACGATGCGG GAATCGCCTT CGTCGACTCG
ATGTACTTCC CGGACGTGCC GCGTACGCAC GAGCCATCGA CGGCAGACCG CCACGTGCAG
GCAGCGCTGA TCAGCGACGT CCACGTCGGC AGCCAGGAGT TCATGGCCGA CGCCTGGGAC
GCCTTCGCGG ACTGGCTGTA CACGGAGCAG GCCCAGCACG TCGAGTACCT GCTAATCGCC
GGCGACATGG TCGAGGGCGT GGGCATCTAT CCGAACCAGG ACGAGGAGCT CGACGTAGTC
GACATCTACG AGCAGTACGA GGCCTTCAAC GAGCGCCTCA AGCAAGTTCC CGGTGATATA
GACATCGTCA TGATTCCGGG CAACCACGAC GCGGTTCGGC TCGCAGAGCC CCAGCCCGGC
TTCGACGACC GGCTTCGCGA TATCATGTCC GCCCACGATC CACAGATCGT GAGCAACCCT
TCTACCGTTA CCGTCGAGGG CGTCTCTATT CTGATGTACC ACGGTGTCTC GCTGGACGAG
GTCATCGCCG AACTCCCCGA GGAAAAGGCG AGCTACGACG AGCCACACAA GGCGATGTAC
CAGCTCCTGA AAAAGCGCCA CGTCGCACCG CAGTTCGGGG GCCACACTCG CCTTGCACCC
GAAGAGGAGG ACTTCCTCGT GATGGACGAG GTGCCGGACA TCTTCCACAC CGGCCACGTC
CACAAACTCG GCTTCGGGAA GTACCACAAT GTGCTCGCGA TCAATTCTGG CTGCTGGCAG
GCCCAGACGG ACTTTCAGAA GAGCGTCAAC ATCAACCCCG ATTCGGGTTA CGCGCCGATT
GTCGACCTCG ACACGCTGGA CGTAACGGTA CAGAAATTCA GCTGA
 
Protein sequence
MPLEGPARIV SELTSRGYNA EREAVTHIAS ADHPAAVLEQ VVEEMPDDAL VVRTDHVEAV 
LAADEDPSVS TGTRPADQHD SVEQVPAETG GSGPTAVAES EANERIERNN DPSLRSLEID
GDMTGNSTGT GEYEDFVAVF RDRLERLGGK LRGRVNHRPA TAIQSMPGGS EAGMVGLVND
IRSTASGHWL IELEDATGTF PWLVMKDREF ADMVDQLLCD EVLAMEGTLS DDAGIAFVDS
MYFPDVPRTH EPSTADRHVQ AALISDVHVG SQEFMADAWD AFADWLYTEQ AQHVEYLLIA
GDMVEGVGIY PNQDEELDVV DIYEQYEAFN ERLKQVPGDI DIVMIPGNHD AVRLAEPQPG
FDDRLRDIMS AHDPQIVSNP STVTVEGVSI LMYHGVSLDE VIAELPEEKA SYDEPHKAMY
QLLKKRHVAP QFGGHTRLAP EEEDFLVMDE VPDIFHTGHV HKLGFGKYHN VLAINSGCWQ
AQTDFQKSVN INPDSGYAPI VDLDTLDVTV QKFS