Gene Nmag_3903 details

Gene Information

Locus tag: Nmag_3903
Symbol:
ID: 8826773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 300155
End bp: 301864
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: thiamine pyrophosphate protein central region
Protein accession: YP_003482006
Protein GI: 289583596
COG category:
COG ID:
TIGRFAM ID:
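
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(300155, 301864)
print(length)                  # 1710
print(protein_length(length))  # 569
```

Both printed values match the Gene Length and Protein Length fields of this record.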


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.567329
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACGTCCA CAGCAGCCAC ACTCGTCGAG ACACTCGAGG ACCTCGGCGT CGAGTACGTC 
TTCGGCTACC CGGGCGGCCG CGTGATCGAA CTGTTCGAGG CGGTTCCCGA CGCCGATATC
GACCTCGTCC GGCCGCGAGA CGAGCGCGAG GCGAGCGTGA TGGCCGAAAT GTACGGCCGG
CTAACCGGAG ATCCGGGCGT CCTCACCGGG CAGGGGCCGT GGATCGGCAG TATCGGCATG
ATCGGCCAGA TGGAGGCCCG ACTTGCCTCT TCGCCGATGG TTGTTCTCAC CGAAGCCTCC
GAGCGCGGCG AGTACTCGAC GCTCGCGCCG TACCAGCAGG CTCGCGGCGA TTACGGTGGC
TTCAGCCTCC CGGATATCCT CGACGGTGTG AGCAAGGAGT GGTGGTTCCC GCGGACGCCG
GTCGAGACGA TTCGCTCGAC GCAACTGGCG TTCAAACACG CGGTCGCCGG TCGCTCCGGC
CCGACAGCAG TTATCCTCGA CGGGAACGCG ATCACTGCTG AGGTTCCCGA GGACCCAACA
CCCAGAGCCT GGGATGCAGC AGCACAGACG CGGACGTGGG ACGCCGCGCC GACCGCCACC
GACACCGCGG CAGCGGTGGG CGTACTCGAG TCCGCCGAGC GACCAGTGAT CGTTGCGGGC
AACGGCGTCC ACGCCGCACA GGCCTACGAC GAACTCGCGG CGGTTGCCGA GACGTACGAC
TGTGCGGTCG TCACGTCCTA CCTCGGCAAG TCGACCTACC CTGAAACTGA CGAGCGGGCA
GCGGGCGTTA TCGGCTCCTT CGGCCACGAG GGGGCAAACC GCGTCGTCAG CGAGGCCGAC
ACGCTGCTGG TCGTTGGGTG CCGGCTGAAC CCAATGGACA CCAACTGGCA GGCGCCCGAG
TTCATCCGCC CGGACGAGCA GACGATTATC CACGCCGATA TCGACACGCG AAACGCTGGC
TGGGTCTATC CCGCGGACGT CGGCCTGATC GGTGACGCCG CCGAGACGCT CGCGGTGCTC
GCCGAGGCAG GTTCGGGAGG CTCGTCGAAC GGGTGGGCAC TCGAGCGCGC CGCCGAGGCT
CGTGAGTGGT TCGACGCACC CGAGTGTACG GACGATTCGG CACCGATCAA GCCCCAGCGC
GCTGCGACGG CCATCCAGTC AGTCGTCGAC GAGGACACCA TCGTCACCGC CGACTCGGGG
AACAACCGCT TCTGGCTGCT GTACTACCTC CAGACGCCCG CCGTCAGAAC CTACTTCGGC
AGTGGCGGCG TCGGCGGTAT GGGGTGGGCC AACCCCGCTG CGGTGTCTGC GGCGCTCACA
ACCGACGACG AAACAGACGT CATCGCCGTC GCCGGCGACG GCGGCTTCTC GATGACGATG
AACAGCGTCG AAACTGCCGT CGAGTACGGC GTCGCGCCCA CGTTCGTCAT TCTGAACGAC
ACCAGCCTCG GGATGGTCCG CCAGATGCAA CACGAGGATG GCGACATCGC CGGCGTGGAG
TTCCACGACA CCGACTTCGT CGGCATCGCC GAGGCCTTCG GCGCGGTCGG CAAGCGGGTG
ACTGAGCCCA GTGAGTTGGC TGGGGTACTC GAGTCCGCCA AGTCGGCGGA CGTGCCACAC
GTGATCGACG TTCGGATTGA TCGCGAGGAG GATATGGCGG AAACGCTATC GTCGTCGTTC
TACGAGTCAG TTGGCGGGTT ACACGAGTGA
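
The gene sequence is printed in blocks of ten bases, so it has to be joined before its length or GC content can be checked against the fields in the Gene Information section. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed: six blocks of ten bases.
first_line = "ATGACGTCCA CAGCAGCCAC ACTCGTCGAG ACACTCGAGG ACCTCGGCGT CGAGTACGTC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full 29-line sequence to the same functions would give the values to compare with the Gene Length and GC content fields.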
 
Protein sequence
MTSTAATLVE TLEDLGVEYV FGYPGGRVIE LFEAVPDADI DLVRPRDERE ASVMAEMYGR 
LTGDPGVLTG QGPWIGSIGM IGQMEARLAS SPMVVLTEAS ERGEYSTLAP YQQARGDYGG
FSLPDILDGV SKEWWFPRTP VETIRSTQLA FKHAVAGRSG PTAVILDGNA ITAEVPEDPT
PRAWDAAAQT RTWDAAPTAT DTAAAVGVLE SAERPVIVAG NGVHAAQAYD ELAAVAETYD
CAVVTSYLGK STYPETDERA AGVIGSFGHE GANRVVSEAD TLLVVGCRLN PMDTNWQAPE
FIRPDEQTII HADIDTRNAG WVYPADVGLI GDAAETLAVL AEAGSGGSSN GWALERAAEA
REWFDAPECT DDSAPIKPQR AATAIQSVVD EDTIVTADSG NNRFWLLYYL QTPAVRTYFG
SGGVGGMGWA NPAAVSAALT TDDETDVIAV AGDGGFSMTM NSVETAVEYG VAPTFVILND
TSLGMVRQMQ HEDGDIAGVE FHDTDFVGIA EAFGAVGKRV TEPSELAGVL ESAKSADVPH
VIDVRIDREE DMAETLSSSF YESVGGLHE
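
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping a trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 and last 30 bases of the gene sequence in this record.
print(translate("ATGACGTCCA CAGCAGCCAC ACTCGTCGAG"))  # MTSTAATLVE
print(translate("TACGAGTCAG TTGGCGGGTT ACACGAGTGA"))  # YESVGGLHE
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.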