Gene Nmag_1115 details


Gene Information

Locus tag: Nmag_1115
Symbol:
ID: 8823946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1135341
End bp: 1137041
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: thiamine pyrophosphate protein TPP binding domain protein
Protein accession: YP_003479261
Protein GI: 289580795
COG category:
COG ID:
TIGRFAM ID:
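
The coordinates and lengths in this record agree with one another, and a short Python sketch can confirm it. The sketch assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1135341
END_BP = 1137041
GENE_LENGTH_BP = 1701
PROTEIN_LENGTH_AA = 566


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed to be present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Each comparison checks one derived value against the value listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1137041 - 1135341 + 1 gives 1701 bp, and 1701 / 3 - 1 gives 566 aa, matching the listed gene and protein lengths.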


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGCGA CGGGTGCTGA TCTCTTCATC GACAGTCTCG AATCGTACGG TGTCACGAAA 
TTGTTCGGTA ATCCCGGCAC CACCGAACTC CCGCTCATGC AATCGCTCGT CGAGAGCGAA
CTCGAGTACG TGCTCGGCTT GCACGAGGAC GTAGCGGTGG GAGCGGCGGC GGGGTATGCA
ATGCGGCGGC GGCATCACGC GACCGAAGCG GCAACTGGTA CTGGTGGTCG CGATGCCGAC
GACGTCCTCC CGCTCGGCGT CGCGAACCTC CACCTCGCCG GTGGACTGGC ACACGGGCTG
GGGAACCTCT ACAACGCAGA CGTTTCGGGC GCGCCGCTGC TGGTCACGTC GGGAACCCAC
AGCCGGGACT ACCAGCAGGA GGAACCGATT CTCAGCGGCG ACCTCGTCGA GATGGCCGAG
CCGTTCACGA AGTGGAGCGC CGAGGTAAAA CACGTCGATG CGCTGCCGAC GATGGTTCGT
CGGGCCGTCC GGACCGCGCT GACGCCGCCG ACGGGACCGG TCTTCCTCTC GATTCCGGTC
GACGTCCAGA CGGAGGAGAC CACGGCTGAA CCGGAGCCAC TCGGTCGGAT TCCGACGGCG
GGCCGGGGCG ACGAGGCGGC GATTCAGGAG GCTGCCACGA TGCTCGCCGA CGCCGACGAA
CCCGTCTTCG TCCTCGGCGA CGAGGTTGGG CGCAGCGGTC CGGCAGCGGT TGAGGCTGCG
GTCGACCTCG CTGAAGCGAC GGGCGCACGC GTCCACAACG AAATCCTCGC CTACGAGGCG
AACTTCCCGA CGGACCACGG CCAGTGGCAG GGCGCGCTCT CGACGAAAGC GCCCGGCTCG
GCCGCCGCGA TGGACACGGA CACGCTCGTC TTCGTCGGCT GCTCGACGAA CACGACGGTA
ACGCGGCCGA CGACTCAACT CGTCCCCGAT GAGGCGACAC GAGTTCACAT CTCGCCCGAC
GCGTGGGAAC TCGGCAAGCA CGCTCCCGCA GAGACTGCGG TGCTTGGCGA CCCCGCGACC
GTCCTTGCGG ATCTCGCCGA TCGTGTCAGC AACGCTGTCG ACGACGGCGA ACGCGAACGG
CGACTCGAGT CCGTCCGCGA CTGGGCGAAC GCACACGACA CCGATCCTGC CCCCGAAACG
ATCGACGGGA CGCTGACGAA AGCGGGGCTC GCTCGGGCGT TCGACAGCGT TGCGCCGGAT
GCGCTCGTGG TCAGCGAGGC AATCACCGCG TCGCCGCCGC TGTTCGACGA GTTCGAGTTC
GAGGCCAACC AGCTCCTGGG AACGAAAGGT GGCGGCCTCG GCTACGGACT GCCGGCCAGC
GTGGGCGCTG CGGTCGCAGA GCAGGAAGCC GGCGGCGACC GCTCGGTGCT TGGCTACGTC
GGCGACGGCT CGTACCTCTA CTACCCGCAG ACGCTGTACA CGGCGGTTCG AAACGACCTC
GATCTCACCG TCGTCGTCCC CGACAACCGC AACTACCGCA TCCTGAAGGA GAACACGGCG
AACCTGCTCG GCGGCGACCC CGACGAGTAC GAGTACGACG GCTTCGACTT CGAGCCGCCC
GTCGACATTG CAGCGAGTGC CGCCGCCCAC GGTGCGACGG GAGTGACTGT CGAGGAGCCA
GGGGAACTCG AGTCGGTACT CGAGGACGCA CTGACGACGG CGGGGCCTGC GGTTGTCGAC
GTACCGGTGA CTGACGAATA A
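
The gene sequence is printed in space-separated groups of ten bases, so any length or GC-content check has to strip the whitespace first. The following is a minimal Python sketch of that cleanup; the helper names are hypothetical, and the sample string is only the first row of the block above. To reproduce a whole-gene figure, the full sequence would need to be pasted in.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence only (illustrative sample, not the whole gene).
first_row = "ATGGTAGCGA CGGGTGCTGA TCTCTTCATC GACAGTCTCG AATCGTACGG TGTCACGAAA"

print(len(clean_sequence(first_row)))   # number of bases in the sample
print(round(gc_content(first_row), 2))  # GC fraction of the sample
```

Applied to the complete sequence, `len(clean_sequence(...))` can be compared with the listed gene length and `gc_content(...)` with the listed GC content.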
 
Protein sequence
MVATGADLFI DSLESYGVTK LFGNPGTTEL PLMQSLVESE LEYVLGLHED VAVGAAAGYA 
MRRRHHATEA ATGTGGRDAD DVLPLGVANL HLAGGLAHGL GNLYNADVSG APLLVTSGTH
SRDYQQEEPI LSGDLVEMAE PFTKWSAEVK HVDALPTMVR RAVRTALTPP TGPVFLSIPV
DVQTEETTAE PEPLGRIPTA GRGDEAAIQE AATMLADADE PVFVLGDEVG RSGPAAVEAA
VDLAEATGAR VHNEILAYEA NFPTDHGQWQ GALSTKAPGS AAAMDTDTLV FVGCSTNTTV
TRPTTQLVPD EATRVHISPD AWELGKHAPA ETAVLGDPAT VLADLADRVS NAVDDGERER
RLESVRDWAN AHDTDPAPET IDGTLTKAGL ARAFDSVAPD ALVVSEAITA SPPLFDEFEF
EANQLLGTKG GGLGYGLPAS VGAAVAEQEA GGDRSVLGYV GDGSYLYYPQ TLYTAVRNDL
DLTVVVPDNR NYRILKENTA NLLGGDPDEY EYDGFDFEPP VDIAASAAAH GATGVTVEEP
GELESVLEDA LTTAGPAVVD VPVTDE