Gene Nmag_0922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0922 
Symbol 
ID8823752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp942151 
End bp943590 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content58% 
IMG OID 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_003479068 
Protein GI289580602 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATTG GCCAAACCTC CTTCATCGTC TTCGTCTCTA AATTTGCAAA AGCAGCCCTT 
GGCTTCGTCG CGACGATCTA CTTCGCTCGA GTACTCGGCG CTGAGATCCT TGGCTATTAT
GCGTTAATTC TCGCACTCGT CGCATGGCTC GAACTCGGCG GAAAGATCGG CATCTCGTCG
GCGATAACGA AGCGCCTGAG TGAGGGCGAA GAGCAGTCTG CGTACTTTAC CGCAGGTGCG
ATCGCTATCG GAATCCTTGC GGCTGTGCTT TCGGTCGGCG TCATCGTCTT TCGAGATGCG
GTCAACGACT ACGTCGGAGT GGAAGCCGCA GTGTTCGTCG TCTTCTTGCT CGTCCTCAAA
CTCGTTCACT CGCTTCTGAC TGCCGTTCTA CAGGGTGAGC ACCTGGTTCA CATATACGGA
CTCCTCGATC CGTTGAAGAC AGGCTCGCGA GCCGTCATCC AGATTGGCCT CGTGTTCGCC
GGCTTCGGGC TTACGGGGAT GATCGTCGGC AAGGGAGTCG GCATCTTGAT CGCGTCACTG
GTTGCGCTGG TATTTGTCTC GGTTACGCTC GCACGGCCCT CGACTGAGCA CTTCCGTAGT
CTCTTCGACT ACGCGAAGTA CTCGTGGCTC GGGAATCTCG AGTCGAGATC GTTCAACGAC
GTGGACATCG TTATTCTCGG TGCGTTAGTC TCCCCCGCAC TCGTCGGGAT CTATTCGGTC
GCCTGGAGCA TCGCGAAGTT TCTCACTGTA TTTGGCACAG CAGTGAAAGC GACGCTGTTT
CCGGAACTAA GCGTTGCGGA CGCTGAAGGT GACAGTGAAA CGGTTTCTGC ACTCGTCAGC
GATGCACTCA CGTACGGCGG GTTAGTCATC ATTCCCGGAT TGTTCGGTGC GATACTGCTC
GGCGATCGAT TGCTTCTGCT GTACGGGTCG GAGTTTGTCC AGGGAACTGC CGTCCTCGGC
GTGTTAATCG TCGCGACACT GGCTCGTGGC TACCAGAAGC AACTCGTCAA CGTGCTCAAC
GGAATCGACA GGCCAGATGT CGCATTCAGG GTCAACGCGG TCGCCATCGT TGCGAACGTC
GTGCTCAACG TCGTGCTGAT CGTCTGGCTC GGCTGGCTCG GTGCAGCGAT CGCGACCGCA
CTGTCCGCAA CGATCGGGTT GTCACTCTCG CTTCGCGAAC TCCACCGGCT CGTTGCGTTC
GACATTCCGT ACGGTGAGCT TGCCCGGCAG CTCAGTGCAG CCGTCGTAAT GGCAGCAATC
GTGTTCGGAG GCCAGAACGC CATCGAAGCG ACCGGAATCC TCGAGCAGAA CGTGGTGATA
CTCGTCCTCC TCGTTGCGGT CGGAGCTGGC ACGTACTTTA CGACGCTGTT CACGATTTCG
CGTCGGTTCC GATCCACGGT CGTTGCAAAT TCGCCGGTTC GGATTCCGCT CGTGTCCTGA
 
Protein sequence
MRIGQTSFIV FVSKFAKAAL GFVATIYFAR VLGAEILGYY ALILALVAWL ELGGKIGISS 
AITKRLSEGE EQSAYFTAGA IAIGILAAVL SVGVIVFRDA VNDYVGVEAA VFVVFLLVLK
LVHSLLTAVL QGEHLVHIYG LLDPLKTGSR AVIQIGLVFA GFGLTGMIVG KGVGILIASL
VALVFVSVTL ARPSTEHFRS LFDYAKYSWL GNLESRSFND VDIVILGALV SPALVGIYSV
AWSIAKFLTV FGTAVKATLF PELSVADAEG DSETVSALVS DALTYGGLVI IPGLFGAILL
GDRLLLLYGS EFVQGTAVLG VLIVATLARG YQKQLVNVLN GIDRPDVAFR VNAVAIVANV
VLNVVLIVWL GWLGAAIATA LSATIGLSLS LRELHRLVAF DIPYGELARQ LSAAVVMAAI
VFGGQNAIEA TGILEQNVVI LVLLVAVGAG TYFTTLFTIS RRFRSTVVAN SPVRIPLVS