Gene Nmag_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0143 
Symbol 
ID8822962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp158096 
End bp159313 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content35% 
IMG OID 
ProductO-antigen polymerase 
Protein accessionYP_003478299 
Protein GI289579833 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.697441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAA ATGACTGGAA AGGATATATA GTAAATATCT TGTTTGTATT TCTATTGATT 
ATTATATTAT TACCTAGTAC GGTAGTTGTT AATACAACAA TAGGGAACGC ATCATCTATA
ATTCTAACCT TTTTGTTCCT AATAATAATA TATTCAATAT ATGGTGGAGT GGCTGTTGAG
TATGTTAAGT CCATCGGAAT TGCTTTTTTC ATAATATTTC TTGTGTTGCT GTATCACATA
CATAATGGGC GTTACAACCT CGCGCACAAT GCATTTTATC CAATCTATGG CGTTCTTTAC
CCAGTTATAT TCATGTTTAT CTTCCCCAAT TACATAAATT ACAGAGTGAT ATCAAAGGCT
ATCGCAGTGT TTTCGTCAGT GGTCGTAATC ATTGGGTTGC CAGCACTCAT AATCGGAACA
TATGATTTAT TTTGGTTTCA AGTCCAGGCA GTTGAATATA GTTCTGTAAC CCGGTTGAGA
TCAATCTTCG AAGGTAGTCA AAATTCACTA GGCAGGTTTT TAATGATTGG TTCAATATTC
GCCTTCTCAG AGTATCATAA CACAAGCAAT ACACTTTGGG GGGGAGTAGT AGCTATAAAC
ATATTTGGGT TATATTTGAC TGGTAGCAGA GGAAGTCTTG CTGCCTTTTC TATAGGTTTC
TCTATTTACT TAATATACTA TTTTTACAAT AAGGAGATTT ATAAGAAAGT GTATGCAATA
GTTCTCTTTG GATACTCGTC AGCACTTCTT TTCTTTTTTG GACTAATACC TTGGCCAAAT
ACAATCAAAA GCATTGATTT CAGTCATAGA TTTGAGATAT GGGATGCTAC CCTGAACGCC
TCTTCAAATA ATATCATTTT AGGAAACGGA TTAGTACCCC GGAGCGAATT GATAGCCCCA
TATCTTTATA CACCAGAAAT AATGGGGGTG AATCCACATA ATGGGTATTT ATCAATTTTA
CTGTATTCTG GTATAATTGG ACTCATTTCA TACCTCTCCA TTATATACCA GGTTTTACTT
CTGTCCATTG CTAGAGATGA GTCAAACGTC CTAATGATGT CTGTTTCCAT CTCAATCCTA
ACCGAATCTT TTGTAGAAGA TGTTATGATC ATTGGAACCG GCTTCAGTAC AATAATCTTA
TCTATGTGTT TTGGGTACTT GATTAAGGAG AGTGAAATTA GTAATAGAAT AATCATCAAG
CAGAAAACTA CTGACTAA
 
Protein sequence
MDQNDWKGYI VNILFVFLLI IILLPSTVVV NTTIGNASSI ILTFLFLIII YSIYGGVAVE 
YVKSIGIAFF IIFLVLLYHI HNGRYNLAHN AFYPIYGVLY PVIFMFIFPN YINYRVISKA
IAVFSSVVVI IGLPALIIGT YDLFWFQVQA VEYSSVTRLR SIFEGSQNSL GRFLMIGSIF
AFSEYHNTSN TLWGGVVAIN IFGLYLTGSR GSLAAFSIGF SIYLIYYFYN KEIYKKVYAI
VLFGYSSALL FFFGLIPWPN TIKSIDFSHR FEIWDATLNA SSNNIILGNG LVPRSELIAP
YLYTPEIMGV NPHNGYLSIL LYSGIIGLIS YLSIIYQVLL LSIARDESNV LMMSVSISIL
TESFVEDVMI IGTGFSTIIL SMCFGYLIKE SEISNRIIIK QKTTD