Gene Nmag_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1469 
Symbol 
ID8824302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1498557 
End bp1500221 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content65% 
IMG OID 
Productthermosome 
Protein accessionYP_003479609 
Protein GI289581143 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC GAATGCAGCA GGGACAGCCG ATGATCGTGA TGAGCGAGGA CTCCCAGCGC 
GTCAAGGACA AGGACGCGCA GGACTACAAC ATCAGCGCCG CCCGTGCGGT CGCTGAGTCC
GTCAAGTCCA CGCTCGGCCC GAAAGGGATG GACAAAATGC TCGTCGACTC GATGGGATCG
GTAACGATCA CCAACGACGG CGTCACCATC CTCCAGGAGA TGGACATCGA CAACCCGACG
GCCGAAATGA TCATCGAGGT CGCCGAGACC CAGGAGGACG AGGCTGGCGA CGGCACCACG
ACCGCCGTCT CCATCGCCGG TGAACTCCTC AAGAACGCCG AGGATCTCCT CGAGCAGGAC
ATCCACCCGA CGGCGATCAT CAAGGGCTTC CACATGGCGA GCGAGCAGGC TCGCGAAGAG
ATCAACGACA TCGCCGTTGA CGTCGACACC GAGGACGAAG ACCTCCTGCG CTCGGTCGCC
GAAACCTCGA TGACTGGCAA GGGTACCGAG GTCAACAAGG AGCACCTCGC CGAGCTCATC
GTCGAGGCCG TCCGCCAGGT CACCGTCGAG GACGACGAGG GCAACAACGT TGTCGACCTC
GAGTTCCTCA ACATCGAGAC CCAGACCGGC CGCGGCGTTT CCGAATCCGA CCTCCTCGAG
GGCGGCATCA TCGACAAGGA CCCGGTCCAC GACAACATGC CGACCTCGGC CGAGGACGCC
GACATTCTGC TGCTGAACGA GCCGATCGAA GTCGAAGAGA CCGACATCGA CACCGAGGTC
TCCGTCACGG ACCCAGATCA GCTCCAGCAG TTCCTCGACC GCGAGGAAGA GCAGCTTAAG
GAGAAGGTTC AGCAGATCGC TGACCTCGAC GCTGACGTCG TCTTCTGCCA GAAGGGCATC
GACGACCTCG CACAGCACTA CCTTGCCAAG GAAGGCATCC TCGCCGTCCG CCGCGCCAAG
AAGTCCGACC TCGAGTTCCT CTCGGAGGTC GTCAACGCGG CCATCGTCTC CGACCTCGAC
AGCGTGAGCG ACGAGGAACT CGGCCACGGC GACATCATCC GCGACGAGGA GGACGAACTG
TTCTACGTCG AGGGTGAGGA CGCCCACGGC GTCACCCTCC TGCTCCGTGG CTCCACCGAC
CACGTCGTCG ACGAACTCGA GCGCGGTGTC AACGACGCAC TCGACGTCGT CGCGCAGACC
GTCTCCGACG GCCGCGCCCT CGCTGGCGGC GGTGCGATCG AGGTCGAACT CGCCTCGCGC
CTGCGTGATT ACGCCGACTC CGTCTCCGGT CGCGAGCAGC TGGCCGTCGA GGCCTTCGCC
GACTCGCTCG AGCTCGTCCC ACGCGTGCTC GCCGAGAACG CTGGACTCGA CTCCATCGAC
ACGCTCGTCG ACCTCCGCGC CGCACACGAC GACGGCGACG TCGAGGCCGG CCTGAACGTC
TTCACGGGCA ACGTTGAGGA CACCTACGAC GCCGGTGTCG TCGAGCCAGC CCACGCCAAG
GAGCAGGCCG TGACCTCTGC CGCAGAGGCC GCGAACCTCG TGCTCAAGAT CGACGACATC
ATCTCCGCCG GTGACCTCTC CACCGACAAG GGCGACGACG AAGGCGGTGC CCCAGGTGCC
GGCGGCATGG GCGGTATGGG CGGCGGCATG GGCGGCATGA TGTAA
 
Protein sequence
MSQRMQQGQP MIVMSEDSQR VKDKDAQDYN ISAARAVAES VKSTLGPKGM DKMLVDSMGS 
VTITNDGVTI LQEMDIDNPT AEMIIEVAET QEDEAGDGTT TAVSIAGELL KNAEDLLEQD
IHPTAIIKGF HMASEQAREE INDIAVDVDT EDEDLLRSVA ETSMTGKGTE VNKEHLAELI
VEAVRQVTVE DDEGNNVVDL EFLNIETQTG RGVSESDLLE GGIIDKDPVH DNMPTSAEDA
DILLLNEPIE VEETDIDTEV SVTDPDQLQQ FLDREEEQLK EKVQQIADLD ADVVFCQKGI
DDLAQHYLAK EGILAVRRAK KSDLEFLSEV VNAAIVSDLD SVSDEELGHG DIIRDEEDEL
FYVEGEDAHG VTLLLRGSTD HVVDELERGV NDALDVVAQT VSDGRALAGG GAIEVELASR
LRDYADSVSG REQLAVEAFA DSLELVPRVL AENAGLDSID TLVDLRAAHD DGDVEAGLNV
FTGNVEDTYD AGVVEPAHAK EQAVTSAAEA ANLVLKIDDI ISAGDLSTDK GDDEGGAPGA
GGMGGMGGGM GGMM