Gene Nmag_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1001 
Symbol 
ID8823832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1021117 
End bp1022700 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content63% 
IMG OID 
Productphytoene desaturase 
Protein accessionYP_003479147 
Protein GI289580681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.917372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCAC TATCCGGGAA CTCCGTCGTC ATCGTCGGGG GCGGTGTCGG CGGCCTCTCG 
ACCGCCTGTT ATCTCGCCGA CGCCGGCGCT GACGTCCGCG TCGTCGAGCA AAACGACCAA
CTCGGCGGCC GCGCGAGCCG TCTCGAACGC GACGGGTTCC GGTTCGACAT GGGACCATCC
TGGTATCTGA TGCCCGATGT CTTCGAGCGG TTTTTCGCTA ACTTCGACCG AACGCCAACG
GACTACTACG ACCTGACACA CCTCGACCCC CACTACCGAA TTTTCTTCAA AGACAGCACG
AACCCAGATC TCGAATCAGC AGGGACTCGA GTACGCACGG ATGGCGACAC CCCAGCGACA
ACCACGATGA CGGCCTCGTC TCAGATACCG GGCGACCGGA TCGACATCAC ACCGGATCTG
GAGCGAACGA AGGAACTGTT CGAGATGTAC GAGGAGGGCG CTGGTGAGGC ACTCGAGCGC
TATCTCGAAA AATCTCGGGA GAACTACGAG GTCGGCATGG AGCACTTCGT CTACAAGGAT
CGGCCACGGC TGCGAGATTA CATCGACCCG GCCGTTGCGC GACAGGCGCG CGGGCTCTCG
TTGCTCGGAT CGATGCAGGG CCACGTCGAG AACTACTTCG ACCATCCGCA GTTGCAACAG
ATCATGCAGT ACACGCTGGT CTTTTTGGGC GGGTCGCCGT CGAACACGCC GGCGCTGTAC
AACCTGATGA GCCACGTCGA TTTCAACCTC GGCGTCTGGT ATCCCGACGG CGGTATCGGG
GCCGTTATCG ACGCCTTCGT CGACCTCGGA CGAGAACTCG GCGTCGAATA CGACACCGGG
CGACCGGTGA CGAAAATCAA GGGCCGGACG GGCGCGTTCC TCGTCGAAAC GACCGACGGT
GCGCTCCGAC CGGATCTGGT GGTGAGCAAC ACTGATTACG CACACGCCGA ACAGGACCTG
CTCGCGCCCG AACGACGCGG CTACGACGCC GACTACTGGG AGTCCCGAAC CTACGCCCCC
TCTGCGTTCT TGCTCTATCT CGGCGTCGAG GGCGACGTCG ACGACCTCGC CCACCACACG
CTCGTTCTCC CGACCGACTG GGAGGAGCAC TTCGATCAGA TCTTCGAGGA TCCGCAGTGG
CCCGAGGACC CGGCGTACTA CCTCTGTGTG CCCTCCGAAA CCGATGATAC TGTCGCACCC
GACGGCCACA GCGCGCTGTT CGTCCTCGTT CCGATCGCAC CGGGACTCGA GGACACCCCC
GAAATGCGCG ACCAGTACCG AGAACTCATC CTCGACGATA TCGAGACGTA TACGGGAACC
AGCCTCCGGG ATCGGATCGT CTTCGAGGAG GACTTCTGTG TCTCCGACTT CGCCAGTCGG
TACAACAGCT ACGACGGGAC CGCACTCGGG CTCGCCCACA CGCTCAGACA GACGGCCCTG
TTCCGGCCAC CACACCGCTC GAAGCAGGTT GACGGACTCT ATTTCACCGG GGCCGACACG
ACGCCGGGGA TCGGCGTCCC GATGTGTCTG ATCAGCGGCG AGCTGACGGC TGAGGCCGTG
CTGGGGGACC ATGGCGGTGC CTGA
 
Protein sequence
MQPLSGNSVV IVGGGVGGLS TACYLADAGA DVRVVEQNDQ LGGRASRLER DGFRFDMGPS 
WYLMPDVFER FFANFDRTPT DYYDLTHLDP HYRIFFKDST NPDLESAGTR VRTDGDTPAT
TTMTASSQIP GDRIDITPDL ERTKELFEMY EEGAGEALER YLEKSRENYE VGMEHFVYKD
RPRLRDYIDP AVARQARGLS LLGSMQGHVE NYFDHPQLQQ IMQYTLVFLG GSPSNTPALY
NLMSHVDFNL GVWYPDGGIG AVIDAFVDLG RELGVEYDTG RPVTKIKGRT GAFLVETTDG
ALRPDLVVSN TDYAHAEQDL LAPERRGYDA DYWESRTYAP SAFLLYLGVE GDVDDLAHHT
LVLPTDWEEH FDQIFEDPQW PEDPAYYLCV PSETDDTVAP DGHSALFVLV PIAPGLEDTP
EMRDQYRELI LDDIETYTGT SLRDRIVFEE DFCVSDFASR YNSYDGTALG LAHTLRQTAL
FRPPHRSKQV DGLYFTGADT TPGIGVPMCL ISGELTAEAV LGDHGGA