Gene Information |
Locus tag | Nmag_4021 |
Symbol | |
ID | 8828755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | + |
Start bp | 62065 |
End bp | 63510 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | protein of unknown function DUF35 |
Protein accession | YP_003482113 |
Protein GI | 289937511 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.605764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGCTA TTACTGCTGT CGGCGCGTAC GCACCGCGAT TCCGTATTAC GGCCGAAGAA TTCGCAGACG CCTGGGGTCA CTTCCAGGCC TCCGGTATCT CCGAGAAGGC CGTTCCCGCG GCCGACGAGG ACGCCCTGAC GATGGGCTAC GAGGCTGCAA CGCGCGCACT CGAGTCAGCC GACCTTACTG GCGAGGCTAT CGACTGGCTT GGCTTCGCAT CCTCTCGCCC ACCGCTCGCA GAGGAGGACT TGACGGCCCG TCTGGGCGCG ATGCTCGCCG TCAGCGAGGC GGCCACTCGT CACGTCTTCA CCGGCAGCAC GCGCGCCGGC ACCCGCGCGC TCTGGGCCGG CATAGACGCC GTCGCTGCTG ACGAGACCAC GACGGGGCTC GTCGTCGCCG CCGACGCACC GGTCGGTGAA CCCGACAGCG AACTCGACCA CGCCGCCGGC GCAGGCAGCG CTGCGTTCGT CCTCGAATCG AGCGGCCCCG CCGAGATCGT CGACCGCGCC GAGTACTCCC GCCCCTACCC CGGCACCCGC TTCCGGAACA CCGGCGAGGA GGAGACTCAG GGCCTCGGCG TCACCCAGTA CGACCGCCAG GCGTTCACCG AAACCATCGC TGGCGCTGTG GCTGCACTCG AGTCCAATTC CAACGACGAC CTCGAGCCAG CGGCCGCCGC CATCCAGGCA CCGAACGGGA AACTCCCCTA CCGCGCCGCC GGCGCGGCCG GCGTGGGCAC CGACGAGATC CAGGCCGCTG CAACGGTTCA CGACCTTGGC GACCTCGGTG CCGCGAGCGT CCCCGTCTCG CTCGCCAGCG CGCTCGCAGA GGGTCACGAG TCGATTCTGG GCGTCTCATT CGGTAGTGGA GCCGGTGCTG GCGCTTTCTT GCTTACCGTT GATGGCGAGG TTCCGACTGA AACGGCCCTC GAAGGCGGCG ATTCGCTCTC ATATGCTGAA TACCTTCGTC AGCGGGGCGT CGTCACCTCG GACCCTCCAG CTGGTGGTGG GGCATACGTT AGCGTCCCGT CGTGGCGACG GTCGATCCCA CAGCGATACC GACTTGAGGC GGGCCGTTGT CCCGAGTGTG GAGCAGTGAC CTTCCCACCA GAGGGTGCCT GTGCCAGCTG TGGCTCGCTT GATGAGTATG ACTTCACCGA GCTTTCTGGC GATGGGGTTG TCGAGGCTGT AACGACAATC TCACAGGGTG GTGCGCCCCC GGAGTTCGCG ACCCAGCAGT CACAATCGGG TGACTATGCG GCCGCAATCG TTGCGTTTGA CGTTGCAAAC GGTGAGGAGA CCGTCAGCGT TCCGGTAATG GGAACTGACG CTGCGCCTTC AGCGTTTGTC GTCGGAGACC GTGTCGAGAC GACGATCCGT CGGATCTACA CGCAAGAGGG TGTGACACGA TACGGGTTCA AGATTCGACC ACCACACGAC GACTAA
|
Protein sequence | MVAITAVGAY APRFRITAEE FADAWGHFQA SGISEKAVPA ADEDALTMGY EAATRALESA DLTGEAIDWL GFASSRPPLA EEDLTARLGA MLAVSEAATR HVFTGSTRAG TRALWAGIDA VAADETTTGL VVAADAPVGE PDSELDHAAG AGSAAFVLES SGPAEIVDRA EYSRPYPGTR FRNTGEEETQ GLGVTQYDRQ AFTETIAGAV AALESNSNDD LEPAAAAIQA PNGKLPYRAA GAAGVGTDEI QAAATVHDLG DLGAASVPVS LASALAEGHE SILGVSFGSG AGAGAFLLTV DGEVPTETAL EGGDSLSYAE YLRQRGVVTS DPPAGGGAYV SVPSWRRSIP QRYRLEAGRC PECGAVTFPP EGACASCGSL DEYDFTELSG DGVVEAVTTI SQGGAPPEFA TQQSQSGDYA AAIVAFDVAN GEETVSVPVM GTDAAPSAFV VGDRVETTIR RIYTQEGVTR YGFKIRPPHD D
|
| |
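
The numeric fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the source database. It assumes the Start bp and End bp values are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers. It also shows how the 10-character blocks of the sequence field could be joined before computing a GC fraction.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def clean_sequence(raw: str) -> str:
    """Join the space-separated 10-character blocks into one string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values copied from the record above.
length = gene_length(62065, 63510)
print(length)                           # 1446, matches "Gene Length"
print(expected_protein_length(length))  # 481, matches "Protein Length"

# First two blocks of the gene sequence field, as an example input.
print(clean_sequence("ATGGTCGCTA TTACTGCTGT"))
```

Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field, which the record reports only as a rounded percentage.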