Gene Information |
Locus tag | Nmag_4098 |
Symbol | |
ID | 8828832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | + |
Start bp | 139807 |
End bp | 143001 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003482182 |
Protein GI | 289937580 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
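The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 139807
END_BP = 143001


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "3195 1064"
```

Both results agree with the Gene Length (3195 bp) and Protein Length (1064 aa) fields of the record.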
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTTCGC GCGGTATCCG TGTCGGCCTG GGTTTCCCCG ATCAACCGGG CCGACCAAGC CAACACCGAA GGGTCAATCT GGCTCCAGTT ACCTCGAATA AATTTTCATT CACTGCAAGT AGAATACCAG TTTTACGTGA CATTCGTGGT AGTAGTTCTA TGTCACATCG TAGCGTTTTC CTGGCAGCCT GTATCGGCAT ACTGGTTGTC TGCTGTGCAA TCGCACTCGT TCCCGCGGGG GTCAGTGCAA CCGACGACGA TACACCCGAC CCTGACGAGT ACGACTCACT CGTGGACGGA ATGGACGGAA ATGGCACGGC AGATGACCCT TTCCACGTCA CGAACGTGAC GGAGTTGCAG GCGATGGAGG CGAACCTATC GGCGCACTAC AAACTAGTTT CGCCGATCGA CGCGAGCGAG ACGGCCGACT GGAACGACGG TGATGGTTTC GATCCGATCG GCGCGTGCGA CTTCAACGCA ACAATCGACG AATGTGAGGA AACCCCGTTT GAAGGGTCGT TTGACGGCGG GCTGTATCCG ATTGCCGACC TCACGATCGA CCGTGAGAAC GAGTCTGAGG TGGGGCTTTT CGGATACGTC TTTCCCGACG GTGAAGTTGT CGATATCAAA CTGCGTGACG CCCGCGTGAC CGGCAACGTC GAGGTAGGTA CCGTCACGGG ACTGAACTTC GGGTCGATCG AGGGTGCGGA CGTTACCGGA ACCACTTCAG GCGACCTCGA CGCTTTCCCG GGACGGATCG GCGGACTGGC CGGGTACAAC GGCGGCGAAG TGCGCGACTC GTTCGTCGAC AACGATGTGA TCGGACCGAG CTTCACGGGC GGTGCCGTCG GCTCGAACAA CGGGACCATT GTCCAGACCC ACGCGAGCGG CGATGTCGAG GCGTTGGACA TCTCTGGGGG CCTTGTTGGT ACGAACGTCG GGGACGGCGA GGTCGGACAC TCGACTGCCA GCGGCGACGT GAACGGCACT TTCAGTGGAT TTGGTGGATT TGTCGGTACC CACGTGGCTG GAGAGGGGAT CATCTACGAA TCGTACGCGA CCGGTGACGT CTCCGGTGAA ACTATCAGCG CGGCCGGCGG GTTCGCTGGA TCGAACTCCG CCCTCACCGG CGAGGCCGTA ATCTACGACG CGTACGCGAC TGGGAACGTG AACGGCGAGG ACCGTCTTGG TGGCTTCGTC GGCGACCTCG GATCCAGTGC GTTCGTGGAG ATGTCATACG CAACCGGATC CGTTTCCGGT GCGCCAGACG AAGCGAGCGT TGGCGGCTTC GCCGGCAACG TCCCCGACCG AGACGCCGTC GCGATCACTG GCTCGTACTA CGACGAGGTA ACAAGCCAGC AGGATGAGGG TATTGGCGTC GGTGACGGTG ACGTGACCGG GCTGCCGACC CAGAACATGA CGGATGACGC GGCGGCCGAG AATATGACCG CGTTCGACTT CGCTGCGATC TGGACGACGC TCCCCGACGA CTACCCGACG CTGCAGGCGC TCGATCCGGA ACCGATGCCG CCAGACCCGC CGAACTTCGC CGTGACGATC GACGAAACGA CCAGTCCCGT CACCGAGGGT TCACCCCTCA ACGTGAACAC GACCATCGAA AACGTCGGTG AACAGGCGAC CGAGCAAACC GTGTCGCTAG AGGTTGCCGG CGACCAACGA GACGTGACGA CGGTGGCGCT CGGCGGTGAC GAGTACGAAA CCGTGGTACT CACCTGGGAG ACCGAGGTGA ACGCCGCCGG TGACTACGAC GCCACTGTCT CCAGTGATGA CGACTCCGAG 
ACCGTGCCCA TCACCGTCGA AGAACAGCCC GACGATGCCG TCTTCGACGT GACGATCGAC GAGACCAACA GTCCCGTCAC GGAAGGAGGC AATCTGCTGG TGAATGCTAC CGTTGAGAAC ACCGGCAATC TTGCCGACGA ACAGGACGTC AATCTGACGA TCAACGGTGA CGAAGTGGAT GCAACGGTCG TCAGCCTCGA CGGTGACGAA ACCGAAGAGA TAACGTTCAC CTGGGAGACC GAGGAGAACG ACGCCGATGA CTACGACGCC ACCGTCTCCA GTGATGATAA CTCCGAGACC GTGCCCATCA CCGTCGAAGA ACAGCCCGAC GATGCCTTCT TCGACGTGAC GATCGACGAG ACCAGCAGTC CCGTCACTGA GGGCGAGGAA CTTGAGGTCG GTGCCACGAT TGAAAACACC GGCGATCTTG CTGACGAACA GGACATCAAA CTCTCCATCA ACGATAGCGA GATGGACGTC ACGACGGTGG ACCTCGACGG TAACGAGACC GAAGAGGTCA CGCTTATTTG GGAGACTGAA AAATCCGAGG CCGGTGACTA TGTGGCCAAC ATTTCAAGTA ATAACGACCT CGACATGGCA AATGTATCGG TCGGAGAGAA ACCGGACCCA CCAACGCCCA CTCCACCGAC ACCGGATCCT GCGTTCTTCG ATGTGACTGT TGATAATACA ACCAGTCCCG TTACAGAGGG TGAAAAGTTA CTAGTCAACG CCACGATCGA GAACACCGGC GACCGGTCCG ACAAACAGGA CATCAACCTA ACAATCAACG GTAACGAGGT CAACGTCACA TCGATCGAAC TCGACAGCGA TGAAAGCGAA AAAGTGACAC TTACGTGGGA AACTGAAAAA TCCAATACTG GGGAGTACGT CGTTACAGTT TCAACGAAAG ATAATACTGA TATGGAAAAT GTCACCGTAA ACGCAATTAA ACCTGCATTC TTCACCGTCG ATATCAAAGA AGTCACTGAC TCGGTTCATA TCAATGAAGA AGTGTGTGAA AAAGCATATA TTACGAATGT CGGTGAGGAA GTAGACACCC AGAACGTTGT GTTGGATATT GACAAACAAG AAGGTGTCGA CAGCACAACA GTCACGCTGA AGCCCAGCAA GTCACAGAAG GTGACACTCT GCCACGAATG GATTACTGCG GATGCAGACA AGGACGTCCC TATGACTGTT CGTAGCGATA ACAGTGCGGA GACGGTCAGT GTCAGTATCA TCGGATCCGA GCCGGTGAAA GAAGACGATG ATGAGGAAAC AGAACCTGAT GTGTTAGACG ATGATGAAAC AGCTGACGGG ACACCAGGGT TTGGTGTTGT AGGCACCCTC ATTGTGGTTC TCATGGCGGT AGCACTTGCT CATCGACGCC GCTGA
|
Protein sequence | MVSRGIRVGL GFPDQPGRPS QHRRVNLAPV TSNKFSFTAS RIPVLRDIRG SSSMSHRSVF LAACIGILVV CCAIALVPAG VSATDDDTPD PDEYDSLVDG MDGNGTADDP FHVTNVTELQ AMEANLSAHY KLVSPIDASE TADWNDGDGF DPIGACDFNA TIDECEETPF EGSFDGGLYP IADLTIDREN ESEVGLFGYV FPDGEVVDIK LRDARVTGNV EVGTVTGLNF GSIEGADVTG TTSGDLDAFP GRIGGLAGYN GGEVRDSFVD NDVIGPSFTG GAVGSNNGTI VQTHASGDVE ALDISGGLVG TNVGDGEVGH STASGDVNGT FSGFGGFVGT HVAGEGIIYE SYATGDVSGE TISAAGGFAG SNSALTGEAV IYDAYATGNV NGEDRLGGFV GDLGSSAFVE MSYATGSVSG APDEASVGGF AGNVPDRDAV AITGSYYDEV TSQQDEGIGV GDGDVTGLPT QNMTDDAAAE NMTAFDFAAI WTTLPDDYPT LQALDPEPMP PDPPNFAVTI DETTSPVTEG SPLNVNTTIE NVGEQATEQT VSLEVAGDQR DVTTVALGGD EYETVVLTWE TEVNAAGDYD ATVSSDDDSE TVPITVEEQP DDAVFDVTID ETNSPVTEGG NLLVNATVEN TGNLADEQDV NLTINGDEVD ATVVSLDGDE TEEITFTWET EENDADDYDA TVSSDDNSET VPITVEEQPD DAFFDVTIDE TSSPVTEGEE LEVGATIENT GDLADEQDIK LSINDSEMDV TTVDLDGNET EEVTLIWETE KSEAGDYVAN ISSNNDLDMA NVSVGEKPDP PTPTPPTPDP AFFDVTVDNT TSPVTEGEKL LVNATIENTG DRSDKQDINL TINGNEVNVT SIELDSDESE KVTLTWETEK SNTGEYVVTV STKDNTDMEN VTVNAIKPAF FTVDIKEVTD SVHINEEVCE KAYITNVGEE VDTQNVVLDI DKQEGVDSTT VTLKPSKSQK VTLCHEWITA DADKDVPMTV RSDNSAETVS VSIIGSEPVK EDDDEETEPD VLDDDETADG TPGFGVVGTL IVVLMAVALA HRRR
|
| |
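The sequences above are printed in space-separated blocks of ten letters, so they need cleaning before any downstream use. The following is a minimal sketch of how the gene sequence could be normalized and summarized, assuming plain-text input copied from this record. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases of a cleaned sequence form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Illustration on the first ten-letter block of the gene sequence only;
# paste the full Gene sequence field here to summarize the whole CDS.
first_block = clean_sequence("GTGGTTTCGC")
print(gc_percent(first_block))  # prints 60.0
```

Applied to the full gene sequence, `gc_percent` can be compared against the GC content field, and `ends_with_stop` against the terminal `TGA` of the CDS.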