Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0398 |
Symbol | |
ID | 8823220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 389421 |
End bp | 390698 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003478548 |
Protein GI | 289580082 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0034127 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATC GTCCCCCGAC CGGCGGCTGG TATGCAAAAC GGCCCGGGCT GACCGTTGTC GCTGTCGTCG CCGCGTTACT CGGACTTCTT GTCATCCTGC CGTATCTACA GTACGTGCTG CTCGGCGTCG TCCTCGCGTA CATTCTCCGC CCTGCACAGC GCTGGCTCGA GCGCTACACG GGTCCGCTTA TCGCCGCGCT GACACTCGTC GCCGTCGCGA TTCTCGCGAT TCTCCTGCCG CTCACCTACG TCCTCATTGT CGCATTTCGG GAAGCACTCG GTCTCGTCGG TGCCATCCAG AACGGGCAAC TCGACATCGA GGCGATCGAG TCTCAGATCG AGGCGACCGG CTACTCGATT GACGTCGTCG AATTCTACCA GACCTACCAG GAGCCCATCG CGACGGGACT GCAGGGACTC GCGATGAGCG CGCTCGATAT CGCCGGCGGC GTGCCGGGGA TACTCATCGG CCTTACCGTG ACGCTGTTCG TCCTGTTTGC CCTTCTGCGA GACGGCAGCG AGTTCGCCGA GTGGGTCACG CGCGTCCTGC CGGTCGAAGA CGAACTGCTC TCAGAACTGC TCTCGGAGCT CGATCAGCTC ATGTGGGCGT CGGTCGTCGG CAACGTTGCA GTCGCCGCAA TTCAGGCTAT TCTCCTGGGT ATCGGGCTTG CGTTACTCGA AGTTCCTGCC GTCGTGTTGC TTTCGGTCGT GACGTTCGTC TTCGCGCTGT TGCCGCTCAT CGGTGCGTTC GGTGTCTGGG TCCCCGTTTC GTTGTATCTA CTCGCAACGG GACAGTTCGT CGTGGCCGGC ATACTCGTCG CCTACGGCTC GATCGTCAGC GCTTCGGATA CCTACATCCG GCCTGCGCTC ATCGGCCGGA CGAGCGCGTT CAACTCCGCG ATCATCGTCG TCGGTATCTT CGGCGGGCTG ATCATCTTCG GCGCGGTCGG TCTGTTCATC GGACCGGTCG TCCTCGGCGG CGCGAAGATC ACCCTCGACC TGTTCGCCCG AGAGCGCGCG GCGGGTACCC AGCCAGTTAC CGGTGCCGGA CAACCGGGCG AGGACGAGGG AGTGCAGACG GTCGGTACTG GTGTCGGAAC CGACAGGAAC GGCGACGACG ATGAGAGCAA CGGCGACCAC GAGACCCGCG GTGCAGGCGA GAGTGACACC GACACGGACG AAGCAACCGA CAAAACAGAC GACACCGACA CGGACGAAGC GACCGACAAA ACAGACGACA CCGCCGACGC CGACGGCTCG GATTCAAGTA CCCAGTAA
|
Protein sequence | MADRPPTGGW YAKRPGLTVV AVVAALLGLL VILPYLQYVL LGVVLAYILR PAQRWLERYT GPLIAALTLV AVAILAILLP LTYVLIVAFR EALGLVGAIQ NGQLDIEAIE SQIEATGYSI DVVEFYQTYQ EPIATGLQGL AMSALDIAGG VPGILIGLTV TLFVLFALLR DGSEFAEWVT RVLPVEDELL SELLSELDQL MWASVVGNVA VAAIQAILLG IGLALLEVPA VVLLSVVTFV FALLPLIGAF GVWVPVSLYL LATGQFVVAG ILVAYGSIVS ASDTYIRPAL IGRTSAFNSA IIVVGIFGGL IIFGAVGLFI GPVVLGGAKI TLDLFARERA AGTQPVTGAG QPGEDEGVQT VGTGVGTDRN GDDDESNGDH ETRGAGESDT DTDEATDKTD DTDTDEATDK TDDTADADGS DSSTQ
|
| |