Gene Information |
Locus tag | Nmag_3502 |
Symbol | |
ID | 8826370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3638233 |
End bp | 3639462 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003481614 |
Protein GI | 289583148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
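The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue; the stop codon encodes none.
    return gene_len // 3 - 1


length = gene_length(3638233, 3639462)
print(length)                  # 1230
print(protein_length(length))  # 409
```

Under these assumptions the values agree with the Gene Length (1230 bp) and Protein Length (409 aa) rows of the record.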
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.114524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGTCA CGAAGCGGGT CCAGCAGTTC GCACACTTCG ACGTGCTTCT GGTGACCGCC GGGATCTGGT TTCTGGCGAA GTTCCTTCGC TATGTCTTTC CACCTCTCTT TGGCTCGTTT CAGGAAACCT ACGCGGTCTC GAACGCCGCC CTCGGCGCTG CGTTCACCGG CTTCATGCTC GTCTACGCCG CGATGCAGTT CCCCTCCGGC ATGCTCGCCG ACCGCCTCGG CTCGGTCACC GTTATCATAG CGGGCGTCAC CATCGCCGCC GTCGCCGCGT TCGCACTCGT CGTCGACTCG CCCTTTGCGA TACTCGTGGT TGCCATGCTC GTCATGGGCG CGGGGACCGG CGCGCACAAG ACCGTCGCAG TCCGGCTGCT TGCTCACGCC TATCCTGCAC GAACAGGGCG GGCACTCGGT ATTCTCGACA CCTTCGGGAC CTTCGGCGGC GTCGTCGCCC CCTGGGCCGT CGTGCTCGCG GCGGGAATCC CGTTCGCACT CGGTGCGAGC TGGCGCGTGA TCTTCCTCGC CGCCGGTGTC GTCGGCCTCG CACTCGCTGT CCTGTTCTGG GTTCGCGTGC CACAACGAGT CCCAACCGAG ACGGAGGGCG ACGGAACTAG CGGCGTCGAA GTAAACGAAC TCCGCCGGTA CGCCGCACTC TTTCGTGACT GGCGATTTTC GGCGTTCGCG CTTCTGACTG TCCTGTTCGC GTTCACCTAC AACGGACTCG TCGCGTTCGC GCCGCTGTAC CTCACCGACG AGGCTGGGCT CACGGCGGCG ACCGCCAGCG TGCTCTACAG CGGACTCTTC CTCGCGAGTC TGGTTCAACT GGTCACCGGC GACCTGAGCG ACCGGGTCGG TCGACTCCCC ATCATCACCG CGACGCTCGG CCTCGCAGCC CTCTCGCTCG GTGCGTTCGT CGCGCTGACC GATGTCGCTG GCCCGGTCGT GCTCGGCATC GCCCTCATCG CGGCCGGCAT CGGCTCCCAC GGCTTCCGTC CCGTCAGGGG CGCATACCTC ATGTCCGCGA TCCCCAACGA CCTGGCCGCC GGCGGGCTCG GCGTCGTTCG AACGCTCCTG ATGGGTGCGG GGGCAATCGC ACCCGCGATC ATCGGCGCGA TGTCCGAAAC CGTCGGCTTC CGTCCCGCGT TCTGGCTGCT CACCGCCGCC GTGTTCGGTG CAACGCTCCT CGCAACCATC CTCTGGGTTA CCGACGAAGA CGGAGCGTGA
|
Protein sequence | MSVTKRVQQF AHFDVLLVTA GIWFLAKFLR YVFPPLFGSF QETYAVSNAA LGAAFTGFML VYAAMQFPSG MLADRLGSVT VIIAGVTIAA VAAFALVVDS PFAILVVAML VMGAGTGAHK TVAVRLLAHA YPARTGRALG ILDTFGTFGG VVAPWAVVLA AGIPFALGAS WRVIFLAAGV VGLALAVLFW VRVPQRVPTE TEGDGTSGVE VNELRRYAAL FRDWRFSAFA LLTVLFAFTY NGLVAFAPLY LTDEAGLTAA TASVLYSGLF LASLVQLVTG DLSDRVGRLP IITATLGLAA LSLGAFVALT DVAGPVVLGI ALIAAGIGSH GFRPVRGAYL MSAIPNDLAA GGLGVVRTLL MGAGAIAPAI IGAMSETVGF RPAFWLLTAA VFGATLLATI LWVTDEDGA
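The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is given in whitespace-separated blocks of ten bases as shown above. The function name `gc_content` is a hypothetical helper, and the short input used here is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two 10-base blocks of the gene sequence, for illustration only.
print(gc_content("ATGTCGGTCA CGAAGCGGGT"))  # 60.0
```

Passing the full gene sequence string to the same function would give the value to compare against the GC content row.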