Gene Nmag_2022 details

Gene Information

Locus tag: Nmag_2022
Symbol:
ID: 8824864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 2060248
End bp: 2061492
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003480154
Protein GI: 289581688
COG category:
COG ID:
TIGRFAM ID:
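
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the 1245 bp include one terminal stop codon (both assumptions fit the numbers shown here but are not stated by the record; the function names are illustrative):

```python
# Coordinates copied from the record above.
START_BP = 2060248
END_BP = 2061492


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1245
print(protein_length(length_bp))  # 414
```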


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGATC AGAACGACGA TGGCGAGGAA TTCGATGCGA TCGACACGGA GATGGCTCGT 
TGGTGGCGAA TGAGACGCTG GCGATACAGC GAGACAGTGC TCGCGCTGTG TACACTCGCG
TTCTTCGCGA CGATGGTTGG CCGGTTGGCG ATCAGTCCGG TCTTGCCGAT GATCACTGAG
GACTTCGATG TCACCAACTC GGTCGTTGGC GTCGCGATGA CCGGGATGTG GATGGCCTAC
TTCCTCTCGC AGTATCCGAG CGGGATCTTC GCAGACCGCT ACGGCGAGCG ACCGATCATT
CTGATCGCCG TCGGCGGGAC GGCAGTGACG AGTCTCTTTC TCGCACTGTC GCCCTTCTTC
GCGGTGTTCG TTTTCGGAAC GATTGCACTG GGTGCCGTCG CCGGCCTCCA CTATAGTGTC
GCGACGACGC TTCTGACCCG GACCTACGAC GATATCGGTG CTGCCATCGG GGTTCACAAC
AGCGGTGGCC CCGCTGCGGG TCTCGTCGCA CCGCCGATCG CTGGCTGGGT CGGTGTCACC
TACGGCTGGC GAGCGGCCGT CGCAATCGCC GTTCCCATCG CGATTCTGGT CTACGTGCTA
TTCTCCCGGT TTATCGACCC AACAGAGCCT CGGCGACCGA ACCAGTCCAT GCAGGAACGC
GTCGACGTCG GGGCGATCAC GGACCTCCTC TCGCGGCCGA AGATTGCGTT CCCAATCTGT
CTCGCCATCG CCGCGGCGTT CGTCTGGCAG GCAACCTCAA CCTTCCTCCC CACGTTCCTC
ACCGAACACC GAGAGCAGTC GACCGAACTT GCAGCTGTCG TCTTCGCGAG CTACTTCGTC
GTGCAGGCAA TCACGCAGGT TGGTGTCGGC GCCGTCTCGG ACCGCGTCGG GCGTGACTTC
GCGACGGCTG GCTGTCTGCT CCTTGCGGGA GTCGGCTTCG TGATTTTCGT CGTTGGTCCC
GGATTCGAAG CCGTCGTCGT CGGAGTGGTA CTGGTCGGAA CCGGTCTCGG CTGGGGAGCA
GCGCTTCTCC CGCGGTTTAT GGATGTCCTC TCTGACGAGG AACGCGGTGC CGGATTCGGA
CTCATCCGCA CGGTGTACGG CTTCATCGGC GCGCTCGGTT CGGTCGCGAC CGGGCTGTTT
GCCGACCTCT TTGGCTGGGG GGTTGCATTC CTCGTGTTGG CTGGCCTTCT CGGACTTGGG
TTCTGTGCGA TTCTGGTCAA CTGGCTGTTC TCGCTCGGGT ATTGA
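
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to compute a GC fraction from text in this layout. The helper names are hypothetical, and the example runs only on the first two blocks of the sequence, so it does not re-derive the 62% figure reported for the whole gene.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence above.
first_blocks = "GTGAACGATC AGAACGACGA"
print(gc_fraction(first_blocks))  # 0.5
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.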
 
Protein sequence
MNDQNDDGEE FDAIDTEMAR WWRMRRWRYS ETVLALCTLA FFATMVGRLA ISPVLPMITE 
DFDVTNSVVG VAMTGMWMAY FLSQYPSGIF ADRYGERPII LIAVGGTAVT SLFLALSPFF
AVFVFGTIAL GAVAGLHYSV ATTLLTRTYD DIGAAIGVHN SGGPAAGLVA PPIAGWVGVT
YGWRAAVAIA VPIAILVYVL FSRFIDPTEP RRPNQSMQER VDVGAITDLL SRPKIAFPIC
LAIAAAFVWQ ATSTFLPTFL TEHREQSTEL AAVVFASYFV VQAITQVGVG AVSDRVGRDF
ATAGCLLLAG VGFVIFVVGP GFEAVVVGVV LVGTGLGWGA ALLPRFMDVL SDEERGAGFG
LIRTVYGFIG ALGSVATGLF ADLFGWGVAF LVLAGLLGLG FCAILVNWLF SLGY
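
The gene sequence starts with GTG while the protein starts with M. Under translation table 11, GTG can act as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on short fragments of the sequences above. The `translate` function is a hypothetical helper, not the tool that produced this record, and its set of start codons is a non-exhaustive subset chosen for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq_text: str) -> str:
    """Translate a CDS, reading a start codon in first position as M."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("GTGAACGATC AGAACGACGA TGGCGAGGAA"))  # MNDQNDDGEE
# Last three codons of the gene sequence: GGG TAT TGA.
print(translate("GGGTATTGA"))  # GY
```

Both outputs agree with the first ten and last two residues of the protein sequence shown above.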