Gene Nmag_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3891 
Symbol 
ID8826761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp286437 
End bp287651 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content61% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003481994 
Protein GI289583584 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTGA GTACCGTCTC GACACCGTCA CCCGAACCCG ACAGCCGTCG TAGCTGGGGC 
GTTGCACTCG CCGGTGCGAT TGCGATGGTG TTTACGTTCG GTACACCGCT CTCCTACGGC
ATTTTCCAGC AGCCGTTCAG CGAGACGTTC GCCGTCTCTC CCGTCGCTCT TTCCGGTGTT
TTCGCGGTCA TGTTGTTCAC CTTCTTTATT GGCTCCGGAC TCGTCGGCAT CTTCGCCGCA
CGGCTCCCCG TTCGAGGCGT ACTGCTGGTG TGTACCATCG TAACAGCACT TCTCGCGCCC
TCACTGTACG CAGTCGACTC GTATCTCGGG CTTACGTTCG TCTTTGCAGC CCTCGGACTC
GCGCTGGGGA CCGTTTTCGT GCTGGTCGCA TCCGTCGTTC CGCGGTGGTT CGACGAGCGG
CGGGGCGCTG CAACTGGATT GATCTTCGTC GGTAACGGGC TCGGATTGTT CGTTCTGCCA
CCAATCTGGC AGTACGCACT CAGCACCGTT GGTGTTCGCG AGGGTTTCCT GATCATCATA
GCGACGACGG CCAGCGCGTT CTTCCTTGCA AGCCTGCTCT GTCGGCGACC ACCCTGGGCC
ACACGGTCGA CGGACTCGAA TAGCGCGCTC GTGTCCTGGC TCAGCGGACT CATCAGGGCG
CGAACCTTCC AGTTGCTCTT CGTGGGGATG TCACTGGCGT TTGCCTGGTA TCAGCTCCTT
GCGGCGTACG CGATCGACCT GTTCGCTGCG CGCGGGTTGA CCGAAGCTGG GGCGTCAACG
TTGTTCGGCC TGATCGGTGG CGTGAGTATT ATCTCCCGAA TCGGTGGTGG GTATATCGCC
GATATTGTCG GGTCGCGTCG GGCGTTTCTC GCATCACTCG GCTGTGCCGC CGTCGGAATC
GTCCTGTTGC TCGTCCCGCA GTACGCGATA CTCACCGTTG CTGTCTTCAG TATCGGGTTG
GGTCTCGGTG GCTGTGCAAC CCTGTACATC CCGTTGCTGA TGGAGACCTA CAATCCCGAG
AAAGACACTG CGATTATCGG TGTCTTCAAC GTCGGCGGTG GGATCGGTGC GCTGGCGATG
CCGCCACTTG GAACGGCGAG CGTCGCCTAC ACCGAGAGCT ACACCGTCGC AATCTTGCTC
ACACTCGGGG TGACAATCGT CTCGTTCTGG GCTGTGGTCG TCGGAACTGC TGGAGGCGCA
TCGACTCAGT CCTGA
 
Protein sequence
MDVSTVSTPS PEPDSRRSWG VALAGAIAMV FTFGTPLSYG IFQQPFSETF AVSPVALSGV 
FAVMLFTFFI GSGLVGIFAA RLPVRGVLLV CTIVTALLAP SLYAVDSYLG LTFVFAALGL
ALGTVFVLVA SVVPRWFDER RGAATGLIFV GNGLGLFVLP PIWQYALSTV GVREGFLIII
ATTASAFFLA SLLCRRPPWA TRSTDSNSAL VSWLSGLIRA RTFQLLFVGM SLAFAWYQLL
AAYAIDLFAA RGLTEAGAST LFGLIGGVSI ISRIGGGYIA DIVGSRRAFL ASLGCAAVGI
VLLLVPQYAI LTVAVFSIGL GLGGCATLYI PLLMETYNPE KDTAIIGVFN VGGGIGALAM
PPLGTASVAY TESYTVAILL TLGVTIVSFW AVVVGTAGGA STQS