Gene Nmag_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3902 
Symbol 
ID8826772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp298718 
End bp299878 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content61% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003482005 
Protein GI289583595 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.088151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGGG CCCGTCTCTT CACCTCGCTA TGCGTTCTCG TCTTCTTCAT CAACCTCGCC 
AGAATCGTCT TTGCGCCCCT TCTCAACGTC TTTATCAGCG AGTTCGGTAT CGGCGAGGCG
ACTGCAGGGC TCATCGTCAC GCTCGCCTGG ATCGGGAGTG CGGCCCCGCG CTTGCCGACC
GGCTGGCTTC TCACGAAAGT TCCCAGACAC TACGTCGTGA TTAGTTCAGG TTCCATTCTC
GCGGTGTCAT CCGCAATCGC TGCAACGGCG ACGACCGTCG AGCACCTGAT GGTCGGCGCG
TTCTTCATGG GGATCGCTTC GGGTGTTTAC TTCGTCTCGG CGAATCCACT GCTGAGCGAG
CTGTATCCGG AGCGGATCGG TCGAGTGATG GGCATCCACG GCGCTGCGAA CCAGATCGCG
GCCGTCGTCG CTGCGCCGTT CGTCGCACTC ACGCTGTTCG TCGACTGGCG ACTCTCCCTG
TGGGCGATTG CCGTCGGTGC TGCCATCATC ACTGTCTACA CGTGGTTCGT TGCCCGAGAA
ACCGAGATGC CCAGTGCGGG ACAGGCGGAT CGCAACTTCG TCGCCGGCGC GCTCTCGGAG
TGGCGGCTTA TCGCCACCGC GCTCGCTATC GTCGGTTTCG CGGTGTTCGT CTGGCAAGGC
CTGTTCAACT TCTACGAACT GTACATGATC CAGTCGAAGG GCCTCTCGGA TCGTGCAGCC
GGGATGATGC TCACGATCGT CTTCGCCACC GGCGTTCCAG CGTTCTACTT CGGCGGTGAC
TTCGCCGACA GGCTTCCGCA GATTCCGTAC CTCCTCGGTA TCGTCGGCGT CTTCGCCGTG
AGTGTGATCG TCCTGACGAT GGTCGAGAGC CTGATCGGGT TGATCGTCAT GTCCGTTGTC
GTCGGCTTCG TCATCCACTC GCTGTTTCCC GCGGTGGATA CGTTCATGCT CGATACGCTT
CCCGACTCGA CGCGCGGGAG TGCCTACGCC GTGTTTAGTT CGCTCTGGAT GGCGACGCAG
GCGCTTGGCT CCTCAGCCGT CGGGACGCTC ATCGAACAGG GATATTCCTA CGACGCGGTA
TTCACTGGCG GTGCGCTCTT GCTCGGTGCC TTGATCGTCG TTCTGACCAT CTTCGAGCGC
GCCGGCCGAC TACCGACGTG A
 
Protein sequence
MARARLFTSL CVLVFFINLA RIVFAPLLNV FISEFGIGEA TAGLIVTLAW IGSAAPRLPT 
GWLLTKVPRH YVVISSGSIL AVSSAIAATA TTVEHLMVGA FFMGIASGVY FVSANPLLSE
LYPERIGRVM GIHGAANQIA AVVAAPFVAL TLFVDWRLSL WAIAVGAAII TVYTWFVARE
TEMPSAGQAD RNFVAGALSE WRLIATALAI VGFAVFVWQG LFNFYELYMI QSKGLSDRAA
GMMLTIVFAT GVPAFYFGGD FADRLPQIPY LLGIVGVFAV SVIVLTMVES LIGLIVMSVV
VGFVIHSLFP AVDTFMLDTL PDSTRGSAYA VFSSLWMATQ ALGSSAVGTL IEQGYSYDAV
FTGGALLLGA LIVVLTIFER AGRLPT