Gene Snas_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1047 
Symbol 
ID8882232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1107089 
End bp1108252 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003509850 
Protein GI291298572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.743504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCA CGGCCGCGTT CATCGTGCAC GGACTCGTCT TCTCGTCCTG GCTGCCCCAC 
ATCCCCGCCA TCAAGGACGA CCTGCGGTTG TCGGAGGGCA CGCTCGGCCT GGTGCTGCTG
GCGCCGCCGT TGGGCGCGAT CGTCGCGATG TCGCTGACCG GCGCCGCCTG TGCCCGCTGG
GGCTCCGCCG CGGTCACCAG GGTCACCCTG GTGGTCTACG CGCTGGGCAT CACCGCGATC
GGCGTCGGGG CCGGGACCAC GTGGGGTCTG TCGCTGGCGC TGCTGTGGGC CGGGGCGCTG
GTGGGGTCCT TCGACGTGGC CATGAACGCC CAGGGCGCCA CGGTCGAGAA GGCGATGGGC
AAGTCCATCA TGGGGTCCTT CCACGCCGCC TGGAGTCTGG CGGCGGCCGC CGGGGCCGGG
ATCGGCGGCT GGGTGGCCGC CGTGGACGAA GACCTGTTCA CGACGCAGCT GTTCGCGGTG
GGCATGATCG CGCTGCTGGC GGCGCTGCCG TTCTTCACCT CCTTCATCCC CGACGCGCCA
CCCGAAGCCC ACGCGAAGGG TCGCAAGTGG AGGTTCGAGC GCGGCCTGGT GCTGCTGTCC
ATGGTGGCCT TCGCGGGGCT GCTGGCCGAG GGCGCGGTCG CCGACTGGAG CGCGGTGTTC
CTGTCCCAGG AACGCGGCGC CTCACCGATG GTCGCGGGCT GGGCCTACGC GGTGTTCTCG
GTGGCGATGC TGATCGGACG GCTGGCCGGG GACAGGCTTG TCGGCCGGTT CGGACGGTCC
CGCAGCGTCG CCGTGGCGGC CCTGACCGGT GGCGGCGGGA TGGCGGTGGG CCTGGTGGTC
TCGCAGCTGG CCGGGGACAG TGGGCTCGGC CAGGCCTCGT TCATCGCGGG GCTTTTCATT
CTGGGCCTGG GCATCGCGGT GATCGTGCCG GTGGCGTTCT CCTCGGCCGG GGACGGGCCG
GGCATCGCGA CGGTGTCGAC CGGCGGCTAC ACCGGCTGGC TGCTGGGACC GGCCGTCATC
GGCGGCCTGG GGGAGCTGAT GGGGCTGTCG GCGGCGATCT GGGTCGTGGC GGTGCTGGCC
GTGTTCGCGG GACTGGTCGC GCCCCTGGGC ATCGGGGCGC TGCGCGGCGC GTCCGACAAG
GAGAAGGCAG CCGCGGCGCC GTGA
 
Protein sequence
MAVTAAFIVH GLVFSSWLPH IPAIKDDLRL SEGTLGLVLL APPLGAIVAM SLTGAACARW 
GSAAVTRVTL VVYALGITAI GVGAGTTWGL SLALLWAGAL VGSFDVAMNA QGATVEKAMG
KSIMGSFHAA WSLAAAAGAG IGGWVAAVDE DLFTTQLFAV GMIALLAALP FFTSFIPDAP
PEAHAKGRKW RFERGLVLLS MVAFAGLLAE GAVADWSAVF LSQERGASPM VAGWAYAVFS
VAMLIGRLAG DRLVGRFGRS RSVAVAALTG GGGMAVGLVV SQLAGDSGLG QASFIAGLFI
LGLGIAVIVP VAFSSAGDGP GIATVSTGGY TGWLLGPAVI GGLGELMGLS AAIWVVAVLA
VFAGLVAPLG IGALRGASDK EKAAAAP