Gene Snas_4039 details

Gene Information

Locus tag: Snas_4039
Symbol:
ID: 8885240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4308883
End bp: 4310112
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003512784
Protein GI: 291301506
COG category:
COG ID:
TIGRFAM ID:
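
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4308883
end_bp = 4310112
protein_length_aa = 409

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon is a
# stop codon and contributes no amino acid to the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1230
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 409
```

Under these assumptions the computed values agree with the reported gene length (1230 bp) and protein length (409 aa).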


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0402571
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCAAGC CCTACCGCGA GATCTTCGCC GCCCCAGGCA GTCTGCGCTT CAGTCTCGCC 
GGATTCGTCG CCCGGATGCC GCAGTCGATG CTGCCCATCG GCCTGGTCGC GATGCTGTCC
GAACTGCGCG GCCAGTACGG CCTCGCCGGA GCCGTCTCGG CCACGTTCAC GCTGTCGATG
GCGCTGCTGA GCCCGCTGGT GTCGCGACTG GTCGACCGGC ACGGACAACG CCGGATCCTG
GTGCCCGCCA TGGCCGTCAG CGCCGCGTCC ATCACCGGCG TCCTGCTGAG CGCCCACTTC
GGAGCGCCCG CCTGGACCCT GTTCGCCTTC GCCGTCCCGG CCGGAACCCT GCCGACCATG
AGCGCCATGG TCCGGGCGCG CTGGACCGAG ATCTACCGGG GCAGCGACGC CATGACCACC
ACCCACTCCT TCGAGTCCGT CGTGGACGAA CTGACCTACG TCACCGGACC GGCGGTGTCG
ATCCTGCTGT CCACCGCCGT GTTCCCGCAG GCCGGGCCGC TGCTGGCCAT CGTGCTGCTG
GTCGCCGGGG TCGCGGCCTT CGCAGTACAG CGCCGCACCG AACCCGCACC CCGCCCCGCC
GAGACCGGCG GCGGCTCGGC GATCCGGCGC GCACCCTTGC GGCTGTTGGT GTTCGTCCTG
TTCGCCGGGG GAGTGGTGGT CGGCACCGTC GACGTCGTCA GCGTCGCCTT CGCCGAACAG
CAGGGCATCA CCGCCGCCGC CGGAATCGTC GCCACCTGCT ACGCGCTGGG CTCCGGCATC
GCCGGTTTGG CCTTCGGTGC CTGGAAGCCC CGCATCGCGC TGCCGAAACA GCTCGTCATC
GGCGCGGCCG GAACCGCCGC CACCACGCTG CCGTTCCTGC TGGCCGCCGA CATCGCGAGC
CTGTCGGCCG CCGTGTTCGT CGCCGGAGCC TTCTTCGCGC CCACGATGAT CATCGTCATG
AGCCTCATCG AGAAACTGGT GCCACCGGCC CAGCTCACCG AGGGATTGAC CTGGGCCGCC
ACCGGCGTCA GTATCGGCAT GGCTGCCGGA GCCGGGGCGT CGGGCTTCGT CGTGGACGCC
TTCGGCGCCA CCACCGGCTT CACCGTCGCC CTGTGCGGCG GCGTGCTGGC CCTGGCCGCG
GCGACCGTCG GCCTGCGGAT GCTGACCAGG GCGCTGCGCT CGCGACCAGG GGCCACCGAA
AGGATGCCGG TGACCAGCGC CGCCGGGTAG
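
The GC content field can be recomputed from a sequence printed in this layout. The sketch below assumes the usual definition (the share of G and C bases among all bases) and ignores the spaces and line breaks used for the 10-base grouping; `gc_percent` is a hypothetical helper name, not a function of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (10-base grouping and line breaks) is ignored and the
    input is upper-cased before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, exactly as printed in the record.
first_line = "GTGCTCAAGC CCTACCGCGA GATCTTCGCC GCCCCAGGCA GTCTGCGCTT CAGTCTCGCC"
print(len("".join(first_line.split())))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as one string (line breaks included) gives a value that can be compared with the reported 72%.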
 
Protein sequence
MLKPYREIFA APGSLRFSLA GFVARMPQSM LPIGLVAMLS ELRGQYGLAG AVSATFTLSM 
ALLSPLVSRL VDRHGQRRIL VPAMAVSAAS ITGVLLSAHF GAPAWTLFAF AVPAGTLPTM
SAMVRARWTE IYRGSDAMTT THSFESVVDE LTYVTGPAVS ILLSTAVFPQ AGPLLAIVLL
VAGVAAFAVQ RRTEPAPRPA ETGGGSAIRR APLRLLVFVL FAGGVVVGTV DVVSVAFAEQ
QGITAAAGIV ATCYALGSGI AGLAFGAWKP RIALPKQLVI GAAGTAATTL PFLLAADIAS
LSAAVFVAGA FFAPTMIIVM SLIEKLVPPA QLTEGLTWAA TGVSIGMAAG AGASGFVVDA
FGATTGFTVA LCGGVLALAA ATVGLRMLTR ALRSRPGATE RMPVTSAAG
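
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is translated as methionine. The following is a minimal, dependency-free sketch of that rule; `translate_cds` is a hypothetical helper written for illustration, and a maintained library would normally be used instead.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met.

    Whitespace is ignored. Translation stops at the first stop codon;
    a trailing incomplete codon is ignored.
    """
    bases = "".join(dna.split()).upper()
    if bases[:3] not in START_CODONS:
        raise ValueError(f"{bases[:3]!r} is not a table 11 start codon")
    protein = ["M"]  # an initiating codon is translated as methionine
    for i in range(3, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate_cds("GTGCTCAAGC CCTACCGCGA G"))  # MLKPYRE
```

The output matches the first seven residues of the protein sequence above (MLKPYRE).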