Gene Snas_4174 details


Gene Information

Locus tag: Snas_4174
Symbol:
ID: 8885375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4470457
End bp: 4471701
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003512918
Protein GI: 291301640
COG category:
COG ID:
TIGRFAM ID:
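
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 4470457, 4471701

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1245, matching "Gene Length"
print(protein_length(length_bp))  # 414, matching "Protein Length"
```

Under these assumptions, 1245 bp corresponds to 415 codons, of which 414 encode amino acids and the last is the stop codon.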


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.176115
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.788258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAA CCCCCACCGG CCTGCTGCGA CGCAACCGCG ACTTCCGCAA CGTATGGGCC 
GCCGACGCGC TCAGCACCGT CGGCACCCGG TTGAGCATGC TGGCGTTGCC ACTGTTGGCA
CTGCTGACCC TGGACGCGAC CCCGTTGGAG GTCGCGCTGC TGCGGACCTT GGAGACCCTG
GCCTGGCTGG TCCTCGGTCT GGTCGCGGGG GCCTGGATGG ACCGGATTCG CTGCCGGGGC
GTCCTCATCG CCGCCGATCT GGGACGCGCG GCGGTGCTCG CGTCGATCCC GATCGCCTAT
CTGGCCGGGG TTTTGACGTT GACCCAGTTG TTCGTCGTCT CGCTGCTGGC CGGGATCGGC
AAGGTGTTCT TCGGCGTCGC CGCCACCACC TACCTGCCCC GGCTGCTGCC GAAGGAGGAC
CTCGTCGACG CCAACGCCAA GCTGGCCACG AACCTGTCGA TGGCCGCGGT GCTGACCAGC
GGTGGCGGCG GCTTCGTCAT CCAGTGGCTC ACGGCGCCGA TCGCCATCGC CGTCGACGCG
GCCAGTTTCG TGTGGTCGGC GCTGTGGCTG CGCGGCATCC GCAAGGTCGA AACCGTTCCG
CGACAGGACA ACCCGCCGCA CCTGCGCCGC GACATCGCCG AGGGCTGGCG GTTCGTGGTG
GGGCATCCGC TGTTGCGCGC CCTGGCGGGC CTGTCGGTGT GCACCGTGTT CTTCCAAGCC
GTGCACGACG CCGTCTGGAT CACGTTCCTG GTGCGGGAGT TCGGGCTGTC GGCCGGGGTC
ATCGGCCTGT TGGGAATGTC GGGCCTGCTG GGCGCGGTGC TGTCGGGTTT CGTGACGTCC
CGCATCGCGC GACGGCTCGG CAACGTCCGC GCCGCGGTGG CCGCCGCGGT GTGTTTCGCG
GTGGGGTTCG CGCTGTTCCC GTTCACGCTG CCGGGCTGGG GGCTCAGCGT CGCCGTCGTC
GCGGGCTTCA TGGTCAGCTT CTCCATCATC ACCTTGAGCG TGATGCGCTC CAGCATCCGG
CAGCTGCTGT GCCCGGAGCA CCTTTATGGA CGCGTCGGGG CCACCATGGA GTTCATGATC
TGGGGCACCA TGCCGCTGGG CAGCCTCGCC GGTGGTCTCA TCGCGACTGT GACCGACCTG
CGCACGACCC TGTGGATCGT CGGCGCCGGG ACACTGCTGT CGCTGTTGTG GATCGTGCTG
TCGCCGTTGC GGACGACGCG CGAGATCGAG CTGGCGGCCG TTTAG
 
Protein sequence
MTETPTGLLR RNRDFRNVWA ADALSTVGTR LSMLALPLLA LLTLDATPLE VALLRTLETL 
AWLVLGLVAG AWMDRIRCRG VLIAADLGRA AVLASIPIAY LAGVLTLTQL FVVSLLAGIG
KVFFGVAATT YLPRLLPKED LVDANAKLAT NLSMAAVLTS GGGGFVIQWL TAPIAIAVDA
ASFVWSALWL RGIRKVETVP RQDNPPHLRR DIAEGWRFVV GHPLLRALAG LSVCTVFFQA
VHDAVWITFL VREFGLSAGV IGLLGMSGLL GAVLSGFVTS RIARRLGNVR AAVAAAVCFA
VGFALFPFTL PGWGLSVAVV AGFMVSFSII TLSVMRSSIR QLLCPEHLYG RVGATMEFMI
WGTMPLGSLA GGLIATVTDL RTTLWIVGAG TLLSLLWIVL SPLRTTREIE LAAV