Gene Snas_5214 details

Gene Information

Locus tag: Snas_5214
Symbol:
ID: 8886423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5538768
End bp: 5539979
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003513942
Protein GI: 291302664
COG category:
COG ID:
TIGRFAM ID:
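
The length fields above are consistent with the start and end coordinates. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5538768, 5539979)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```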


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.833951
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.607033
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCATGA TGAACAGTAA GACAAGCGCG ATCAACGGCC ACACCACCGC GCGACCGGGC 
GCGGGCACCA ACCTGGCACT GGCGACGGTG GCGTTCGCGG TGACGTTCTG GGCCTGGAAC
CTGGTCGGTC CACTGTCTAA GACATACACC GACGCGCTCG ACCTGACGCC GACGCAGACG
TCCATTCTGG TGGCGTTTCC GGTGCTGGTC GGTTCGCTGG GCCGCATCCC CGTCGGCGTG
CTGACCGACC GCTACGGCGG CCGGATGATG TTCACCGTCA TCTGCTTCGT CAGCATCATC
CCGACGCTGC TGGTGGGGCT GTCGCACGGT TCGTTCACGG GACTGCTGCT GTGGGGGTTC
TTCCTGGGTA TCGCCGGGAC CTCGTTCGCG GTCGGCATCC CGTTCGCCAA CGCCTGGTTC
CCGCCGACGC GGCGCGGCTT CGCCACCGGC GTGTTCGGCG CGGGCATGGG CGGCACCGCG
CTGTCGGCTT TCCTGACGCC GCAACTGGTC TCGGCCTTCG GGCTGCTGCG CACCCACCTG
GTGATGTGCG CGGCGCTGGC CGTCATGGGC GCGGTCATGT GGCTGTTCGC CCGCGACAGC
CCCGACTGGC GGCCCAGCAC CGAGGCGGCA CTGCCCCGGA TCCGTGACGC ACTCAAGATC
AAGGCCACCT GGCAGCTGTC GCTGCTCTAC GCGGTGGCCT TCGGTGGTTT CGTGGCCTTC
TCCACCTACC TGCCGACGCT GTTGACCATC TCCTACGAGT TCGTCCAGAC CGACGCGGGC
ATGCGCGCCG CCGGGTTCTC GCTGGCGGCC GTCGTGGCCC GCCCGGTCGG CGGCATGCTG
TCCGACCGGA TCGGGCCGGT CAAGGTCTGT CTGGCCTCGT TCTTCGGCGC GACCGGCATG
GCGGTGGTGC TGTCGTTCCA TCCGCCCGCC GAGATCCCGG CCGGGACGTC GTTCGTGCTG
ATGGCCGTGG CGCTGGGGCT GGGAACCGGC GGCGTGTTCG CACTGGTCGC CAAGCTCGTC
GAACCGGCCC GGGTCGGCAC CGTCACCGGC CTGGTCGGTG CCGCGGGTGG CCTGGGCGGC
TACTTCCCGC CGCTGTTGAT GGGCGTCATC TACCAGGCCA CCGGCGACTA CGTCATCGGC
TTCTGGCTGC TGGCCGTCAC CGCGCTCCTG GTGGGGCTGT TCACGATGCG GGTGTTCCGG
CAGGTGCGCT GA
 
Protein sequence
MSMMNSKTSA INGHTTARPG AGTNLALATV AFAVTFWAWN LVGPLSKTYT DALDLTPTQT 
SILVAFPVLV GSLGRIPVGV LTDRYGGRMM FTVICFVSII PTLLVGLSHG SFTGLLLWGF
FLGIAGTSFA VGIPFANAWF PPTRRGFATG VFGAGMGGTA LSAFLTPQLV SAFGLLRTHL
VMCAALAVMG AVMWLFARDS PDWRPSTEAA LPRIRDALKI KATWQLSLLY AVAFGGFVAF
STYLPTLLTI SYEFVQTDAG MRAAGFSLAA VVARPVGGML SDRIGPVKVC LASFFGATGM
AVVLSFHPPA EIPAGTSFVL MAVALGLGTG GVFALVAKLV EPARVGTVTG LVGAAGGLGG
YFPPLLMGVI YQATGDYVIG FWLLAVTALL VGLFTMRVFR QVR
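
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute a GC fraction, and translate the gene sequence is given below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sketch assumes the sequence contains only unambiguous A/C/G/T bases and is read in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in its permitted start codons), so the standard assignments are used here.

```python
# Codon-to-amino-acid assignments shared by the standard code and table 11,
# laid out in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped."""
    seq = clean(seq_text)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGTCCATGA TGAACAGTAA GACAAGCGCG"))  # MSMMNSKTSA
print(translate("CAGGTGCGCT GA"))                     # QVR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` can be compared against the reported GC content. A base outside A/C/G/T raises a `KeyError` in this sketch rather than being translated to an ambiguity symbol.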