Gene Snas_3084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3084 
Symbol 
ID8884283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3255203 
End bp3256450 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003511848 
Protein GI291300570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.11728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.436987 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACC CCTTCACCCC GAGGGCCGAG CCCGCCCTGG TCCGTGCCCG CGTCGGCGTC 
TTCGGCTACT TCGCGACCTC GGGCTTCGTC ATGGGCACCT GGGCCGCGGG CCTGCCCGCC
GTCGACGAAC GGCTCCACCT CGGCCCCGGA CGCCTGGGCA CCGCCCTGCT GCTCATCGCC
GGGGGAGCAC TGGTGTCCAT GCTCGTCGTC GGGCGCGTCA GCGACCGGTT CACCTCCCGC
GTCGTCGCGC GGATCTCCGG GCCCGTCGCC GCCCTGCTGC TGTTGGGCCC GGTACTGGCC
CCGTCCTATT CGTGGCTGCT GATCTGCTCG GCCGTCTACG GCATCAGCGT CGGCTTCATC
GAGGTGTCGA TGAACGTCAA CTCCATCGAG GTCGAAGTCC GCTATGGACG CCCGATCGTG
TCGGCCTTCC ACGGCCTGTG GAGCCTGGGC GGCGCGGCCG GGGGAGCGCT GACCACCGCC
GGACTGCACG CCCACCTCGA CCCGCAGGCG ATGCTGATCG GCTTCATCCT GCTGTCCACC
GTCGCGTTCG GGTACTTCGG ACGGATGCTG CTGCCGCCCC CGTCGCGACC CGAACCCGAC
CCGGCCACGG CCGGTGCGAA ACCGGGACGC GGCCTCGGGA TCGGCATGGG AATCGTGCTG
CTGCTGGGGA TCGTGGCCTT CGGCGGCCAC CTGGCCGAGG GCGCCGCGAT CGACTGGGCC
GCCATCCACG CCCGCCGGGT GCTGGACACG CCGCTGTCGA ACGCGCCGAT CGCCTACACC
GTCTTCGGCA CCGCCATGAC CCTGGTGCGA CTGGCGGGCG ACCCGATCCG GTCCCGGCTG
GGCCCGGGCC GGACGCTGCT GCTGGCCGGG GTGCTGTCGA CCGCCGGATA CGGGCTGGTG
CTGCTGTCAC CGGTCGCCGG GGGCGCGGGA CTCGTCGTGG CCTGCGTCGG CTGGGCGCTG
ACCGGCATGG GACTGGCCAC GGTGGTCCCG GTGGTGTTCT CCGCGATCGG GGCCGCGCAC
GGAGCCGTGG GCAAGGCGCT GTCACTGGTG ACCGTGTTCG GCAGCGCCGG ACTGCTCGTC
GGCCCGGCCG TCATCGGCCA CCTCGCCGAG GCGACCAGCC TGCCGACGGC GCTGATCGTG
CCCGCGGTGC TGGCGGCGGT GGTGGCGTTG GCCGGTCCGT CCGCGATCAA GGCGCTGGGC
CTGGGCCGCA CGACCCGACC GGCGGAGCCG GTCGCCGAGC CGGTCTGA
 
Protein sequence
MRNPFTPRAE PALVRARVGV FGYFATSGFV MGTWAAGLPA VDERLHLGPG RLGTALLLIA 
GGALVSMLVV GRVSDRFTSR VVARISGPVA ALLLLGPVLA PSYSWLLICS AVYGISVGFI
EVSMNVNSIE VEVRYGRPIV SAFHGLWSLG GAAGGALTTA GLHAHLDPQA MLIGFILLST
VAFGYFGRML LPPPSRPEPD PATAGAKPGR GLGIGMGIVL LLGIVAFGGH LAEGAAIDWA
AIHARRVLDT PLSNAPIAYT VFGTAMTLVR LAGDPIRSRL GPGRTLLLAG VLSTAGYGLV
LLSPVAGGAG LVVACVGWAL TGMGLATVVP VVFSAIGAAH GAVGKALSLV TVFGSAGLLV
GPAVIGHLAE ATSLPTALIV PAVLAAVVAL AGPSAIKALG LGRTTRPAEP VAEPV