Gene Snas_5058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5058 
Symbol 
ID8886265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5368515 
End bp5369792 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content68% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003513787 
Protein GI291302509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.796866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTGGG CCATCATCTG GCTGGCCTTC GTCGGCCTGA CCGTCAACTA CCTCGACCGC 
GCCAGCATCA GCGTGGCGCT GCCGTTCCTG GGCGAGGACA TCCACCTCAC CAAGACCCAG
GAGGGCCTGA TCCTGGCGGC CTTCTTCTGG ACCTACGACT TCTTCCAGCT CGCCGCCGGT
TGGTACGTCG ACAAGGTCGG GCCCCGCCGG GCCTTCACCA TCGCCGCGGT GTGGTGGTCG
ATTTTCACCG CCGTCACCGC CGCCGTCCAC AGTTTCTGGA CGCTGGTCGC CGCCCGGCTG
CTGCTGGGCG CCGGGGAGAG CCCCGCCCCG GCCACCTCGG CGAAGGTGGT CGCCACCTGG
TTCCCGAAAC GGGAACGCGG CCTGGCCACC GGCATCTGGG ACTCCGGCTC CCGCGTCGGC
GCCGTCATCG CGATCCCGCT GGTCACCGGC ATCATCGCGC TGGCCGGGTG GCGGGTCACC
TTCGTCATCA TCGGCATCCT CGGCCTGGCC TGGGCTCTGG GCTGGTGGAA GGCGTACCGC
GACCCCGCCC AGCATCCGAA GGTCTCCGCC GCCGAACTGG AGTACATCAA CGAGGGCGGC
GCCCGCACCG CCGACAACGA CGCCGAGGGG GCGGCGAAAC TGCCGTGGCG CAAGCTGTTC
GGCTACCGCA CCGTGCGCGG CATGATGCTG GGCTTCTTCT GCCTCAACAG CGTCATCTAC
TTCTTCATCA CGTTCTTCCC CAGCTACCTT GTGGACGAAC GCGGCTTCAG CCTGCTCAAG
CTCGGCTTCT TCGGGATGAT CCCGGGAATC TGCGCCGTGG TCTCAGGCTG GCTGGGCGGC
TGGGTCGCCG ACCGCGCCAT CCGGCGCGGC GTCTCGGTCA CCCGGGTGCG CAAGACCGTC
ATCGTGGTCG GGATGGTCGG CGGCTCGGTC ATCATTGCCG CCGTCCTGGT GCCGCAGGCG
TGGATGGCGC TGGCGCTGCT GTCGCTGTCC TACGCCTCGC TGACCTTCGC CGGAACCGGC
ATCTGGTCGC TGCCCGCCGA CGTGGCCCCC AGTTCGGCGC ACGTGGCCTC CATCGGCGGC
ATCCAGAACT TCGCGTCCAA CTTCGCCGGA ATCCTCACCC CGATCATGGT CGGATACCTG
GTGGACACCA CCGGATCCTT CGTGATCCCC CTGTCCGTCA TCGGCGGGAT CGCACTGCTG
GGCGCCCTGA ACTACCTGTT CGTCGTCGGC AGGATCGAAC CGCTGCCGGT CCCGGCCGCC
GCCATCGCCA AGTCTTAA
 
Protein sequence
MRWAIIWLAF VGLTVNYLDR ASISVALPFL GEDIHLTKTQ EGLILAAFFW TYDFFQLAAG 
WYVDKVGPRR AFTIAAVWWS IFTAVTAAVH SFWTLVAARL LLGAGESPAP ATSAKVVATW
FPKRERGLAT GIWDSGSRVG AVIAIPLVTG IIALAGWRVT FVIIGILGLA WALGWWKAYR
DPAQHPKVSA AELEYINEGG ARTADNDAEG AAKLPWRKLF GYRTVRGMML GFFCLNSVIY
FFITFFPSYL VDERGFSLLK LGFFGMIPGI CAVVSGWLGG WVADRAIRRG VSVTRVRKTV
IVVGMVGGSV IIAAVLVPQA WMALALLSLS YASLTFAGTG IWSLPADVAP SSAHVASIGG
IQNFASNFAG ILTPIMVGYL VDTTGSFVIP LSVIGGIALL GALNYLFVVG RIEPLPVPAA
AIAKS