Gene Sare_4586 details


Gene Information

Locus tag: Sare_4586
Symbol:
ID: 5705175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5205791
End bp: 5207071
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID: 641273995
Product: major facilitator transporter
Protein accession: YP_001539342
Protein GI: 159040089
COG category:
COG ID:
TIGRFAM ID:
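
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5205791
end_bp = 5207071

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1281 426
```

Both results agree with the Gene Length and Protein Length fields.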


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.101779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00026814
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGGGATG TTATCGGGCA ATTCGGCTCC TTCGGCCGGC CGGTACGCCT GCTTTTGATC 
AACCAGTTCG GCATCAATCT CGGCTTCTAC ATGCTGATGC CGTACCTCGC GAACTACCTG
TCGGCGCAGC TGCTGCTCGC CGGCTGGATC GTGGGGCTCG TCCTGGGCGT ACGCAACTTC
AGCCAGCAGG GCATGTTCGC CGTCGGCGGA AGCCTCGCCG ACCGGTTCGG GTACAAACCG
TTGATCGTCG CTGGCTGCAC CCTGCGTACT GTCGGCTTTG CCCTTCTCGG GCTGGTCGAT
TCCGTGGCTG TCCTCGTCGT CGCAGCGGCA GCCACCGGTC TGGCCGGGGC GCTGTTCAAC
CCAGCGGTAC GCGCCTACCT GGCTGAGGAC GCCGGCGAAC GGCGGGTGGA GGCGTTCGCC
GTGTTCAACG TCTTCTACCA GGCCGGCATC CTGGTCGGCC CGTTGGTCGG CCTTGCGCTG
ACAGCGGCCG ACTTCCGCCT GACGTGCCTC GTCGCCGCCA CCGTGTTCGC CATCCTGACC
GTCGCACAGG CTCGCGCACT ACCTGCTCGT GGTGGGCGAA GGGGCGAGCC AGGCGGCGGT
GGGTTGGTCA GCGGGATGCT GACTTCCTGG CGGCAGGTTC TGCACAACCG ACCGTTCATG
CTGTTCACCG TGGCGATGGC CGGCTCCTAC GTGCTGTCGT TTCAGGTCTA TCTCACGCTG
CCCCTCACCA TCGAAGCGAA CGCTCCCAGC CCGGCGGTGG AGAAGCTCGC CGTCGCGGCC
ATGTTCGCGG CCTCAGGTCT GCTTGCCGTC CTCGCACAGA TCCGGGTCAC CGCCTGGTGC
CGGCGACGCT GGTCACCACA CAGCTCACTC ACCTGGGGAG TCGCGCTGCT GGGCGCGGCG
TTCCTACCTG TAGCCGCGGC GAGTACCGGA CCACCCATGC TTGTCGTCGC GTCGGCCGTT
CTCGCCGGAG CGATCATAGC CGTGGCGACG ATGGTGGCTT ACCCGTTCGA AATGGACATG
GTGGTCACAT TGGCTGGTGA AGGTCGAATC GCCACGCACT ATGGGATCTA CAGCACCGTT
TCCGGTGTCG CTGTCGCCAT CGGGAACATG CTCGCTGGCG GCGCAGTCGA CTGGGCGCGA
TCTACCCAGA TGCCCATCGT GCTCTGGCTA GCGCTCGCGG CGCTCGGCGC GGCCTGCAGC
ACGGCAATTC TGTGGTTGGG CAGGCGAGGC TTGTTCGGGC CTGCGGTTCC CACGAAACAA
GCGACGACTG TCACCGCCTG A
 
Protein sequence
MRDVIGQFGS FGRPVRLLLI NQFGINLGFY MLMPYLANYL SAQLLLAGWI VGLVLGVRNF 
SQQGMFAVGG SLADRFGYKP LIVAGCTLRT VGFALLGLVD SVAVLVVAAA ATGLAGALFN
PAVRAYLAED AGERRVEAFA VFNVFYQAGI LVGPLVGLAL TAADFRLTCL VAATVFAILT
VAQARALPAR GGRRGEPGGG GLVSGMLTSW RQVLHNRPFM LFTVAMAGSY VLSFQVYLTL
PLTIEANAPS PAVEKLAVAA MFAASGLLAV LAQIRVTAWC RRRWSPHSSL TWGVALLGAA
FLPVAAASTG PPMLVVASAV LAGAIIAVAT MVAYPFEMDM VVTLAGEGRI ATHYGIYSTV
SGVAVAIGNM LAGGAVDWAR STQMPIVLWL ALAALGAACS TAILWLGRRG LFGPAVPTKQ
ATTVTA
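
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a rough sketch rather than the database's own method: `translate` is a hypothetical helper, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons. Alternative start codons are not handled, which should not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 uses the same
# amino acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Hypothetical helper: alternative start codons (e.g. GTG, TTG) are
    translated by the table as-is, not as methionine.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, spaces included as in the listing.
print(translate("ATGAGGGATG TTATCGGGCA ATTCGGCTCC"))  # MRDVIGQFGS
```

The printed residues match the first ten residues of the protein sequence, and whitespace is stripped so the blocked layout of the listing can be pasted in directly.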