Gene Sare_3912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3912 
Symbol 
ID5704985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4452464 
End bp4453861 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content70% 
IMG OID641273337 
Productmajor facilitator transporter 
Protein accessionYP_001538694 
Protein GI159039441 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.31796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTCCG ACGAGCAACC TTCTACGGAG GGCCCGGCCA CCTTCCGCGA GGTGTTTGCG 
CAGCGCGAGT ACCGGTTTGT CTTCACGGCA GGCACCCTTG TCTGGATCGG CGACTACATC
GCGAAGGCCG CGGTCACCCT CCTGGTCTAC CGGGAGACCG AGTCGGTGGC GCTCTCCGCG
GCGGCATTCG CCATCAGCTA CCTGCCCTGG CTGATCGGTG GTCCGCTGCT CGCGACGCTC
GCCGAGCGGC ATCCCTACCG GCAGGTGATG GTCGCCTGCG ATCTGGTGCG GATGGCACTG
CTGCTGGTCA TCGCCATGCC CGGCATGCCG ACCCCGGTGA TCCTGGTACT CCTGTTCGCG
GCCAACCTCG CCAATCCGCC AAGTCAGGCC GCCCGGTCGG CACTGCTGCC ACTGATCCTG
ACCGGTGACC GCCTCGTGGT CGGGTTGTCG GTCAACGCCA GCGTCGGCCA GGCGGCGCAG
GTCGTGGGCT ACCTGGCCGG TGCGGCGATC GCGGCAGCCA ACCCCACCGT CGCCTTGCTG
GTCAACTCCG CGACCTTCGC GGTGTCGGCC ACGCTGGTGC GTCTCGGGGT GGCGGCCCGA
CCGCCGGCGA TGAAGGCCGC ACACCGCAGC CACCTGATAC GGGAAACCGG TGAGGGCTTC
CGCATGGTGT TTGGCCAGCC GGTGCTGCGC GCCATCGCGA TCCTCGTGTT CAGCGTCATG
CTTTTCTCCA TCGTCCCGGA GGGGCTGGCG GCGGCATGGG CCGGCCGGCA CGCCGAGGAC
ATGAGTCAGG GCCTCGCACA GGCGGTCATC ATGGCCGCCA ACCCGGTCGG TTTCATCCTT
GGCGGCCTGA TCGTCGGCCG CTTGGTCGCA CCGGCCCGAC GGATCGCCCT GATGCGCCCG
TTGGCGGTGC TCGCTCCGCT GGTCTTGGTT CCCGCCCTGC TCGACCCGTC GCCGCTGGTG
GTGGCGCTGC TGGCGACCGG GTGCGGGTTC GCCGTCGCCG GGATGCTGCC GACGGCGAAC
GGTCTGTTCG TACGGGTCCT TCCGGACGGC TTCCGGGCCC GTGCCTTCGG CGTCATGGCG
TCCGGCATCC AGGTCATCCA GGGTTTGGCG GTGCTGGTGA CCGGGCTGCT TGCCGAGCGG
TTCTCCATCC CGGTCGTGGT CGGTCTCTGG AGTAGCGCCG GGGTACTCCT GATGACCGTC
GCGGCGCTGA CCTGGCCGAG CCAGGCGACC GTTGACGCCT CGATCAACGC GGCTCGGCGG
CACGGCTCAC CCGACCCGGT GAGAAGTCCG TCGGGCAGCT CCCAGCCGGA GCCGGACCGC
CCGGCCGGCA AGCCCGACGA CGGCCCGGGC GGGCACCCGC CGGGCTCCCC TGACCCGCAT
CGGCACGCGG TGACCTGA
 
Protein sequence
MVSDEQPSTE GPATFREVFA QREYRFVFTA GTLVWIGDYI AKAAVTLLVY RETESVALSA 
AAFAISYLPW LIGGPLLATL AERHPYRQVM VACDLVRMAL LLVIAMPGMP TPVILVLLFA
ANLANPPSQA ARSALLPLIL TGDRLVVGLS VNASVGQAAQ VVGYLAGAAI AAANPTVALL
VNSATFAVSA TLVRLGVAAR PPAMKAAHRS HLIRETGEGF RMVFGQPVLR AIAILVFSVM
LFSIVPEGLA AAWAGRHAED MSQGLAQAVI MAANPVGFIL GGLIVGRLVA PARRIALMRP
LAVLAPLVLV PALLDPSPLV VALLATGCGF AVAGMLPTAN GLFVRVLPDG FRARAFGVMA
SGIQVIQGLA VLVTGLLAER FSIPVVVGLW SSAGVLLMTV AALTWPSQAT VDASINAARR
HGSPDPVRSP SGSSQPEPDR PAGKPDDGPG GHPPGSPDPH RHAVT