Gene Sare_0141 details


Gene Information

Locus tag: Sare_0141
Symbol:
ID: 5706581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 153113
End bp: 154549
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 74%
IMG OID: 641269667
Product: major facilitator transporter
Protein accession: YP_001535067
Protein GI: 159035814
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
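
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 153113
end_bp = 154549
gene_length_bp = 1437
protein_length_aa = 478

# Assumption: coordinates are inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is read in triplets and the last codon is a stop,
# which is not translated into an amino acid.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # prints "1437 478"
```

Under these assumptions the computed values agree with the reported gene length (1437 bp) and protein length (478 aa).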


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000428453
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGCAGGCCC TCGTCATCGT GACCCCGTCG CTGCTGGCAC GATTCGCGAG CACAGCTACC 
ACGGGTGACG CCGATGCCGG TCCCGCCGGA CTGGCTAGGC TGATCTTCAC AATGACTCGG
ACCGCACACC GCCACCGGAC GCCGTTGCTG CTGGTCGAAG CGGCGACGCT GCTCTCGGCG
ACCGGCAACG GCGTGGCCAT CGTGGCACTG CCCTGGCTGG TGCTGGAGCG CACCGGCAGC
GCCACCGCGG CCGGCGTCGT GGCGGCGGCC AGCGGGCTAC CGCTGCTGCT GTCCAGCCTC
CTCTCCGGCA CGGTCGTCGA CCTACTCGGA CGCCGCCGGA CCGCGCTGGC CTCCGATGCC
CTGTCCGCCA TCTCGGTGGC CGCGATCCCG CTCGTCGACA CCCTGCTCGG GCTGAACCTC
GGCTGGATCG TCGCGCTGGC CGTCCTCGGC GCGGTCTTCG ACCCAGCCGG GATGACCGCG
CGGGAGACGC TGCTGCCCGC AGCCGCGCAG GCCGCCGGGT GGCGGATCGA GCGAGCGAAC
GGCATACACG AGGCGATCTT CGGACTGGCA TTCCTGATCG GCCCGGGGCT CGGCGGCCTG
CTCATCGCCA CGGTCGGTCC GGAGGCGACG TTCTGGGTCA CCGCCGCCGG CTTCGCACTG
TCCGTCCTCC TGATCGCCGC CGTCCGGCTG CCCGGCGCGG GCCGACCCGA ACGCCCGCCC
AACGGCCTGT GGCGGGGCAC CCAGGAGGGC CTGGTCTTCG TCTGGCGGGA TCCCCTGCTG
CGCACCATCG CCCTGATCAC GATGGTCCTG GTCGCGCTCT ACCTACCGGT CGAGGGGGTC
CTGCTGCCCG CCTGGTTCGT CGCCGAGGGG GAACCCGCCC GCCTCGGTGC CGTCCTGATG
GCGATGAGCG CCGGGGCCGT GGCAGGTGCG CTGGGCTCGT CGGCGGCCGG CCGGTTCGTC
CGCCGCCGGC ACCTGATGGC CGTCGCCCTC GTGGTCACCG GCGTGGCCCT GCTCGGGCTC
GCGTTCCTAC CGCCATATCC GGCGATGCTC GCCTTCGCCG TCCTGGTCGG GCTCGCGTAC
GGACCGGTCA ACCCGCTGGC CAACTACGCC ATGCAGACCC GCACCCCGGA GCGGCTACGC
GGCCGGGTGG TCGGGGTGAT GACGTCGTTC GCGTACGCCG CCGGGCCGGC CGGCTACCTG
CTGGCTGGCC CCCTGGTGGA GTGGTTGGGC CTGGCCACGG CGTTCCTGGT GCTCGCCGGT
GCGCTGCTGG TGACCGCGCT GGCCGCCGCG CCGCTGCCGG TGCTGGCGGC GTTGGACGAA
CCACCCCGGT ACCCACCCGC ACCACCCGGC GGGACCTCGG GTCGCAGTGA GGGACCGGTG
CCGCTGGGTG AACAGTGGTT GCCCGCCGCC CACCGCGACC CGCCAGCGAT CGGCTAA
 
Protein sequence
MQALVIVTPS LLARFASTAT TGDADAGPAG LARLIFTMTR TAHRHRTPLL LVEAATLLSA 
TGNGVAIVAL PWLVLERTGS ATAAGVVAAA SGLPLLLSSL LSGTVVDLLG RRRTALASDA
LSAISVAAIP LVDTLLGLNL GWIVALAVLG AVFDPAGMTA RETLLPAAAQ AAGWRIERAN
GIHEAIFGLA FLIGPGLGGL LIATVGPEAT FWVTAAGFAL SVLLIAAVRL PGAGRPERPP
NGLWRGTQEG LVFVWRDPLL RTIALITMVL VALYLPVEGV LLPAWFVAEG EPARLGAVLM
AMSAGAVAGA LGSSAAGRFV RRRHLMAVAL VVTGVALLGL AFLPPYPAML AFAVLVGLAY
GPVNPLANYA MQTRTPERLR GRVVGVMTSF AYAAGPAGYL LAGPLVEWLG LATAFLVLAG
ALLVTALAAA PLPVLAALDE PPRYPPAPPG GTSGRSEGPV PLGEQWLPAA HRDPPAIG
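
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how a GC content figure like the one in the Gene Information section could be computed from such text follows. It assumes GC content means the fraction of G and C bases over the total sequence length; the function name `gc_fraction` is hypothetical and not part of any database tool.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G/C bases in a whitespace-grouped DNA string."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small example.
example = "GTGCAGGCCC TCGTCATCGT"
print(round(gc_fraction(example), 2))  # prints 0.65
```

Passing the full gene sequence text to the same function would give the whole-gene value for comparison with the reported GC content.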