Gene Strop_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1823 
Symbol 
ID5058282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2086167 
End bp2087561 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content74% 
IMG OID640474093 
Productmajor facilitator transporter 
Protein accessionYP_001158663 
Protein GI145594366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000162964 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.991228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCC TCTCGGTCCA TCAGGTCCGC CACCGCTATC TCATCCTGCT CGGCCTGCGC 
TGGCTGCCGG TCGGCCTGCT GGTTCCGGTC TTCGTCCTGC TGATGCAGGA GCGGGGCCTC
ACCCTTGCCG AGATCGGCAT GGCGTTCGCG GCCCAGGGTG TCCTCGTCCT CGCGCTCGAG
TTGCCGACCG CCGGGCTCGC CGACGCCGTG GGACGCCGGC CGGTGCTGGT GCTCGCCGCG
GTGGTCAACC TCGCCTCCGT GGCGCTCTTC GTCGTGGCCG ACTCGTTCGC CCTACTGGTG
GCGGTGTGGG CGTTGCAGGG GGTCTATCGG GCGCTGGACA GCGGCCCGCT CGAGTCCTGG
TTCGTGGACA GCACCCTCGC CGCCGACCCT GAGGCCGAGT ACGAGCGGGG CCTGAGCCAG
GGCGGGACGG CGGTCGGCGT CGCCATGGCG GCCGGTGCCC TGCTCAGCGG CGGGCTGGTC
GCGTTCGGCC CGGTCGGCCC GTTCAGCGCG TTGACCATGC CGATCGTGGC CATCGTGGTG
TTGCAGGTCG TCGGGCTGGT GGCCCTCCTC GCCCTGCTGC GCGAGCGTCG TCCCACCGCC
GGGTCGTCGG CGCTACGCGA GTCGATCGCG GAGGCCCCCC GGATGATCGG CCAGGCGATG
GGGCTGCTGC GCCGCTCCCG GGTGCTGCTC GCCCTGCTCT CGGTCGAGCT CTTCTGGGGC
TTCGGCATGG TCACTTTCGA AGCTCTCCTG CCGGTCCGAC TCGCCGAGGT GCTCGGCGAC
GCCGATCGGG CCGGGGCGCT GCTCGGGCCG GCGACCATGG TGGCCTGGCT CGCGAATGCC
GGCGGCGCGG CGGCGACCCC GCTGCTCACC CGGCGACTGG GCGCCGCGCC CGCCGCGGCG
CTGCTGCGGA TCGCGCAGGG CGTCACCGTG GTCGGGATGG CGCTGCTCGC CGGTCCGGTC
GGCGTGCTGG TCGCCTTCGT CGTGTGCTAC GTCGTGCATG GCGCCTCGAA CCCGTTGCAC
ATGGGGCTGG TGCACCGACA GGTCGACGGG CCGTACCGGA CCAGTGTGGT GTCCCTGAAC
TCGATGGTGT CCATGCCCGC CGGTGCCGCC GGGGGGATCG TGCTCGGCCT GGTCGCCGAC
CGTGCCGGCG TCAGCGTCGC GATGCTGGTC GGCGCGGTCG TGCTCGTGAT TGCCGCGCCG
CTGTACCTGC CCGCCTGGCG GGCGAGCCGG GACGGTTCTC CCGCCCCGGT CGGTGCGCCA
ACGGCCGCCG CCTCCCCCGA GCCCCCGGAA CCGGCGACCC AGCCCGACAC CTCGGCGCCG
GTCAGTGGGC AGCCCGGCGT GCCCCCGGCT CGCGACGGTG ATCCGACAAC CGGCGCGGGC
GCGGCACGGC GCTGA
 
Protein sequence
MATLSVHQVR HRYLILLGLR WLPVGLLVPV FVLLMQERGL TLAEIGMAFA AQGVLVLALE 
LPTAGLADAV GRRPVLVLAA VVNLASVALF VVADSFALLV AVWALQGVYR ALDSGPLESW
FVDSTLAADP EAEYERGLSQ GGTAVGVAMA AGALLSGGLV AFGPVGPFSA LTMPIVAIVV
LQVVGLVALL ALLRERRPTA GSSALRESIA EAPRMIGQAM GLLRRSRVLL ALLSVELFWG
FGMVTFEALL PVRLAEVLGD ADRAGALLGP ATMVAWLANA GGAAATPLLT RRLGAAPAAA
LLRIAQGVTV VGMALLAGPV GVLVAFVVCY VVHGASNPLH MGLVHRQVDG PYRTSVVSLN
SMVSMPAGAA GGIVLGLVAD RAGVSVAMLV GAVVLVIAAP LYLPAWRASR DGSPAPVGAP
TAAASPEPPE PATQPDTSAP VSGQPGVPPA RDGDPTTGAG AARR