Gene Strop_1095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1095 
Symbol 
ID5057542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1241943 
End bp1243184 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content73% 
IMG OID640473362 
Productmajor facilitator transporter 
Protein accessionYP_001157944 
Protein GI145593647 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.522817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.816365 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGG AGGTACGGAC CAACGTGAAC CTGAAGCCTT ACCGGGCGGC GCTCGCCCTG 
CCCGGTCTCC GGGCCCTACT GATCGTGGCG GTGCTCGCCC GGATACCGCT CACCACGATC
GGTCTGACCC TGACGTTCTA CGTCGTCCAG GACCTCGACC GAGGGTACGG CGCGGCCGGG
CTGGTCGGCG GCGCGATCAC CGTCGGCGCG GCCCTCGGCG GCCCGCTGCT GGGTCGTCTG
ATCGACCGGC GCGGCCTTCG GCCGGTGCTG GTGCTGACCG CCGTCGCCGA AGCGGTGTTC
TGGTCCACCG CGCCGCTGCT GCCGTACGCG CTGCTGCTGC CCGCCGCGTT CCTCGCCGGT
ACGCTGGCGC TGCCGATCTT CGCGGTGGTC CGCCAGTCCA TCGCGGCCAT CGTGCCGGCG
GAGAAGCGCC GGCCGGCGTA CGCGCTGGAC TCGGTGTCGG TGGAGTTGTC CTTCATGATC
GGGCCGGCTC TGGCCACCCT CGCGGTCACC ACCATCTCCG CCCGCACCAC GCTGTACCTG
GTAGGCGCCG CCATCGTCGC CGCCGGCATC GGGCTCTTCC TGCTCAACCC GCCGATCCGG
GGCGCCAGCG AAGCGACCGG GCCGCGACGA AAGGTGCCGC GGCGGGAGTG GCTCACCGCC
CGGATGATCG CCGTGCTGGC AATCACCGCC GCCGCCACCA TGGTGCTGGG CGGCACCGAC
GTCGCCGTGA TCGCGGTCCT GCGCGACAAC GGCGACGTGG GCTTCACCGG CGTGGTGCTG
GGCTTCTGGG CACTCGCCTC GCTGCTCGGT GGCTTCGCGT ACGGGGCGAT CACCCGCTCC
CCGTCTCCGC TGGTGCTGCT CGCGGCGCTG GGCATCGCCA CGATCCCGGT CGGGCTGGCC
GGCGCGAACT GGTGGCTGCT CAGCCTGGTG CTGATCCCCG CCGGCCTGCT CTGCGCCCCC
ACCATCGCCG CCACCTCGGA TGCGGCCAGT CGACTGGCGC CCGCGGACGC CCGCGGTGAG
GCGATGGGGC TGCACGGCTC CGCCAACACC GTCGGCGTCG CGGTCGGCGC CCCACTGGCC
GGAGCCGTCA TCGACGCCTC CGCGCCGGCC TGGGGCTTCG CCGTGACCGG AGCGGTCGGT
GCACTGGTCG CTCTGGCGGT ACTCCCGGTG CAGTTGCGCC GCCGTCGGGA AGCCGAAGCA
CCGGCCCCCG TTCCCGAGCC CGAGCTGACC CACACTGCGT GA
 
Protein sequence
MSPEVRTNVN LKPYRAALAL PGLRALLIVA VLARIPLTTI GLTLTFYVVQ DLDRGYGAAG 
LVGGAITVGA ALGGPLLGRL IDRRGLRPVL VLTAVAEAVF WSTAPLLPYA LLLPAAFLAG
TLALPIFAVV RQSIAAIVPA EKRRPAYALD SVSVELSFMI GPALATLAVT TISARTTLYL
VGAAIVAAGI GLFLLNPPIR GASEATGPRR KVPRREWLTA RMIAVLAITA AATMVLGGTD
VAVIAVLRDN GDVGFTGVVL GFWALASLLG GFAYGAITRS PSPLVLLAAL GIATIPVGLA
GANWWLLSLV LIPAGLLCAP TIAATSDAAS RLAPADARGE AMGLHGSANT VGVAVGAPLA
GAVIDASAPA WGFAVTGAVG ALVALAVLPV QLRRRREAEA PAPVPEPELT HTA