Gene Strop_4432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4432 
Symbol 
ID5060918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp5026494 
End bp5027753 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content70% 
IMG OID640476695 
Productmajor facilitator transporter 
Protein accessionYP_001161238 
Protein GI145596941 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGCTA TGTTCGCTGC CATGTTTGAA CCTCTGCGGG CGGCGCGGCT GGCCACCTTC 
ACCTACTTCA CCCTCAACGG CTTCGTCCTG GGGGCCTGGA TCGTCCACAT CCCCGCCGTC
GAACACCGAG CCGACATCAG CCACGCCACG CTCGGCTGGC TCCTGCTGCT CCTCGGTGCT
GGCGCCTTCG CCGGGATGCA CGTCGTGGGG CCGCTCACCG ATCGCTTCGG CGCTCGTCGG
GTTGTCCCGC TGAGTGCGGT GCTGTGCAGC ACGACGCTCG TCCTACCCGC GTTCGCCGAA
AGCGCCGGGA CGCTGGGCCT GGCACTACTG GTCTTCGGCA TCGGCAACGG CAGCCTGGAC
GTCAGCATGA ACACCCACGC GGTTCAGGTC GAAGCCGGAT ACAAACGCCC GGTTATGTCC
GCTTTTCACG CTATGTTCTC CGTCGGCGGC GTCCTCGCCG CACTGGTCGG CGCCCGGACC
CTCAGCTGGG GCTGGAGCCC GACAACCACG CTGACCATGG TCAGCCTCCT CGGCCTGACG
GTCACCGCGG TCGTCGCACC CGCGCTACTG CCGCGGTTGG CGACCACCCG TACGCCAACC
CCCGCCAGTG CCACCAAGAG CCGCCGGGCC AGCATCTCCC CGCGCATCTG GGCACTCGCC
GGACTCGCGC TGATCCTGAT GCTCACCGAG GGCGTCGCCA ACGACTGGAG TGCACTGGCC
ATGCGCGATG CGCTCGACGC GCCCGCCGCC ACCGCCGCGC TCGCCTACGG CGCCTTCGCC
ACCGCGATGA CGGTTGGACG GTTCCTCACC GACGGCATCG CCGCACGCTT CGGCCCGGTA
GCAATCGTCC GCTACGGCTG CGCGCTGGCC GCGCTCGCCC TCACCACCAT CGCGCTCGCA
CCATCCATCT CACTCGCTCT CGTCGGATGG GCGCTACTCG GAGTCGGTCT GTCCGGCGCC
GTCCCGCAGC TCTTCAGCGC GGCCGGCCAC ACCGACCCCG ACGCCGCCGG CACCAACGTC
TCCCGCGTCG CGGGCCTCGG CTACCTGGGC ATGCTCAGCG GGCCCGCCAT CATCGGGCCC
CTCACCCAGC TCATGCCCCT GAACTACACG TTCATCCTGC CTGCGTTCCT CTGCCTGGTC
GCCGCCCTCA CGGCCCACAT CCTTCGCCCC CGGGAGGAAG CGCCGCGGCT CGAGGTCAAG
GTTCCGCAGC CGGTCACCGC ACGATCAGCA GAGACTGGGC ACGGGGCAGA GCAGGACTGA
 
Protein sequence
MFAMFAAMFE PLRAARLATF TYFTLNGFVL GAWIVHIPAV EHRADISHAT LGWLLLLLGA 
GAFAGMHVVG PLTDRFGARR VVPLSAVLCS TTLVLPAFAE SAGTLGLALL VFGIGNGSLD
VSMNTHAVQV EAGYKRPVMS AFHAMFSVGG VLAALVGART LSWGWSPTTT LTMVSLLGLT
VTAVVAPALL PRLATTRTPT PASATKSRRA SISPRIWALA GLALILMLTE GVANDWSALA
MRDALDAPAA TAALAYGAFA TAMTVGRFLT DGIAARFGPV AIVRYGCALA ALALTTIALA
PSISLALVGW ALLGVGLSGA VPQLFSAAGH TDPDAAGTNV SRVAGLGYLG MLSGPAIIGP
LTQLMPLNYT FILPAFLCLV AALTAHILRP REEAPRLEVK VPQPVTARSA ETGHGAEQD