Gene Sare_4959 details

Gene Information

Locus tag: Sare_4959
Symbol:
ID: 5706481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5633165
End bp: 5634433
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 68%
IMG OID: 641274354
Product: major facilitator transporter
Protein accession: YP_001539696
Protein GI: 159040443
COG category:
COG ID:
TIGRFAM ID:
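
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5633165
end_bp = 5634433

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1269 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) fields.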


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.592693
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.327663
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGCCCG TGGAAGAGGC CCGCACGACC CTGCCGGCGC CGGAGGCGTC CGCCGCCCGC 
CCTGAGATCG CGCTGCTGCT GAGCGTCGTG TTTCTTGCGT ATCTGGCACA GATGACCCTC
AACCCGATCA TCGCCCCACT CTCACGTGAG GTCGGCCTGG CCGAATGGCA GATCGGGGCG
ACGATCAGCA TCGCCGCGGT CATGCTCGTG CTCACCAGTC AGTTCTGGGG GCGGCGCTCG
CAGTCCTGGG GGCGCAAACC CGTCCTGGTC GCCGCGTTCA CGCTCGCGAT GGTGACGATG
TCCCTGTTCG CCCTGCTTGC CTGGCTCGGC ATGATCGGCA CGATCGCCGG CATCGAACTG
TTCCTGCTGT TCGTCCTGCT GCGTGGCGTC GGCTTCGGCA CCGCCATCTC CGCGGTCCTG
CCGACGGCGC AGGCATACAT CGCCGACGTC ACCAGTGACG AGACCGCGCG CGTCAAAGGC
ATGGCCGGCA TCGGCGCCGT TCAGGGCATT TCCATGATCG CCGGATCGGT CGTCGGTGGC
GTCCTGTCCG TGCTCGGCGT CCTCGCTCCC CTCATCGCTG TGCCCGTGCT CCTGGCAGGC
GGACTCATCC TTGTCGCGGT CCGCCTCCGC CGTGAACCGC GTCACCGACT GGTGGATAAG
CCGGCCCGGG TCAGTCCACT CGATGCTCGC GTCTGGCCAT TCCTACTCGC CGGGTTCGGC
ATGTACATGG CTCTGGGCTT TATCCAGATC CTCCTTGGCT TCATCGTGCA GGACCGGCTC
GGACTCGACA CCGAGAGCAC CGGGCTGGTC ACCGGCGGCG CGCTGCTGCT GGCGGGGCTG
GGCCTCATCG TGGCGCAAGC GGTGGTCGTG CCCCGCAGCC GATGGAGTCC CGCGACCCTG
CTCCGCGTCG GCGGCGCCAT CGCCTTCGTG GGCTTCACCC TCCTCATCCC CGACGCCGGG
GCGGCACCTT TGTTTGCCTC CATCCTGTTG ATCGGACTCG GTCTCGGCAT CGCGACGCCC
GGCTTCACCG CCGGCCCGAC ACTCATGGTC GATCGCGACG AACAGGGCGG CCTCGCTGGA
CTCACCACGG CAACCGTCGG CCTGACTTTC GTGATCGCGC CCACCGCCAG TACCGCTCTC
TACGGATTCG GGGCCGCGAT ACCGATCGTC GTCGGGACGG CAGTCATGGC CGTCGTCACC
ATCTTCGTCC TCGTTCACCC GCGCTTCCGG CGTCTCCCCG TACCAGCGCC AGGACCGCCC
CCAGCGTGA
 
Protein sequence
MKPVEEARTT LPAPEASAAR PEIALLLSVV FLAYLAQMTL NPIIAPLSRE VGLAEWQIGA 
TISIAAVMLV LTSQFWGRRS QSWGRKPVLV AAFTLAMVTM SLFALLAWLG MIGTIAGIEL
FLLFVLLRGV GFGTAISAVL PTAQAYIADV TSDETARVKG MAGIGAVQGI SMIAGSVVGG
VLSVLGVLAP LIAVPVLLAG GLILVAVRLR REPRHRLVDK PARVSPLDAR VWPFLLAGFG
MYMALGFIQI LLGFIVQDRL GLDTESTGLV TGGALLLAGL GLIVAQAVVV PRSRWSPATL
LRVGGAIAFV GFTLLIPDAG AAPLFASILL IGLGLGIATP GFTAGPTLMV DRDEQGGLAG
LTTATVGLTF VIAPTASTAL YGFGAAIPIV VGTAVMAVVT IFVLVHPRFR RLPVPAPGPP
PA
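
The two sequence listings can be related to each other with a short script. What follows is a minimal sketch, not the database's own tooling: it strips the 10-character block spacing, computes a GC fraction, and translates codons using the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# are those of the standard code, which table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGCCCG TGGAAGAGGC CCGCACGACC CTGCCGGCGC CGGAGGCGTC CGCCGCCCGC"
print(translate(first_line))   # MKPVEEARTTLPAPEASAAR
print(translate("CCAGCGTGA"))  # PA
```

The first 60 bases translate to the first 20 residues of the listed protein, and the final 9 bases give the terminal `PA` followed by a TGA stop codon. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.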