Gene Sare_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0221 
Symbol 
ID5706125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp252999 
End bp254159 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content73% 
IMG OID641269750 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001535147 
Protein GI159035894 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.514226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00341518 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGTGC GACGACTCCT CCCGGCGCCT AGCTCGTCCC TGGTTGTCAA CGCCGGTCTC 
GGTGTGGCGG TGCTGCTCGG GGCGGCGCTG GCGTATCCGG CGTTCACCGG CCCGGATCGG
TCGGCGACCA CCGAACCGAG AACGCTGCCG GTCAGCCGGG CGACGGTGAC ATCCGTGGTC
TCCGTTGCCG GTTCGGTGCG CAGCGCGGTG ACCGCGAACG CGGTTTTCGC CACCAGTGGG
ACGGTGGCCG AGGTCTTGGC CACGGTCGGG GATTCGGTGA CCGAGGGGCA GGTACTGGCC
CGGCTTGACG CCACGGTGGT GGAACGCGAA CTGGCTGGTG CCGAGGCGGA CCTGACGGCG
GCGCGGGACG CGGTCGAGCG GGCGGCCGGG AACGGTGACT CCAGCACGCA GGCGGAAACC
GCGGTGGTCC AGGCGGAGCT GGCGGTCGCC GAGGCCCGGG AGAGGGTCGA CGGTGCCACC
CTCACCGCGC CGATGGCCGG GACGGTCGTC GCGGTCAACG GTGGCGTCGG GGATTCGGGG
GACGCGGGCG GCAGCACCGG TGGCACTGCC ACGGGTGGCA GCAGCACGGG CAGCACCAGT
GCCGGCTTCG TGCAGATCGC GGACCTCGAC CGGCTGGAGG TGAGCGCTGG GATACCGGAG
GCTGACGCGA CCTCGCTGAC CGTCGGGATG TCCGCCACGG TCAGCTGGAA CGCGCTGCCG
GGCACCACGG CCGGGGCGAG GCTCGCGGCT GTCGACCCGA ATGCCACCGT CACCAACGAT
GTGGTCACGT ACGGGATCAC CCTCAGCCTG GACGACCGTC CCGAGGCGGC TCGGCCCGGG
CAGAGCGTTC AGGTGACCAT TTCCCTTGGC ACGGCCGAGA ACGTCGTCGC GGTGAGCGCG
CTCGCCGTGA CCAGCATCGG CAACCGGCAC ACCGTGTCGG TGCTGGAAGA CGGCGCGCCG
GTCGTCCGCT CGGTCGAGCT GGGACTCCAG GGCGACCAAT TGGTGGAGGT CACCACGGGC
CTACGGGAAG GCGAGCGGGT GGTGCTGCCC AGCACCGGTG GGACGGACAC GCAGCCCGAC
ACCGACGGAG GGCGGGGCAG ACCAGCCGGG GGCGGCCTCC CCGGGGGCGG CGCTCCGGGT
GGCGGGCGGG GCGGCCGGTG A
 
Protein sequence
MRVRRLLPAP SSSLVVNAGL GVAVLLGAAL AYPAFTGPDR SATTEPRTLP VSRATVTSVV 
SVAGSVRSAV TANAVFATSG TVAEVLATVG DSVTEGQVLA RLDATVVERE LAGAEADLTA
ARDAVERAAG NGDSSTQAET AVVQAELAVA EARERVDGAT LTAPMAGTVV AVNGGVGDSG
DAGGSTGGTA TGGSSTGSTS AGFVQIADLD RLEVSAGIPE ADATSLTVGM SATVSWNALP
GTTAGARLAA VDPNATVTND VVTYGITLSL DDRPEAARPG QSVQVTISLG TAENVVAVSA
LAVTSIGNRH TVSVLEDGAP VVRSVELGLQ GDQLVEVTTG LREGERVVLP STGGTDTQPD
TDGGRGRPAG GGLPGGGAPG GGRGGR