Gene Sare_0296 details

Gene Information

Locus tag: Sare_0296
Symbol:
ID: 5707749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 329313
End bp: 332162
Gene Length: 2850 bp
Protein Length: 949 aa
Translation table: 11
GC content: 74%
IMG OID: 641269823
Product: ABC transporter related
Protein accession: YP_001535218
Protein GI: 159035965
COG category: [V] Defense mechanisms
COG ID: [COG1131] ABC-type multidrug transport system, ATPase component
TIGRFAM ID:
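
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 329313
end_bp = 332162


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2850
print(protein_length(length_bp))  # 949
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields in the record.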


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.762546
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.11249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATCGC CGTCAGCGGC CGGGTGGGTC CGGCGTGCCC TGCCCACCCG CCGGCGTGCC 
CTGGCCGCGG CGGCGGCCGT CGTCCTGCTC GCCGCCGGGG TCGCCTGGGC GGTTCGGCCG
ACCGAGCCGG ACCTGCGCAC CGAGGCGGGG CTGGTCACGG TCCGGTCCGG GCCGGCGGGT
GACGAACCGG TCGACCTGGA CACCACGCTG TACCTGCCCG CCGAGGCGAG GGCCGACGCT
CCGGTGCCGG CGGTGCTGCT CGCGCACGGC TTCGGCGGCA CCAAGGAGTC GGTCCGCGCC
GACGCCGAAG AATTCGCCGG CCAGGGGTAC GCGGTGCTGA CCTGGAGCGC GCGTGGCTTC
GGCCGCAGCG GCGGGCAGAT CCACCTGGAC CACCCGGACT ACGAGGTCCG CGACGCGCAG
CGCCTGCTGG ACTGGCTGGC CGCCCGACCG GAGGTCGCCA CCGACGCCCC CGGCGACCCG
CGGGTGGGCG TGGTCGGCGG CTCCTACGGC GGGGCGCTGG CGCTTCTGCT CGCCGCAACG
GACCAGCGGG TCGACGCGAT CGTGCCGATG ATCACCTGGA ACGACCTCGC CCGGTCCTTC
CTGCCGGAGA GCAGCGGCAA GGGACCGGGC CAGGGAGTGT TCAAACAGGG GTGGGCGGGG
CTGTTCTTCG GCAGCGGCGG CGCTGCCGGC TCCGGCTCGG TCGGCGGATC CTCCGGCTCG
GTCGGCGGAT CCTCCGGCTC GGTCGGCGGC TCCGCACCGG CCGGTTCCCG CTCAGCGGAC
GGACCTACCG CGGCCGGTGC GGTCGACCCG TCCTGCGGGC GGTTCGCCGC CGACGTGTGC
GCCGCGTACC TACGCATCGC CACCACCGGG CGGGCCGACC AGGGCGCCGT CGATCTGCTG
CGTCGCTCCA GCCCGGCCGG CGTACTCGAC CAGATCGAGG CGCCGACCCT GCTGGTGCAG
GGCGCGGCGG ACACCCTGTT CCCGCTGACC GAGGCGGATG CGAACGCACG CGGGATCGCC
GCGGCCGGCA CCCCGGTTCG GGTGGCCTGG TTCACCGGTG GTCATGACGG CGGCGACGGT
CCGAAGACCG ACTCGGACCG GGTGCGGTTC CTGACCGTGC AGTGGCTCGA CCACTACGTC
CGGGGCGAGG GCCCGGCGCC TGACGACGAT TTCACCTTCT CCCGGATCGC CGGGTTCGAC
GCCCTCGACC GGGGCCTGGT CGCCACCGGC TTCCGCACCA CCGACTACCC GGGCGTCACC
GGCACCGACC AGCGCACGGT GCCGGTGACC GGTCCGGCGC AGCCGATCGC CAACCCGCCG
GCCGGCAACC CGGCCGCGAT CACGTCGGTG CCGTTCGCCG GGGAACTCGC GTCGCTGTTC
GGCGGGGTGG CCGTGGACAT CCCCGGACAG CATGCCCGGT TCGAGTCCGC GCCGTTGTCG
GAGGCGGTGG ACGTGGTCGG CACGCCAACC GTCTCGATCC GGGCCGCGTC ACCGACCGGC
GAGGCCGTGC TCTTCGTCAA GCTCTACGAC GTGGACGGTG GCGGCGCGGC GACCCTGCCG
AACGGGCAGG TAGCGCCGAT CCGGCTGGTC GGGCTACCGG CCGCCATCGA GGACGCCGCA
CCGGTCACCG TCACCCTGCC GGCGATCGTC CGCCGGATCG AAGCCGGGCA CCGGCTGCGC
GTCGTGGTGG CGACCTCCGA CCAGGCGTAC GCCACCCCGG TCGAACCGGC CGTGCACACC
GTGGCCCTGG CGACCGGGGC GGTCACCCTG CCCACGGTGA CCAGCGCGCC GATTCCCACC
GCCGCCGTCG TCTGGCGTTG GGTGCTCGCC GGCCTGCTCG CCACGATCGT GGTCGGGCTC
GTCGTGGTCC TCGCCGTCCT CCGCCGGCGT CACCGCCGCC GGGAGAGCGA CATCCACCCG
GCGTACGCGA ACGTCCCGCT CGCCGTCCGG GAGCTGCGCA AGGAGTACGC CGACGGCTTC
GTCGCCGTGT CCAACGTCGA CTTCGAGGTG CATCCGGGGC AGGTGGTGGG CCTGCTGGGC
CCGAACGGCG CGGGCAAGAC CACCACCCTG CGGGTACTGA TGGGGCTGAC CCAGCCGACC
GCCGGTGAGG TCTACGTCTT CGGGCACCGG CTGGTGCCGG GTGCCCCGGT GCTGTCTCGA
ATCGGGGCCC TGGTGGAGGG CCCCGGCTTC CTGCCACATC TGTCCGGGCT GGACAACCTC
AGGGCGTACT GGCGGGCGAC CGGGCGGCCG TGGGCCCACG CGCGCTTCGA CGAGGCGTTG
GAGATCGCCG GACTGGGTGC CTCGGTACAC CGGAAGGTCC GGACGTACAG CCACGGCATG
CGGCAGCGCC TGGCGATCGC GCAGGCCATG CTCGGCCTGC CGGAGCTGCT GGTGCTCGAC
GAGCCGACCG ACGGCCTGGA CCCGCCGCAG ATCGCCGAGA TGCGCCGGGT CCTGCGGCGG
TACGCGACCG GCGGCCGGGC GGTGCTGGTC TCCAGCCACC TGCTCGCCGA GGTGGAGCAG
ACCTGCACCC ACGCGGTCGT GGTGAACAAG GGTCAGATCG TCGCGTCCGG CCCGGTCGAG
GAGATCGTCG GCGAGTCCCC CAGCGTCCTC GTGGAGGTCA CCGATCCGGT GGCCGCCCGG
GACGTGCTGG ATCGGCTGGC GGGTGTCCGG GTGCTGCCCG ACGGCGACGG GCACCTCGTG
GTGGACACCA ACGGCACCGC CCGCAGCGAG GTGGTCGCCG AGTTGGTGCG GGCCGGCATC
GGGGTGGACC GGGTGGTGCC CCGACGTCGC CTGGAGGACG CGTTCCTGGC CCTGGTCGGC
GAGAACTCTC GGGGAAGCGG GGATCGGTGA
 
Protein sequence
MRSPSAAGWV RRALPTRRRA LAAAAAVVLL AAGVAWAVRP TEPDLRTEAG LVTVRSGPAG 
DEPVDLDTTL YLPAEARADA PVPAVLLAHG FGGTKESVRA DAEEFAGQGY AVLTWSARGF
GRSGGQIHLD HPDYEVRDAQ RLLDWLAARP EVATDAPGDP RVGVVGGSYG GALALLLAAT
DQRVDAIVPM ITWNDLARSF LPESSGKGPG QGVFKQGWAG LFFGSGGAAG SGSVGGSSGS
VGGSSGSVGG SAPAGSRSAD GPTAAGAVDP SCGRFAADVC AAYLRIATTG RADQGAVDLL
RRSSPAGVLD QIEAPTLLVQ GAADTLFPLT EADANARGIA AAGTPVRVAW FTGGHDGGDG
PKTDSDRVRF LTVQWLDHYV RGEGPAPDDD FTFSRIAGFD ALDRGLVATG FRTTDYPGVT
GTDQRTVPVT GPAQPIANPP AGNPAAITSV PFAGELASLF GGVAVDIPGQ HARFESAPLS
EAVDVVGTPT VSIRAASPTG EAVLFVKLYD VDGGGAATLP NGQVAPIRLV GLPAAIEDAA
PVTVTLPAIV RRIEAGHRLR VVVATSDQAY ATPVEPAVHT VALATGAVTL PTVTSAPIPT
AAVVWRWVLA GLLATIVVGL VVVLAVLRRR HRRRESDIHP AYANVPLAVR ELRKEYADGF
VAVSNVDFEV HPGQVVGLLG PNGAGKTTTL RVLMGLTQPT AGEVYVFGHR LVPGAPVLSR
IGALVEGPGF LPHLSGLDNL RAYWRATGRP WAHARFDEAL EIAGLGASVH RKVRTYSHGM
RQRLAIAQAM LGLPELLVLD EPTDGLDPPQ IAEMRRVLRR YATGGRAVLV SSHLLAEVEQ
TCTHAVVVNK GQIVASGPVE EIVGESPSVL VEVTDPVAAR DVLDRLAGVR VLPDGDGHLV
VDTNGTARSE VVAELVRAGI GVDRVVPRRR LEDAFLALVG ENSRGSGDR
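
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence could be cleaned and its GC fraction computed for comparison with the GC content field. It is not the method the database itself uses.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, exactly as formatted in the record.
first_line = "ATGAGATCGC CGTCAGCGGC CGGGTGGGTC CGGCGTGCCC TGCCCACCCG CCGGCGTGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence text to `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field.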