Gene Sare_4031 details


Gene Information

Locus tag: Sare_4031
Symbol:
ID: 5705011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4584578
End bp: 4587871
Gene Length: 3294 bp
Protein Length: 1097 aa
Translation table: 11
GC content: 69%
IMG OID: 641273456
Product: acriflavin resistance protein
Protein accession: YP_001538812
Protein GI: 159039559
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID:
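
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are chosen here and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4584578
end_bp = 4587871
gene_length_bp = 3294
protein_length_aa = 1097

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the start and the end base.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp)              # 3294
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(expected_protein_aa)  # 1097
```

Under these assumptions the span matches the listed gene length and the codon count matches the listed protein length.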


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00115859
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGCTGC TCGCCAGATT CAGTCTCGCC AACCGAGGGC TCGTCGTCCT CATCGCGGTG 
GTAACCACGG TGTTCGGAGC GTTCGCCGTC CCGTCGCTGA AGCAGCAACT CCTACCGTCG
CTGGAGTTCC CGGCCGCGTT CATCGTGGCG TCGTACCCGG GCGCCGGCCC GGAGGCCGTC
GAGTCGCAGG TCACCGAACC GATCGAGAAC AGCCTGCAGG GCACCACCGG GCTGGAGAAG
ATCACCTCGA CGTCGCGTGA GGGGTCGGCG ACCATTCAGG TGGAGTACGA GTTCGGTACC
GACGTGGACG CCGTGGTCAA CCAGCTCCAG GCCGCGCTCA ACCAGGCTGA CGTCCAACTG
CCGGAGGGGG TCGACCCACA GGTCGTCGCC GGTGGCACGG ACGACCTGCC GGCGGTGGTG
CTCGCCGCCT CCGGCGGAGA AGACCAGCGG GTGCTCGGCG AGCGGTTGCG CGACACGGTC
GTACCGGAAC TGGCCGCGAT CGAGGGTGTC CGGACGGTCG ACATGACGGG TACCCGCGAC
GAGGTGGTGG TCATCACCCC GAATCCGACG AAGCTTGCGG CGGCCGGGAT CCGGCCCAGC
GCCTTCGGCG GGGCACTGCG GGCCAACGGG GTCACCGTTC CGGCCGGCGC GGTCGTCGAC
GGCCAGCAGA CTCTCCCGGT GCAGGTCGGC ACCCCGCTTA CCACACTTGA CGAGCTGCGC
GGCATCATTC TCACGGTGGC ACCGGCCGCC CCGGTGCGAC TCGGCGATGT GGCGACGGTG
GAGGAGGGGC TCTCGCCGGC CACCGCGATC ACGCGTACCA ACGGCCAGGA CAGCCTGGGC
ATCGCCGTGA CCGCGACTCC GGACGGCAAC GCGGTGGAGA TCTCCCAAGA GATCCGGGAG
CGGCTCGCGG GCCTGCGCGT CGCCTCCGGA GCCGAGCTGA CCGTCGTCTT CGACCAGGCC
CCCTTCGTCG AGAAGTCAAT CAAGAGCCTC ACCACCAAGG GCCTGATCGG TCTCGTGATG
GCGGCGGTGG TGATCCTGGT CTTTCTCCTC TCGGTACGCT CCACCGTCGT CACGGCGGTC
TCCATTCCGT TGTCGGTGCT GGTCGCGCTG ATCGCGCTCT GGGTCGGCGA CTACTCGCTC
AACCTGCTCA CCCTTGGCGC CCTGACCATC GCCGTCGGCC GTGTGGTGGA CGACTCGATT
GTGGTGCTGG AGAACATCAA ACGGCACCTG GAGTACGGGG AGTCGAAACA GGAGGCCATC
CCTACCGCGG TCCGTGAGGT GGCCGGAGCG GTTACCGCCT CCACGCTCAC CACGGTCGCC
GTGTTCGCGC CGATCGCGCT GGTCGGCGGG TTCGTCGGGC AGCTCTTCAC GCCGTTCGCG
ATCACCGTCA CGGTTGCGCT GCTCGCCTCA CTGTTGGTGT CGCTGACCCT GATCCCGGTA
CTGGCGTACT GGTTCCTTCG GCCGGCTGCC GGGACGGCGG ACGAGGCGGC AGTGCGCCGG
ACAGCGGAGG AAAAGGAGCT GCGCAACCCG CTGCAACGCG CCTACCTGCC GGTCATCGCC
CTTGCCACCC GGGATCGGGC GACTCGGTGG ATCACTCTCG GACTCGGCGC CCTGCTCCTC
CTCGGCACGT TCGGCCTGTC TCGTCAGCTG GAGACGAACT TCCTCGACGA CTCCGGTCAG
GACACGATGA CCATCACCCA GGAACTACCG GCGGGCACCG GTCTGGCCGG GACGGACGCG
GCCGCCCGGC AGGTGGAGGA GGTACTGGCT GGCACCGAGG GTGTCGAGAC GTACCAGGTG
ACGGCTGGCG CGGGTGGCAC CCCGTGGGCC GGCGGCGGCA ACAACGTGGC GACCTGGTCA
CTCACTCTCG GCGGGGACAC CGACGCCGAG AAAGCGCGTG GTGTCCTGCG CGGCAAGTTC
GACGAACTCG GTGAGGAGGC TGGCGAGTTC CGGTTTGGGG CCGGGCAGGG CGGTGCCGCC
GCGAATCAGC TTGAGGTGAT CGTTCAGGCC GCCGATCCGG CGACGCTGAC CCCCGCGATC
GACGAGGCGG CGGCCGCGAT GGCGGGGCTC ACCGGCGTCG AGGATGTGAC CACCAGCGTG
GCCAGCCAGG TCGGACAGGT CGAGGTCAGG GTCGATCGCG CGAAAGCCGC GGCGGCAGGG
CTCACCGAGG CAGCGGTCGG GCAACTGGTC GCACAGGCGT TCCGGGGCGC CCCGTTGGGA
CAGCTCACGA TCGAGGGTGA GCAGCGGGAC GTGGTACTGC GAACCACCCA ACCGCCGCTG
TCGACAGAGC AACTGCGGGC GTTGCCGGTC GGCGTGGTCA CGCTGGGTGC CGTCGCCGAC
GTCAGTGAGG TTCTGGGCCC GCAGCAGATC ACCCGGATCG ACGGGGAGCG CAGTGTCTCG
GTTACCGGCA CGGCCACGGG CTCGAACCTC GGTGAGACCA GTCAGGAGCT TCAGGAACGG
CTGGACGCGA TCGACATTCC CGGGGCCACC TTCGTCGTCG GTGGGGTCAG CGCGGACCAG
GAGGAGGCAT TCGCCGATCT GGGTTTGGCG GTACTCGCCG CAATCGCGAT CGTTTTCCTG
ATCATGGTTG TGACGTTCCG TAGCCTGACC CAGGCGTTGA TTCTGCTGGT CTCGGTGCCG
TTCGCGGCGA CCGGCGCGGT CGGGCTGCTG CTGGTGACTG GCACCCCGCT GGGCGTGCCG
GCGCTGATCG GTGTCCTCAT GCTGGTCGGC ATCGTGGTGA CCAACGCGAT TGTGCTGCTC
GACCTGGTGA ACCAGTACCG AGCCCAGGGG CTGGGGATCC GCGAGGCGGT GGTCGAGGGC
GGCCGGCGGC GGCTGCGTCC GATCCTGATG ACCGCGGTCG CGACCATCTT CGCGCTGCTG
CCGATGGCCC TGGGGCTCAC CGGTGAGGGC GGCTTCATCT CGAAGCCGTT GGCGATCGTG
GTGATCGGGG GTCTGCTCAG TTCGACGCTG CTCACCCTGG TCCTGGTACC GACCCTGTAC
GTCTTGGTGG AGCACGCCAA GGAGTCGATC CGGGACCGAT GGGCCGGCCG TCGGGGCGGG
CCGCTGGAGG CTGCGCCGTC GGCCGTCACC CCGCCATTGG CCGAACCGGC GGCGACACCG
GTCTCCAGCC GCTCCGGGGC GGACGGATCG GGCGGTGGCA CCGCCGGGAG CGCCGGCGCC
GGTGACGAGC CCGACCGGCC CTCACCCTCG CCTGCGCTGG TCGACGGGAC CGACCAGTTC
GAGGTGCTCC GCCTTCCCCG CAGCCGTACC TCCCCGCTTC CGCCCACGGA GTAA
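
A GC content figure like the one listed in the record can be recomputed from a gene sequence printed in 10-base blocks. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it is shown on the first 10-base block only rather than on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence.
first_block = "ATGTCGCTGC"
print(gc_percent(first_block))  # 60.0
```

Passing the complete gene sequence, with its spaces and line breaks left in, to the same function gives the whole-gene value.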
 
Protein sequence
MSLLARFSLA NRGLVVLIAV VTTVFGAFAV PSLKQQLLPS LEFPAAFIVA SYPGAGPEAV 
ESQVTEPIEN SLQGTTGLEK ITSTSREGSA TIQVEYEFGT DVDAVVNQLQ AALNQADVQL
PEGVDPQVVA GGTDDLPAVV LAASGGEDQR VLGERLRDTV VPELAAIEGV RTVDMTGTRD
EVVVITPNPT KLAAAGIRPS AFGGALRANG VTVPAGAVVD GQQTLPVQVG TPLTTLDELR
GIILTVAPAA PVRLGDVATV EEGLSPATAI TRTNGQDSLG IAVTATPDGN AVEISQEIRE
RLAGLRVASG AELTVVFDQA PFVEKSIKSL TTKGLIGLVM AAVVILVFLL SVRSTVVTAV
SIPLSVLVAL IALWVGDYSL NLLTLGALTI AVGRVVDDSI VVLENIKRHL EYGESKQEAI
PTAVREVAGA VTASTLTTVA VFAPIALVGG FVGQLFTPFA ITVTVALLAS LLVSLTLIPV
LAYWFLRPAA GTADEAAVRR TAEEKELRNP LQRAYLPVIA LATRDRATRW ITLGLGALLL
LGTFGLSRQL ETNFLDDSGQ DTMTITQELP AGTGLAGTDA AARQVEEVLA GTEGVETYQV
TAGAGGTPWA GGGNNVATWS LTLGGDTDAE KARGVLRGKF DELGEEAGEF RFGAGQGGAA
ANQLEVIVQA ADPATLTPAI DEAAAAMAGL TGVEDVTTSV ASQVGQVEVR VDRAKAAAAG
LTEAAVGQLV AQAFRGAPLG QLTIEGEQRD VVLRTTQPPL STEQLRALPV GVVTLGAVAD
VSEVLGPQQI TRIDGERSVS VTGTATGSNL GETSQELQER LDAIDIPGAT FVVGGVSADQ
EEAFADLGLA VLAAIAIVFL IMVVTFRSLT QALILLVSVP FAATGAVGLL LVTGTPLGVP
ALIGVLMLVG IVVTNAIVLL DLVNQYRAQG LGIREAVVEG GRRRLRPILM TAVATIFALL
PMALGLTGEG GFISKPLAIV VIGGLLSSTL LTLVLVPTLY VLVEHAKESI RDRWAGRRGG
PLEAAPSAVT PPLAEPAATP VSSRSGADGS GGGTAGSAGA GDEPDRPSPS PALVDGTDQF
EVLRLPRSRT SPLPPTE
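
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a hedged illustration: `translate` is a hypothetical helper, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in which start codons it permits). Only the first three blocks and the last nine bases of the gene are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGTCGCTGC TCGCCAGATT CAGTCTCGCC"))  # MSLLARFSLA
# Last nine bases of the gene sequence, ending in the stop codon TAA.
print(translate("ACGGAGTAA"))  # TE
```

Both results agree with the start (MSLLARFSLA) and end (TE) of the protein sequence listed above.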