Gene Sare_2987 details

Gene Information

Locus tag: Sare_2987
Symbol:
ID: 5707635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3393011
End bp: 3394678
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 68%
IMG OID: 641272434
Product: ABC transporter related
Protein accession: YP_001537802
Protein GI: 159038549
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
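
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3393011, 3394678)
print(length)                     # 1668
print(protein_length_aa(length))  # 555
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.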


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000223105
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCATCC ACACCACCGC ACCCGTCCTG TGCACTGGCC TGTCGTTCGC CTGGCCGGAC 
GGCACGCAGG TTCTGGACGA TCTTTCGTGG GCAGCCAGGG AAGGCAGTAC CGGCCTGATC
GGGCGCAACG GCAGCGGGAA ATCCACCCTC CTGCGATTGA TCGCCGGTCG GCTCCTTCCC
TCCTCGGGCA CCGTCCACGT CGCCGGCAAT CCCGGCTACC TGTCCCAGGA CCTCACCTTT
GATCCCGCGA TGCGCGTGGA AGAGGTGCTG GGTGTCGCCT CCGTACGTGC CGCCCTGCAC
GCGATCGAAG CCGGCGATGC CAGCGAGGAC CACTTCACCG CCCTAGGCGA CCAGTGGGAC
GTGGAAGAAC GCACCCACGC TGCCCTGGCC CGCCTGGGCC TGGGTCACAT TAGGCTGGAC
CGCACCGTCG GCGAGATCTC CGGCGGCGAA GCAGTCCTGC TGCGCTTGGC CGCCCTTCTG
CTGGCCCGCC CCGTCGTCCT GCTGCTGGAC GAGCCGACCA ACAACCTCGA CCGCCATGCT
CGCCACCGCC TCTACGACGC GGTCACCGGC TACCGGGGCA CCGTCATCGT CGTGACCCAC
GACCGCGAGC TGCTGGAGCG CGTTGACCAG ATCGCCGAGC TGCGCACCGG AACCGTGCGC
ACCTTCGGCG GCAACCTGAC AGCCTATGAG GACACCATCG CTGCCGAGCA GGAAGCTGCT
GAGCGGGCTC ACGCCACCGC CCGCGCCGAC CTCGCCCGGC AGCGCCGCGA TCTGGCCGCC
GCGCAGACGA GGCAGGCCCG CAGCGCCCGC TACGGCAAGC AGATGTACGC CGACAACCGG
CTGGACAAAG CTTCCGCCCA GACCATGAAG CGTGCCGCCC AGGTCTCACG CGGCAAGCAA
CAGGCCGTCC ACGACCAGCG CCTGGCGGAC GCCAAGGAGC ACCTGGACAC GACCGAGGAC
CGGCTCCGCG ACGACGACAC CATCCGCATC GACTTACCGG ACACCACCGT TACGCCCCAC
CGCACCGTCC TGACCGTCGA TCGCGCCCGT CTGCGCACCG GCCGCAAGGT CGACGACCTG
CGGGTTCACG GGCCCGAGCG CATTGCGCTC ACCGGCCGCA ACGGCGCCGG GAAGACCACC
CTCCTGCGTA CCATTGCCGG ACAGCTTCCG CCCGCGGAGG GCGAGGTCTG TGTCCATGTC
GCGCTGCGCT ATCTGCCACA GCGACTGGAC CTGCTCGACG GCCAGCGGAC AATCTTCGAC
AACGTCGCCC ATCTCAACCC GCACGCCACC GGAAACCAGA TCCGTGCCCA GCTGGCCCGT
TTCCTGTTCA AGGGCGGCCG CGCCGATCAG CTGGTGGACA CGCTTTCGGG AGGAGAGCGC
TTCCGCGCGA CGCTGGCGGC GCTACTGCTG GCCGACCCCG CTCCTCAGCT GCTTCTACTG
GACGAGCCAA CCAATAGTCT CGACATGGAC AGCGTCGGCC AGCTTGCCCA AGCGCTGGCC
GGCTACCGGG GCGCGCTGGT CGTCGTCAGC CACGACTTCC CGTTTCTTCG GCAGATTGGC
ATCACCCGCT GGCTGGACCT CGATGATGAC GCGGCTGTGG AAGCATCCGA GGAAGCTGCG
GATCAGAAGG TCAGGGGTTC GAATCCCTTC GGGCGCCCAA GATCGTAA
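
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the Gene Length and GC content fields. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and the short input is just the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small illustration.
sample = "ATGAGCATCC ACACCACCGC"
print(len(clean_sequence(sample)))  # 20
print(gc_fraction(sample))          # 0.6
```

Passing the full sequence text to the same functions would give the values to compare with the record's 1668 bp length and 68% GC content.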
 
Protein sequence
MSIHTTAPVL CTGLSFAWPD GTQVLDDLSW AAREGSTGLI GRNGSGKSTL LRLIAGRLLP 
SSGTVHVAGN PGYLSQDLTF DPAMRVEEVL GVASVRAALH AIEAGDASED HFTALGDQWD
VEERTHAALA RLGLGHIRLD RTVGEISGGE AVLLRLAALL LARPVVLLLD EPTNNLDRHA
RHRLYDAVTG YRGTVIVVTH DRELLERVDQ IAELRTGTVR TFGGNLTAYE DTIAAEQEAA
ERAHATARAD LARQRRDLAA AQTRQARSAR YGKQMYADNR LDKASAQTMK RAAQVSRGKQ
QAVHDQRLAD AKEHLDTTED RLRDDDTIRI DLPDTTVTPH RTVLTVDRAR LRTGRKVDDL
RVHGPERIAL TGRNGAGKTT LLRTIAGQLP PAEGEVCVHV ALRYLPQRLD LLDGQRTIFD
NVAHLNPHAT GNQIRAQLAR FLFKGGRADQ LVDTLSGGER FRATLAALLL ADPAPQLLLL
DEPTNSLDMD SVGQLAQALA GYRGALVVVS HDFPFLRQIG ITRWLDLDDD AAVEASEEAA
DQKVRGSNPF GRPRS