Gene Sare_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3169 
Symbol 
ID5705844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3654756 
End bp3656381 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content70% 
IMG OID641272600 
ProductABC transporter related 
Protein accessionYP_001537967 
Protein GI159038714 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000739402 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGACG CTTCAATCGT CTGCACCCGC CTGTCCTTCG CCTGGCCCGA CGGCACCACC 
GTCTTCCAGG ACCTTTCCCT TACCGTCGGC ACCGGGCACA CCGGCCTGGT CGCGCCCAAC
GGCGTCGGCA AGACCACCCT GCTTCGGCTG ATCGCCGGCG AACTCGCGCC CACCGCCGGT
GCCGTCACCG TTACCGGCAC CCTCGGCTAT CTGCCGCAGA ACCTGCCGCT GTCCGGTGCC
CTGACCACCG CGGAGGTGCT CGGCGTCGCC GCTACCGTCG CCGCTCTGCA CGCCATCGAA
GCCGGTGATG CTCGTGAGGA ACACTTCACC ACGGTCGGCG ACGACTGGGA CGTCGAGGAG
CGCAGCCGCG CCGAGCTGGA CCGGGTCGGG CTCGGTGAAC TCGCCCTCGA CCGCCGACTG
GACACTCTCA GCGGCGGCGA GATCGTCGCC CTTGGGCTGG CCGCCCACCT GCTGCGTCGC
CCCGACATCC TGTTGCTCGA CGAGCCCACC AACAACCTCG ACGTCGACGC CCGGCACCGC
CTGTACGCGG TCCTGCAGGA GTGGCCCGGG GTCCTGCTGC TGGTCAGCCA CGACCGGGCG
TTGCTGGACC GGATGGAGCG TATCGCTGAA CTGAACCCGG GCGAGGTCCG GTTCTTCGGT
GGAAACTTCA CCGACTATCA GGAGGCAACG CGGGCTGCCC AGGAGGTTGC CCAGCGGCAC
GTCCGCTCCG CCGAACTGGA GGTCAAACGG GAGAAGCGGG AGATGCAGCA GGCCCGCGAG
CGGGCCGAGC GCCGGGCCGG AAACGCCGCG CGCAACGTCA AGAACGCCGG ACTACCCAAA
GTCATCGCCG GTGGCCTCAA GCGGCGTGCC CAGGAGTCGG CCGGCAAGGC TGACGAGACG
CACGCCGCTC GGGTCGCCGA CGCACGGGCT CGCCTCGACG AGGCCAGTCG CGCGTTGCGC
GACGATGCTT CCATCACCGT CGAACTACCC GACACGGCGG TGCCGGCCGG TCGTACCCTC
CTCGTGGCGA AGGGGCTGCG GGTCCGCGAC CTCTTTGCCG GGGAGGGCGT TGACCTGACC
ATCAAGGGCC CGGAACGCAT CGCGCTCACC GGCCCGAACG GCGTCGGTAA GTCGACCCTA
CTGCGTCTGC TGGACGGCTC CCTGCCCCCG GACGCGGGCA CTCTCCGGCG GGCAGACGGC
CGGATGGCAT ACCTGTCGCA GCGCCTCGAC CTGCTCGACC TCGAACGGAC GGTCGCGGAG
AACGTCGCCG GATACGCGCC GGACCGGCCG CAGTCGGAAC GGATGAACCT GCTCGCTCGG
TTCCTGTTCC GGGGAACGCG GGCGCATCTG CCGGTTGGGG TGCTCTCTGG TGGGGAACGC
CTGCGCGCCA CGCTCGCCTG CGTCCTGTAC GCCGAGCCGG CCCCACACCT GCTGCTGCTC
GACGAGCCCA CCAACAACCT CGATCTGGTC AGCGTCCGGC AGTTGGAGAC CGCGCTCCAG
GCGTACCGGG GGGCGTTCGT TGTGGTCAGC CACGACGAAC GGTTCCTCAC CGAGATCGGG
GTCCGGCGGT GGTTGCGGCT CGACGACGGG CGGCTGCGGG AGTTCGCAGC CCCCGACGGC
GACTGA
 
Protein sequence
MSDASIVCTR LSFAWPDGTT VFQDLSLTVG TGHTGLVAPN GVGKTTLLRL IAGELAPTAG 
AVTVTGTLGY LPQNLPLSGA LTTAEVLGVA ATVAALHAIE AGDAREEHFT TVGDDWDVEE
RSRAELDRVG LGELALDRRL DTLSGGEIVA LGLAAHLLRR PDILLLDEPT NNLDVDARHR
LYAVLQEWPG VLLLVSHDRA LLDRMERIAE LNPGEVRFFG GNFTDYQEAT RAAQEVAQRH
VRSAELEVKR EKREMQQARE RAERRAGNAA RNVKNAGLPK VIAGGLKRRA QESAGKADET
HAARVADARA RLDEASRALR DDASITVELP DTAVPAGRTL LVAKGLRVRD LFAGEGVDLT
IKGPERIALT GPNGVGKSTL LRLLDGSLPP DAGTLRRADG RMAYLSQRLD LLDLERTVAE
NVAGYAPDRP QSERMNLLAR FLFRGTRAHL PVGVLSGGER LRATLACVLY AEPAPHLLLL
DEPTNNLDLV SVRQLETALQ AYRGAFVVVS HDERFLTEIG VRRWLRLDDG RLREFAAPDG
D