Gene Sare_3541 details


Gene Information

Locus tag: Sare_3541
Symbol:
ID: 5703922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4082516
End bp: 4085947
Gene Length: 3432 bp
Protein Length: 1143 aa
Translation table: 11
GC content: 75%
IMG OID: 641272968
Product: TPR repeat-containing protein
Protein accession: YP_001538334
Protein GI: 159039081
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID: [TIGR02243] conserved hypothetical protein, phage tail-like region
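
The listed coordinates and lengths can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Coordinates as listed in the record; assumed to be 1-based and inclusive.
START_BP = 4082516
END_BP = 4085947


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


length_nt = gene_length(START_BP, END_BP)
length_aa = protein_length(length_nt)
print(length_nt, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (3432 bp) and Protein Length (1143 aa) fields.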


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.604889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00807963
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCGCG GCTACGGTGA CCGATGGCAT CATGCCGCCG AGCCGCCCTG GATGTTCGAG 
CCGACCAACG AGTGGCGTCC CCAGTTCCCC GGCCAACGCT ACCCCGGCGA CATCGATATC
GAGCACACCC CACGACCCGC CCCGCGCGGG CGGGCTGCCG TCGTCGGCCG AGCCGAGATA
CCCCCGCTCG CGCCGACCCG CCCCGACGGG ACCTACGTCG GCCGGTCCTG GGACGAACAG
GACGACCGCC GCACACGTCA CGACGACGAG ACATACCGGC GTCCCCACGA CGAGGCGTAC
CGCCGGGGCG AGCTGCCCCG GGTAGACGAC CGGGGCCGGC GACCGGAGCC GCTCAACGCC
CGCGGCGGCG AGCCCCACTG GGAACGGGGC AACCGGGACG AGACCGGGCG CGGTCGTTAC
GCCGACCACG GCCGGTACGA CGACCGGCGC CCACGGCACG AGCCCGGCGG GCGGGATCAG
CCGGTCTCCC CCGCCCGGCA CCCGGAACCG GGCTGGCTAC CCGAACCGGA CGAGGAACTG
CCCCGGGGGC ATGACCACTA CGCCTTCGGC CGAGATGCCG GGGAGCCGCC GCCCGGGACG
CGCTACCCGT GGGGGCGGCC CGGCGGCCCG GATCGTGGAC GACCCGGTGA GCCGCCCCTC
AGCGATGACG GCCGCTCGAA GCCGGACTGG AGCGGGGCGA CGCGCGACGG GGACGTCAAC
CCGTACCCGC GACGGCATCC GGCCGGCCCC GGCCACCCCG CCGGGCGCGA TGCGCCCAGC
CCGCACCGGG ACGCGGCGCC CGGACCGGGC CGCCACCCCG GCCGGTACGC GGCCCAGCCG
CCAAACCCCA ACCGGCGACC GGACACGGCC GCACCCTGGC CACCACCCGG CCCACCCCGG
CCGGGTCCGG ACCAGCCTGC CCCCCAGCGC GGAAGACCGC ATGGGCCACC CCTCCCGCCG
CCCAGCCGAT CACGTCAGGG CGGACTTCCG ACCGACACGC CGAGAGCCCC GTATGGCCGC
CCGGTGTCCG GCACGCCGAA CGGACCAACG TCCGGCGCGC CCAGCCGACC GGCATCCGGC
ACCACCGGGA CGCCTCCGCC GCCCCGCCCG CCCCAGGGCC ACCCAGGGGA CGACGTACCC
CGGCGGCCAC CGCCTCCGGT TCCGGGGATG GCTGGCCCAC CGCCGGTCCG GCCGGTCTCA
CCGGCGGCGG GGAACCACCG GGACCGGACG GACCAGGCGC GGACGGACAC CGGAGCCGAT
CGGGAGACCG GGTCGCCGCC AGGCACCGCG CCATGGCAGC CACGACGGTA CGTTCCGCCG
CCAGCCTCAC CGCCACTCAC CACCGAACCC GCGCCCGCCG CGGGACCTCC GCCAGGTCGA
AATGGGCCGG GCGCCTGGTT CGGACCGGCC GTATCGCCCT CCAAGGCAGC ACCATCCGAC
CCTGCCGCGC CGTCGCCGAC CGCCGCATCC GACCGGTCGG TCGAGGGGTC GGATGCCACA
TCACCACCGC CCCCAGCAGC ACCCGCTGGC GCCTCCACGG ACACGGCACC AGGCCGCGCC
ACCGCCGACA CGGAACCAAC CGAGACCGCA CCGGCCGCCG TCGCCGACAC CACCAACGAC
ACGGAGCCAA CCGACACCGC CGAGGACACG GAACCAGGCC GCGCCACCGC GAACACGGAA
CCAACCGACA CCGCCGCGAA CACGCAACCA GGCCGCGCCG CACCGGCCGC CGTCGCCGAC
ACCACCAACG ACACGGAACC AACCGACACC GCCGCGAACA CGCAACCAGG CCATGCCGCA
CCGGCCGCCG TCATCGGCAG CGACGACATC GTGGACGACC GCCCCACGCC GGTTGACGAC
GATGCCCTCA CGGAAAGTTC GGACTCGACA CACGCCGACG CCGACGCCGA ATCAGCGCCG
GAAGGCGCGC AGGTTCCCGT CGAGCCCGCG CCACCGCTCA CACCGTCACC CAGCGCGGAG
TCGATCCAGG TGATCTCGGC GCCGCCCGCG CCACAGCCGG AGGGCGACCC GGCCGAGACG
CCGAAGGCGG CGGCACCGCC GTCGGAGGCG CCGGCACCGG TTGCGGCAGC GACTCCCACC
GCCCGCCACG ACATGCCGCC GGCTGGCGAC ACGGCCCCAG CAGACCCTGA GCGCGTGCTG
GCCAACCATC CCTGGCGGCT GGACCCGACC ACGCTCCGCG AGGTCGTCAC GGACCCGGAG
CAGTTCCGGG GTCTCCGGCA CCGCCTCACC GAGAAACTCG ACACGGCCAT CGACAACCGG
TCCCGAGCCC GGCTACTCAG CCTGCGGGCG GTGGTGTCCC GGGCGGCTGG CGACCTGGAC
GACGCCCTGG CAGACGGCCG GCTGGCCCTG ACCTACGCGG AGGCCACCGG CGAGTTACGA
CGGTCCGCTC TCGCGCAGGC CCGGCTGGCA CACGTGCTGC GGTGGCGGGG CGAGTATGAC
GAGGCCGACC GGCTCTTCGC GCAGGCGAAC TCACCGGAGC TACCCGACCG GCTACGGGCG
GCGTTGCACG AGCACGCCGG ACGATGCTGC GTCGACCAGG GCCGGCTGGT CGAGGCGTGC
GTTCACTTCG AACGTGCTCT GGACCTACGC GGTACCGCCG ACCCGACGTT GCTGGACCGG
GTGCGGGTGA GTCTGGACGC AGTCGCCGAT CGGGCCGCCG CGACCGGGTT CGGGCCGTAT
CCCCGTAGCA GGGCGGAGGT ACTGGAGCCG GAACGACCGC CGGTGCCGGC ACGCGACGGC
AGCCTCTGGG GGTACGCCGA CCCGAACGGA GACCTGGTCG TCGCGGCGGA GTATGCGGAG
GTGCAGCCGT TCCAGGAGGG CTTGGCCTGG GCTCGCCGGC CGGACGACCA GCGGTGGTCG
TTGCTCGACC GGACCGGCAC CACCGTGTTG GCGGCGTCCT GGCTGGAAGC CGGTCCGTTC
GCCGATGGGC TGGCCTGGGT GTCACAGGGC GAGCCCGGCG GCTGGTGTGC AATCGACCCG
CGCGGTGAGG TCGTGGTGCC ACCGGGTTTC GCCGAGGTAC GACCGTTCCG GCGCGGGATC
GCCGTGGTTC GCCGGGAGGG GTGGGGCGCG GTTGACCGCG CCGGCCGGCT GGTGGTCCCG
ACCCGCTACC ACGGATTCGC CACCGTGCTC GCGGACGATT CACCGGTGGA CGGCTTCACC
GACGACGGTT TGGCCGTCGT GGACCTGGCC GGACGGTACG GGGTGGTGGA CCGAACCGGG
CAGGTGGTCG TGCCACCAGC GCACGCGGCG CTGGTGGTGC ACCCCGTGGC CTTCTTGGTG
GCGACGGCCG CCGGGCGGTG GGGGGCACTG GACCGGCAGG GTGAGCCGCT GATCGACCCG
ACTCACACCG ACCGGGCCAT GGTGCTCGCC GAGATCGATC AACTGCTCAC CGACGCGGTG
CCGGTTCTCT GA
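
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the database) clean such text and compute a GC percentage that could be compared with the GC content field when run on the full sequence. Only the first line of the sequence is included here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAACCGCG GCTACGGTGA CCGATGGCAT CATGCCGCCG AGCCGCCCTG GATGTTCGAG"
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC in the first line")
```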
 
Protein sequence
MNRGYGDRWH HAAEPPWMFE PTNEWRPQFP GQRYPGDIDI EHTPRPAPRG RAAVVGRAEI 
PPLAPTRPDG TYVGRSWDEQ DDRRTRHDDE TYRRPHDEAY RRGELPRVDD RGRRPEPLNA
RGGEPHWERG NRDETGRGRY ADHGRYDDRR PRHEPGGRDQ PVSPARHPEP GWLPEPDEEL
PRGHDHYAFG RDAGEPPPGT RYPWGRPGGP DRGRPGEPPL SDDGRSKPDW SGATRDGDVN
PYPRRHPAGP GHPAGRDAPS PHRDAAPGPG RHPGRYAAQP PNPNRRPDTA APWPPPGPPR
PGPDQPAPQR GRPHGPPLPP PSRSRQGGLP TDTPRAPYGR PVSGTPNGPT SGAPSRPASG
TTGTPPPPRP PQGHPGDDVP RRPPPPVPGM AGPPPVRPVS PAAGNHRDRT DQARTDTGAD
RETGSPPGTA PWQPRRYVPP PASPPLTTEP APAAGPPPGR NGPGAWFGPA VSPSKAAPSD
PAAPSPTAAS DRSVEGSDAT SPPPPAAPAG ASTDTAPGRA TADTEPTETA PAAVADTTND
TEPTDTAEDT EPGRATANTE PTDTAANTQP GRAAPAAVAD TTNDTEPTDT AANTQPGHAA
PAAVIGSDDI VDDRPTPVDD DALTESSDST HADADAESAP EGAQVPVEPA PPLTPSPSAE
SIQVISAPPA PQPEGDPAET PKAAAPPSEA PAPVAAATPT ARHDMPPAGD TAPADPERVL
ANHPWRLDPT TLREVVTDPE QFRGLRHRLT EKLDTAIDNR SRARLLSLRA VVSRAAGDLD
DALADGRLAL TYAEATGELR RSALAQARLA HVLRWRGEYD EADRLFAQAN SPELPDRLRA
ALHEHAGRCC VDQGRLVEAC VHFERALDLR GTADPTLLDR VRVSLDAVAD RAAATGFGPY
PRSRAEVLEP ERPPVPARDG SLWGYADPNG DLVVAAEYAE VQPFQEGLAW ARRPDDQRWS
LLDRTGTTVL AASWLEAGPF ADGLAWVSQG EPGGWCAIDP RGEVVVPPGF AEVRPFRRGI
AVVRREGWGA VDRAGRLVVP TRYHGFATVL ADDSPVDGFT DDGLAVVDLA GRYGVVDRTG
QVVVPPAHAA LVVHPVAFLV ATAAGRWGAL DRQGEPLIDP THTDRAMVLA EIDQLLTDAV
PVL
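
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so the same mapping is used here. The `translate` function is a hypothetical helper, and only the first and last lines of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, as listed above.
first_line = "ATGAACCGCG GCTACGGTGA CCGATGGCAT CATGCCGCCG AGCCGCCCTG GATGTTCGAG"
last_line = "CCGGTTCTCT GA"

print(translate(first_line))  # MNRGYGDRWHHAAEPPWMFE
print(translate(last_line))   # PVL
```

Both results match the corresponding ends of the listed protein sequence. The last line can be translated on its own because the 57 preceding lines of 60 bases each keep it in frame.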