Gene Sare_4935 details


Gene Information

Locus tag: Sare_4935
Symbol:
ID: 5707082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5601729
End bp: 5604818
Gene Length: 3090 bp
Protein Length: 1029 aa
Translation table: 11
GC content: 72%
IMG OID: 641274330
Product: tetratricopeptide TPR_4
Protein accession: YP_001539672
Protein GI: 159040419
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
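
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5601729
end_bp = 5604818
gene_length_bp = 3090
protein_length_aa = 1029

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be an untranslated stop codon.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus one equals the listed protein length.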


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.179334
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCGGTG GGATCGACCG TGTCATCAAC AGCCATGGCG GTCTGGTGTT GGTCACCGGC 
GAGGCCGGCA TCGGCAAGAC CGCCCTGGTC ACCTGGGCGG CAAACGAGGC CCGCCGATCC
GGTGCCCTGG TGCTGAGCGG CTCGTGCTGG GATTCGGAGA GCGCGCCGGG TTACTGGCCG
TGGGTGCAGG TTATCCGCGG GCTGCGCCGC AACATGTCCG AGCAGGAGTG GAGGGCAGCG
GACAGGGTCG CCGGCGGTGG CCTGGCGGTG CTGTTCGGCG AGCGCCCCGG CGACGCACCG
GACGGGTTTC AGCTGTACGA CGCGGTGACC ACCGCCCTGG TGTCGGCCTC GCAGCGCTGC
CCGGTCGTGG TGGTCCTCGA CGACCTGCAC TGGGCCGACG CCGCCTCGCT GCGGTTGCTG
GAGTTCGCCG CCCAACACAC CTGGTTCGAG CGGCTGTTGC TGGTGGGCAC CTACCGCGAC
GTGGAAATGG AGCAGATCGC GCACCCGATG CGCCCGCTGC TCCTGCCGCT GGTGGCCAGG
GCGACCACCG TGACGTTGAC GGGGCTGGGA CCGGAGGACG TCCATACCCT GATGGCCCGC
ACCGGGGACG CCGAGTTGGA CCCTGCGCTC GTCGCCGAGG TGCACCAGCG CACCGGCGGC
AACCCGTTCT TCGTGGAGCA GACCGCACGG CTGTGGCACA GCGGCGGGTC GGTCGCCGTG
ATCCCGCCCG GTGTCCGCGA TGCCGTACGC CGGCGGTTGT CGCTCCTGCC CGGTCCGGTC
GAGCGACTAC TCAGCACCGC CGCCGTGCTC GGCCGCGAGT TCCACCGGCA GCTGTTGGCG
GCTGTGGCCG CGTCCCCGGT GCCGCACGTC GACCGGCTGC TCGACCAGGC GGCGACCACC
CGGCTCGTCG TGGCCAAGGG TGCGGGTCGG TTCGCGTTCA CGCACGATTT GGTCCGCGAG
ACGCTGTACG AAGAGGTTGC CGACGTGGGC CGGCAGCATG CCGCGGTGGT GCGGGCGATC
GACGACGTAC CGGCGCTGGC GCAGCGGGTG ATCCCGGCCG ACCTGGCGCA CCACGCCTAC
CTCGCCGGCG AGCATCTCGA CCCGGCCCGC GCGGTCGAGT TGCTCGTCGC GGCCGCTCGT
GCTGCCACCG GCCGGCTGGC CACCGAGGAG GCGACCGGGC ACTACCGGCG GGCACTTGAT
CGCGCCCGTG GCGGCGGGTC CTGCCTGCAC GTGGTGGCCG CGCTTGACCT CGGCGAGCAC
CTGCACCAGA TGGGTGACAT CGACGGTGCC TGGCAGATCT TCGACGAGGC GGTGGCGCGG
GCCCGCGAGC ACGGCGACCC GCAGCTGCTG GCCCGCGTCG CGCTGACGCT GCACGAGGCC
GCCGGACGGG ACACGATCGA CCACTCGACC ACGGAGTTGC TCGACCTCGC GCATGCCGCG
CTGGTGCGCG ACGGCGCCCC GGCGGAGGAG CCGATGTCCA CCGACCGGCT CACGCATGAG
CTCGTGACCC ACCTGTCGGC GTCGGTGCGT GGCGGTGACG ACGACGCACT GGCCTTCAGC
CTCTGGGCCC GCTATCACAC GGTCTGGGGT CTGGGCACCG CGGCGGCGCG GGTGACGCTG
GCCGATGAGA TGACGGACGT GGGGCACCAG GTCGGTGATC CGCAGCTGGA GCACTTCGGG
GCGTCACTGC GCTGGGTGAC GCTGCTCGAG CTGGGCGATC CGGGTTACCT CGAGAAGTAC
GATGCCTTCG TCGCCCAGGC CGAACGCGAC GGTATGCCGC TGGGCACCTT TGCCTCGGAT
GTCGACCAGA GCATCATCAG CACCTTCTCC GGCCGCTTCG CGCAGGCTGA GGCCCTCCTC
GACCGGGCCG TCGACGCGGT GGAGGAAGAC CAGTTCGCTA GTTTCGGCTA CAAAGCCGAC
CATCTTCGTT GGGCTACGTG GTTGCTGCAG GGCCGCTACG AGGGGCTGGA CGACCTGCAC
CGCACCGCCG CCGACCGCGG CCATCCCCAT CCGCGGCTGC TGGCCGGCAT CAGCGCCATC
GAGCAGGGCG ACGTCGCCGC CGCGCTGGAG CATCTGCAGG CAGCGCCCGG GCCGTACCCG
CGCGAGTACG CGCCGTTGGG GGTACGGTTC CGCGCGCAGG TGGCAGCCGC CACCCGGGAC
CGCCAGCTGT GCACGCGGGT GCGTGCCGAA CTGGCTCCCT ACCGCGGCCA GTGGCTGGTC
TCGCTGTACG GCTGGGACAT CAGTGGCCCG GTCGACCACT GGATCGCGCT CGTCGACGCC
GCCCTGGAAC AGTGGACCGA CGCCATTACC GGGTTCACCG TGGCGCGCGA GTCCGCCGGC
CGGCTACGGG CCCGACCCTG GGCGATCGAG GCCGGTGTCC AGTTGGCCGG TGCGATGCTC
GCCCGGGACG GGGCCACGGA CGCCGCCGCG GCGCTGCTGG ACGACGTACG GCGAGAGGCG
GCAGAGATCG GAATGCGCCA CATCGGCGCG CGGGTCGATC GGGTCGGCGG CGCCCGGCCA
GGTTCGACCC GCTCGCCCGC ACTGGCCGGC GAGTTCCGCC GCGACGGTGC CGTCTGGCTA
CTCGGCTTCG GTGGCCGCAC CGTTCACATG CCGGCCACCA AGGGCCTGAA TGACCTGCGT
CTACTGCTGA GCCGGCCGGG CGTCGACATG CCGGCGGTTC GCCTGCTGTC GCCCGAGGGC
GGCGAGGTGG TGGTTGCCAT GCGGCAACTG GGCGGTGACC CGGTGCTCGA CGACGAGGCC
AGGGCTCGAT ACAAGCAGCG CCTGGACGAC CTCGACGACG AGATCGACCG GGCGGCGGCA
CGTGGCGACA CGCGCCGGCT CGCCGAGTAC GACGGCGAGC GGCGGGCGCT GCTCGCCGAG
TTGCGGGCCG CCGCGGGGCT GGCGGGACGT ACCCGCCGCC TCGGCGACGA GTCCGAGCGT
GCCCGCAAGA CCGTGACCGC GCGCATCCGC GACACGCTGC GCAAGCTCGA CGACAGACAT
CCCGAACTCG CTGCCCACCT GCGCAGTGCC GTGACCACCG GTTCGACCTG CCGCTACCAA
CCAGCGTCTG AGGTGGCCTG GGTCCTGTGA
 
Protein sequence
MRGGIDRVIN SHGGLVLVTG EAGIGKTALV TWAANEARRS GALVLSGSCW DSESAPGYWP 
WVQVIRGLRR NMSEQEWRAA DRVAGGGLAV LFGERPGDAP DGFQLYDAVT TALVSASQRC
PVVVVLDDLH WADAASLRLL EFAAQHTWFE RLLLVGTYRD VEMEQIAHPM RPLLLPLVAR
ATTVTLTGLG PEDVHTLMAR TGDAELDPAL VAEVHQRTGG NPFFVEQTAR LWHSGGSVAV
IPPGVRDAVR RRLSLLPGPV ERLLSTAAVL GREFHRQLLA AVAASPVPHV DRLLDQAATT
RLVVAKGAGR FAFTHDLVRE TLYEEVADVG RQHAAVVRAI DDVPALAQRV IPADLAHHAY
LAGEHLDPAR AVELLVAAAR AATGRLATEE ATGHYRRALD RARGGGSCLH VVAALDLGEH
LHQMGDIDGA WQIFDEAVAR AREHGDPQLL ARVALTLHEA AGRDTIDHST TELLDLAHAA
LVRDGAPAEE PMSTDRLTHE LVTHLSASVR GGDDDALAFS LWARYHTVWG LGTAAARVTL
ADEMTDVGHQ VGDPQLEHFG ASLRWVTLLE LGDPGYLEKY DAFVAQAERD GMPLGTFASD
VDQSIISTFS GRFAQAEALL DRAVDAVEED QFASFGYKAD HLRWATWLLQ GRYEGLDDLH
RTAADRGHPH PRLLAGISAI EQGDVAAALE HLQAAPGPYP REYAPLGVRF RAQVAAATRD
RQLCTRVRAE LAPYRGQWLV SLYGWDISGP VDHWIALVDA ALEQWTDAIT GFTVARESAG
RLRARPWAIE AGVQLAGAML ARDGATDAAA ALLDDVRREA AEIGMRHIGA RVDRVGGARP
GSTRSPALAG EFRRDGAVWL LGFGGRTVHM PATKGLNDLR LLLSRPGVDM PAVRLLSPEG
GEVVVAMRQL GGDPVLDDEA RARYKQRLDD LDDEIDRAAA RGDTRRLAEY DGERRALLAE
LRAAAGLAGR TRRLGDESER ARKTVTARIR DTLRKLDDRH PELAAHLRSA VTTGSTCRYQ
PASEVAWVL
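
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can serve as an alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of that translation step, not the tool that produced this record. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical, and `START_CODONS` lists only a common subset of the initiation codons permitted by table 11.

```python
from itertools import product

# Amino acids in the conventional NCBI codon order (bases T, C, A, G;
# third position varies fastest). Table 11 shares these assignments
# with the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a common subset of table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, reading a start codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "TTGCGCGGTG GGATCGACCG TGTCATCAAC"
print(translate(first_block))  # MRGGIDRVIN
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence (with spaces and line breaks removed) to the same function should reproduce the whole protein, ending at the terminal TGA stop codon.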