Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4935 |
Symbol | |
ID | 5707082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5601729 |
End bp | 5604818 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274330 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001539672 |
Protein GI | 159040419 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.179334 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCGGTG GGATCGACCG TGTCATCAAC AGCCATGGCG GTCTGGTGTT GGTCACCGGC GAGGCCGGCA TCGGCAAGAC CGCCCTGGTC ACCTGGGCGG CAAACGAGGC CCGCCGATCC GGTGCCCTGG TGCTGAGCGG CTCGTGCTGG GATTCGGAGA GCGCGCCGGG TTACTGGCCG TGGGTGCAGG TTATCCGCGG GCTGCGCCGC AACATGTCCG AGCAGGAGTG GAGGGCAGCG GACAGGGTCG CCGGCGGTGG CCTGGCGGTG CTGTTCGGCG AGCGCCCCGG CGACGCACCG GACGGGTTTC AGCTGTACGA CGCGGTGACC ACCGCCCTGG TGTCGGCCTC GCAGCGCTGC CCGGTCGTGG TGGTCCTCGA CGACCTGCAC TGGGCCGACG CCGCCTCGCT GCGGTTGCTG GAGTTCGCCG CCCAACACAC CTGGTTCGAG CGGCTGTTGC TGGTGGGCAC CTACCGCGAC GTGGAAATGG AGCAGATCGC GCACCCGATG CGCCCGCTGC TCCTGCCGCT GGTGGCCAGG GCGACCACCG TGACGTTGAC GGGGCTGGGA CCGGAGGACG TCCATACCCT GATGGCCCGC ACCGGGGACG CCGAGTTGGA CCCTGCGCTC GTCGCCGAGG TGCACCAGCG CACCGGCGGC AACCCGTTCT TCGTGGAGCA GACCGCACGG CTGTGGCACA GCGGCGGGTC GGTCGCCGTG ATCCCGCCCG GTGTCCGCGA TGCCGTACGC CGGCGGTTGT CGCTCCTGCC CGGTCCGGTC GAGCGACTAC TCAGCACCGC CGCCGTGCTC GGCCGCGAGT TCCACCGGCA GCTGTTGGCG GCTGTGGCCG CGTCCCCGGT GCCGCACGTC GACCGGCTGC TCGACCAGGC GGCGACCACC CGGCTCGTCG TGGCCAAGGG TGCGGGTCGG TTCGCGTTCA CGCACGATTT GGTCCGCGAG ACGCTGTACG AAGAGGTTGC CGACGTGGGC CGGCAGCATG CCGCGGTGGT GCGGGCGATC GACGACGTAC CGGCGCTGGC GCAGCGGGTG ATCCCGGCCG ACCTGGCGCA CCACGCCTAC CTCGCCGGCG AGCATCTCGA CCCGGCCCGC GCGGTCGAGT TGCTCGTCGC GGCCGCTCGT GCTGCCACCG GCCGGCTGGC CACCGAGGAG GCGACCGGGC ACTACCGGCG GGCACTTGAT CGCGCCCGTG GCGGCGGGTC CTGCCTGCAC GTGGTGGCCG CGCTTGACCT CGGCGAGCAC CTGCACCAGA TGGGTGACAT CGACGGTGCC TGGCAGATCT TCGACGAGGC GGTGGCGCGG GCCCGCGAGC ACGGCGACCC GCAGCTGCTG GCCCGCGTCG CGCTGACGCT GCACGAGGCC GCCGGACGGG ACACGATCGA CCACTCGACC ACGGAGTTGC TCGACCTCGC GCATGCCGCG CTGGTGCGCG ACGGCGCCCC GGCGGAGGAG CCGATGTCCA CCGACCGGCT CACGCATGAG CTCGTGACCC ACCTGTCGGC GTCGGTGCGT GGCGGTGACG ACGACGCACT GGCCTTCAGC CTCTGGGCCC GCTATCACAC GGTCTGGGGT CTGGGCACCG CGGCGGCGCG GGTGACGCTG GCCGATGAGA TGACGGACGT GGGGCACCAG GTCGGTGATC CGCAGCTGGA GCACTTCGGG GCGTCACTGC GCTGGGTGAC GCTGCTCGAG CTGGGCGATC CGGGTTACCT CGAGAAGTAC GATGCCTTCG TCGCCCAGGC CGAACGCGAC GGTATGCCGC TGGGCACCTT TGCCTCGGAT GTCGACCAGA GCATCATCAG CACCTTCTCC GGCCGCTTCG CGCAGGCTGA GGCCCTCCTC GACCGGGCCG TCGACGCGGT GGAGGAAGAC CAGTTCGCTA GTTTCGGCTA CAAAGCCGAC CATCTTCGTT GGGCTACGTG GTTGCTGCAG GGCCGCTACG AGGGGCTGGA CGACCTGCAC CGCACCGCCG CCGACCGCGG CCATCCCCAT CCGCGGCTGC TGGCCGGCAT CAGCGCCATC GAGCAGGGCG ACGTCGCCGC CGCGCTGGAG CATCTGCAGG CAGCGCCCGG GCCGTACCCG CGCGAGTACG CGCCGTTGGG GGTACGGTTC CGCGCGCAGG TGGCAGCCGC CACCCGGGAC CGCCAGCTGT GCACGCGGGT GCGTGCCGAA CTGGCTCCCT ACCGCGGCCA GTGGCTGGTC TCGCTGTACG GCTGGGACAT CAGTGGCCCG GTCGACCACT GGATCGCGCT CGTCGACGCC GCCCTGGAAC AGTGGACCGA CGCCATTACC GGGTTCACCG TGGCGCGCGA GTCCGCCGGC CGGCTACGGG CCCGACCCTG GGCGATCGAG GCCGGTGTCC AGTTGGCCGG TGCGATGCTC GCCCGGGACG GGGCCACGGA CGCCGCCGCG GCGCTGCTGG ACGACGTACG GCGAGAGGCG GCAGAGATCG GAATGCGCCA CATCGGCGCG CGGGTCGATC GGGTCGGCGG CGCCCGGCCA GGTTCGACCC GCTCGCCCGC ACTGGCCGGC GAGTTCCGCC GCGACGGTGC CGTCTGGCTA CTCGGCTTCG GTGGCCGCAC CGTTCACATG CCGGCCACCA AGGGCCTGAA TGACCTGCGT CTACTGCTGA GCCGGCCGGG CGTCGACATG CCGGCGGTTC GCCTGCTGTC GCCCGAGGGC GGCGAGGTGG TGGTTGCCAT GCGGCAACTG GGCGGTGACC CGGTGCTCGA CGACGAGGCC AGGGCTCGAT ACAAGCAGCG CCTGGACGAC CTCGACGACG AGATCGACCG GGCGGCGGCA CGTGGCGACA CGCGCCGGCT CGCCGAGTAC GACGGCGAGC GGCGGGCGCT GCTCGCCGAG TTGCGGGCCG CCGCGGGGCT GGCGGGACGT ACCCGCCGCC TCGGCGACGA GTCCGAGCGT GCCCGCAAGA CCGTGACCGC GCGCATCCGC GACACGCTGC GCAAGCTCGA CGACAGACAT CCCGAACTCG CTGCCCACCT GCGCAGTGCC GTGACCACCG GTTCGACCTG CCGCTACCAA CCAGCGTCTG AGGTGGCCTG GGTCCTGTGA
|
Protein sequence | MRGGIDRVIN SHGGLVLVTG EAGIGKTALV TWAANEARRS GALVLSGSCW DSESAPGYWP WVQVIRGLRR NMSEQEWRAA DRVAGGGLAV LFGERPGDAP DGFQLYDAVT TALVSASQRC PVVVVLDDLH WADAASLRLL EFAAQHTWFE RLLLVGTYRD VEMEQIAHPM RPLLLPLVAR ATTVTLTGLG PEDVHTLMAR TGDAELDPAL VAEVHQRTGG NPFFVEQTAR LWHSGGSVAV IPPGVRDAVR RRLSLLPGPV ERLLSTAAVL GREFHRQLLA AVAASPVPHV DRLLDQAATT RLVVAKGAGR FAFTHDLVRE TLYEEVADVG RQHAAVVRAI DDVPALAQRV IPADLAHHAY LAGEHLDPAR AVELLVAAAR AATGRLATEE ATGHYRRALD RARGGGSCLH VVAALDLGEH LHQMGDIDGA WQIFDEAVAR AREHGDPQLL ARVALTLHEA AGRDTIDHST TELLDLAHAA LVRDGAPAEE PMSTDRLTHE LVTHLSASVR GGDDDALAFS LWARYHTVWG LGTAAARVTL ADEMTDVGHQ VGDPQLEHFG ASLRWVTLLE LGDPGYLEKY DAFVAQAERD GMPLGTFASD VDQSIISTFS GRFAQAEALL DRAVDAVEED QFASFGYKAD HLRWATWLLQ GRYEGLDDLH RTAADRGHPH PRLLAGISAI EQGDVAAALE HLQAAPGPYP REYAPLGVRF RAQVAAATRD RQLCTRVRAE LAPYRGQWLV SLYGWDISGP VDHWIALVDA ALEQWTDAIT GFTVARESAG RLRARPWAIE AGVQLAGAML ARDGATDAAA ALLDDVRREA AEIGMRHIGA RVDRVGGARP GSTRSPALAG EFRRDGAVWL LGFGGRTVHM PATKGLNDLR LLLSRPGVDM PAVRLLSPEG GEVVVAMRQL GGDPVLDDEA RARYKQRLDD LDDEIDRAAA RGDTRRLAEY DGERRALLAE LRAAAGLAGR TRRLGDESER ARKTVTARIR DTLRKLDDRH PELAAHLRSA VTTGSTCRYQ PASEVAWVL
|
| |