Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4089 |
Symbol | |
ID | 5704742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4649262 |
End bp | 4651076 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273515 |
Product | TPR repeat-containing protein |
Protein accession | YP_001538870 |
Protein GI | 159039617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0357411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0415301 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCCGG ACAAACCAGA CCTGCCGGCG GCGGAGGAAC TGACCCTGGC CCGGCTGGCC CTGACCGAGG GCGACCTGCC GCACGCCGCC GCGCACATCG CGGGGGCACT GGCGCAGGCG CCGACCCTCC CCGAGGCCCA CGAGCTACTC GCCCGACTCG CCACCGTCAA CGGCGGCGGG CTGGACCTGT TCCCCCTGCG TCAGAACGTT TTCGTCGGGA CCGTGGTCGC TCGGGCCCAT CTGCTCGCCG CTGCCGGCCA GCCCGCCGAG GGACTGGACC TGCTCGTCGC AGCCACCGGC TACGCGCCCC ACACGCCATG GGCGGAGGTG CCCTGGGTGA CCGCGCCCGA CCTGGCCGAA CGCATCGAAC CCGACCGTAC CGCCCAGATT CTCATGCAGG TCTGCGCCGC GACCACCGAT CCGGTCCCCC GGTCCGGTCA GGCGGCCCTC ACTCCGTACC TCACACTGGC CCGCAACGCG GTGACCGTGC ACCAGGGACA CCCGTTGTTG TTCGGGGCGG CCTCCGCGCT GGCCCGCCGG CTCGGCGAGG TGTCCCTCGC CGTCCGCTGG GCGTCCCGAG GGTTACGCGT CGGGCCGTCG AAGATCGGTG AGGTGTGGCT CGGGTACGCC TACCGAAGCG CGGGTCGGAT CCAGGACGCC CTCGCGGCGC TCGAACGTGC CGTCGCACAC GACCCGGACG ACCTGGCCGT CTATGCCGAC ATCGCGGGGA CACTCGCCGA CCACAATCGG CTGGACGAGG CCCTGGACTG GATCGACCGG GCGCTTGCCC GGGATCCGAC GTTCGACTGT GCGGTACACA CCGCACACCG GCTGCGCTTC CAGCGTGACG GCGACGTCGC CCACCTGATC GCGTTGGCGG ATTTCGTCCG GGAGCACCCG GAGGATTCCC ACGAGCACAC CGACCTCGCC GAATCCTGCG CCGGACAGCC CTGGCTCGGG CAGATCACCC CAGCCGCCGG CCCGGCCGTT GACGCGCTCC GGACCGTCCA CTCCGGCGGT CAGCTGGGAG CCACCGTCGG GTTGGAGTCG CCGGTACCGC CGAGCGCCCT GCGCACCCTC AGCCAGGCCG CGCCGGGCCT GCGGGTCCAG ATTGCCGAGG GGCCGGGTCC CGACCCGTAC GAACCCCGAC GCGCGATCGG CCAGCACCTG TGGCGACATG ACGGCCTACC CGCGGTCGGG AACCCCTCGC CGCTGGCGGC CGACCACCTT CGGCAGTTCG TGCACCCGGC GTGGCCCCAT CCCCCGGCGG CGTACGACGC CGCCGTGGCC CTGGCGACCC TGGATCTGGC CGATCTGCTC GGACTGCTCA CCCACCCACC GGCGGTTCCG CCGACCGCAC TGGGCCGGGT CCTCGACGAG CAGGATCCGT CGCTGTGGGT CCGCAGCGTC CAGGTGTGGG CCTGCCTCGG GCTGCTGCAC CATCGCACCG ACGAGCCGTG GCCGACCTCC ACCCGGCGGA CGGTGCTCCT GGACCTCGTC TGGGGCGTCG AGGACTGGAC CACCGAGGCC GCGCTCTTCG CCCTGGTCAC CGCCGCCTGG GTAGACCCGG CGGTCCGCCC CGAGGTCGCC AGGGTGGTCG CCGAACGACT CGCGGACGCG GTCGAGGTGA CCCGGACCCG CCCGGTACCG ATCGCCGTGT CGCTGGGACA CCTGGCGCTG GCGACCCCGG ACCTGACCCA ACCGGTCCGG GCACTGGCCC GCACGCTGAC TGAGGGCCGA CCCCCCGTTC CGGAACCCCC GCGCCGACTC TCCCGGCTCT GGCGGCGCCT CACGGCAATC TTCCGCCGCT CCTGA
|
Protein sequence | MSPDKPDLPA AEELTLARLA LTEGDLPHAA AHIAGALAQA PTLPEAHELL ARLATVNGGG LDLFPLRQNV FVGTVVARAH LLAAAGQPAE GLDLLVAATG YAPHTPWAEV PWVTAPDLAE RIEPDRTAQI LMQVCAATTD PVPRSGQAAL TPYLTLARNA VTVHQGHPLL FGAASALARR LGEVSLAVRW ASRGLRVGPS KIGEVWLGYA YRSAGRIQDA LAALERAVAH DPDDLAVYAD IAGTLADHNR LDEALDWIDR ALARDPTFDC AVHTAHRLRF QRDGDVAHLI ALADFVREHP EDSHEHTDLA ESCAGQPWLG QITPAAGPAV DALRTVHSGG QLGATVGLES PVPPSALRTL SQAAPGLRVQ IAEGPGPDPY EPRRAIGQHL WRHDGLPAVG NPSPLAADHL RQFVHPAWPH PPAAYDAAVA LATLDLADLL GLLTHPPAVP PTALGRVLDE QDPSLWVRSV QVWACLGLLH HRTDEPWPTS TRRTVLLDLV WGVEDWTTEA ALFALVTAAW VDPAVRPEVA RVVAERLADA VEVTRTRPVP IAVSLGHLAL ATPDLTQPVR ALARTLTEGR PPVPEPPRRL SRLWRRLTAI FRRS
|
| |