Gene Information |
Locus tag | RoseRS_1146 |
Symbol | |
ID | 5208097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1429027 |
End bp | 1432302 |
Gene Length | 3276 bp |
Protein Length | 1091 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594763 |
Product | TPR repeat-containing protein |
Protein accession | YP_001275503 |
Protein GI | 148655298 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
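The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon. The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
print(cds_lengths(1429027, 1432302))  # (3276, 1091)
```

The result matches the Gene Length (3276 bp) and Protein Length (1091 aa) fields.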
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000383652 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCAGGCA ATCGCGCATT GTTCGACCGT GCTATGGAGC AAAGCCGCGA GGCTGCGCGG CTCATGAACT GGGACGAGGC GCTGAAACAG GCGGTTCGCG CGCTTCAGGA GTTTCCCCAG GATCTCGACG CACGCTCAGC CGCTGCGGTG GCGCTCTTCA ACACAGCAAA ATATGTACAG GCGCTCCAGA TGTTCGATGA ACTGCGCCGC GCCGACGCCA GCAATCCGTT CTATCTGGAA TACCTGGCGC GCACCCATGA GCGTCTCGGC GATCCAAAGG CGGCAACCAC TGCCTATGTG CAACTCGCCG ATCTCCAGAT CAGTCGAAAA CTGGCAGCGC GTGCGATCGA TGCGTTGCGT GAAGCGCTGC GCCTGCAACC CGATGCCGAT GATCAGCGGG CGCGCCTGGC ACAGTTGCTC GCCGATCAGG GTGCACGGGC GGAAGCGGCA GCGCAATACC TCGATCTGGC GCGACGCGCG CAGGCGCACG GGCGCCTCGA ACAGGCGGTG GATCTGGCGG AAACGGCGCT GCGCTATGAG CCGGACAACC GCGAAGCCAA AGAATTGATT GCTGCACTTC ATGACGCCCT GGCGACAACA CTTCAATCAG CAATCGAAGC GACGCCTGCT GCCACCGAGG CAATCCCGCT CCCGATCGCC GGAACCGGCG GTCTGCGCAG CGCACAGATC GCAGTCGAAC GCATCGTTGC ACTGGCGCAC GAACGGCAGG AGGCCGGTGA TATCGATGGC GCCATCGAAC AGTATGAGCG CGCACTGAAA CTGGGCGCCG ACCGGAGTGA TGTCTTTTAC AGCCTGGGAC TGCTGTACCA GGAACGGGGC GACTATCAGC GCGCTATCGA GTTGCTGCAG AGCGCCGCCG GTGATCAGGA GTACGCGCTC TCGGCGCACT ATATGCTCGG TCAGGCGTAT CAGGAGTTGG GGAAACTTCC CGAAGCAGCG CACGAGTATG AACAGACCAT TCGCCTCCTG CCGCTGGAGT CGATCGGGCG CGCCGAAGCC GACGATATGA TCCAGATGTA CGAGAGCGCA GCGCAGATCT ATATCCAACT CAACGACATT GCACGCGCGG CGACTCTCTA CTCAACGCTG GCAAATTTCC TCCAGAGCAA ACGCTGGGGG CGCGAGCGCG CCGATGAGTT TCGTCAGAAA GCCAAAGATC TGACCGAACG AAACATGTTC GCCAAGCTCC GCACGCTTGG CACCGGCGCA CTCTCACTCC AACCGCCCGC TCCAGAACCC GAACCGCCGC CCGAAAGCCC CATGCCCGAA ACATGGGGCA AGATCCGACC GATCACCGAC TTCTTGCGCG CGCCCGAAGA ACAAAAGAAG GATGAGACCC GATTCGAAAC CACTCCTGTT GCCGTCGAAC CGATCGATCC GCTGGCGGCG CTCGAAGCGC TGCCACCTCC TGAACGCGCT CCGACTGCGC CGGTTACTCC ACTCGATACG ACCGGATTGG ACGAACTGTG TGAGCGGTAT GTGCAGGCAA GCGAAAAATA CATCGAGCAG GGACTGATGC TGGCAGCCAA CGATGCCTGC ATGGAAGTCA TCCGGCTCAA CCCGGATTAC CTTCCGATCC ATCTGCGGTT GGGCGAGATT TATGAACGTG ATGGCCGCAG AGATGAGGCA TTGATCAAGT ATCAGTTGCT GATCGACACG TATGTGGCGC GTGGTGAACC GCGACGTGCC ATTGATGTCT ACTATCGCCT GATCGAGTTA TCCCCCGATA CGATCCTGCC GCGTTCGCGG CTGGCAGAAT TATTACGCGC CGAAGGACGC 
AACGAAGAAG CCGCTCAACA ACTTTCTGTG GTGGCTGGCG CCTATTTCCG CATGGGGCAG ACCAATAAAG CGCTCGAAGA GTACCGTCGC GCGCTGCAAT GGTCGCCTGC CAGCGCCGAT CTCCACGCGC AGTATGGCCA GGCGCTGCTG AAACTCGAAC GCACCGAGGC GGCGCTGGTT GCCTTCCGCC GCGCCCTCGA ACTCGATCAG CAGAATCCGG TCAACATCGC GCGCATCAAC CTGACGCTGG CGATCATGGG AGAACAATCG GTCGCCGTCT GGCAATCGCT GGCAACCCTG CTCGAACAGA TCAAACAGCA TCCGCAGCGC CTGAATGACG TGCAGGCGGA ATATCGCGCT GCATTTCTGA TCGCCGATCT ACCGGTTCTC CACTATATTC TCGGAATCAT CCAGCAAAAC GCCGGTCAGC ATCAGTCGGC AATCCTTGAG TTCGAACAGG CGCTCGAACT GCTGCAGAAC GAGCGCGATC CATTCCTGAC GTTGCATCTT GTTCATCAAG CGCTGGCAGA CAGTCACATC GCGCTGGGTC AGGCTGGCGA GGCGCTGGAG CAACTCCAGC GCTGCCTTGC ACTGGAACCG GCAGCGCCGC CTCCCGATAG CGCCCGCTAT CCCTTCTCGA CGCCACTCTC CCAGGGTGAA ATTGTGCGTC GCATGGCGGA AGCGTATGCT GCAGTCGGCG ATCTGGCGGG CGCTGAACGC GCCTTGCAGG AAGCCAAGCA GTTCCTCCCC TACGATCGGG CGATCTATAC GAAACTCTCT GATATTTATT TCCGTCAGGG TCGCCTGAAC GAAGCGCTGG CGCAACTCGA AGAACTGGCA AGCCACTATG AACAGCGTCA AATGCTGGAT CGCGCTATCG AAGCTCTTGA AAGCGCGCTG CGCCTGGCGC CGAATAATAC TGCAATCGGC AACCGCCTGG CGAAAATGTA TATCCGACGC GGTTACCTGG ATAAGGGCAT CGAAGCGCTG GTGCGTGTCG CCGACTTACA GCGCAAAGAG GGGCAGATCA AGGATGCCGT CGCCAGCCTG CAACAGGCGG CTGAAGTTCA CTGGACGCTC GGCAAACACG CCGAGGCGCG CGCATTGTAC GACAAAATCG TGCATATCGC GCCCAACGAT ATCGAAGCGC GCCAGTGGCT TTCGTTCATG TATACGCTGG CAGGCATGAC GCGCGAAGCT ATTGCGCAAA AGAAGCAGAT TATCCGTATT CTGCTCCAGC GCCGCGATCT GGATAATGCA ATCGCCGAAA TGCACCAGAT CTACGGATTG GATCAGAACG ATACTGACAA TCTGTTCCAG TTGGGCGATG CCCTTATGCG TCGGCAGGAG TATGAACAGG CTATTCGCAT CTACAACCGC CTGGCAAAAC TGCCAGACGT GGAGATTGAG CGTGTCGAAG CATTGCAGGC AGCAGCCAGA CGCATGCTCG AACAGCAACA GGCAGAGAAA CGGTGA
|
Protein sequence | MPGNRALFDR AMEQSREAAR LMNWDEALKQ AVRALQEFPQ DLDARSAAAV ALFNTAKYVQ ALQMFDELRR ADASNPFYLE YLARTHERLG DPKAATTAYV QLADLQISRK LAARAIDALR EALRLQPDAD DQRARLAQLL ADQGARAEAA AQYLDLARRA QAHGRLEQAV DLAETALRYE PDNREAKELI AALHDALATT LQSAIEATPA ATEAIPLPIA GTGGLRSAQI AVERIVALAH ERQEAGDIDG AIEQYERALK LGADRSDVFY SLGLLYQERG DYQRAIELLQ SAAGDQEYAL SAHYMLGQAY QELGKLPEAA HEYEQTIRLL PLESIGRAEA DDMIQMYESA AQIYIQLNDI ARAATLYSTL ANFLQSKRWG RERADEFRQK AKDLTERNMF AKLRTLGTGA LSLQPPAPEP EPPPESPMPE TWGKIRPITD FLRAPEEQKK DETRFETTPV AVEPIDPLAA LEALPPPERA PTAPVTPLDT TGLDELCERY VQASEKYIEQ GLMLAANDAC MEVIRLNPDY LPIHLRLGEI YERDGRRDEA LIKYQLLIDT YVARGEPRRA IDVYYRLIEL SPDTILPRSR LAELLRAEGR NEEAAQQLSV VAGAYFRMGQ TNKALEEYRR ALQWSPASAD LHAQYGQALL KLERTEAALV AFRRALELDQ QNPVNIARIN LTLAIMGEQS VAVWQSLATL LEQIKQHPQR LNDVQAEYRA AFLIADLPVL HYILGIIQQN AGQHQSAILE FEQALELLQN ERDPFLTLHL VHQALADSHI ALGQAGEALE QLQRCLALEP AAPPPDSARY PFSTPLSQGE IVRRMAEAYA AVGDLAGAER ALQEAKQFLP YDRAIYTKLS DIYFRQGRLN EALAQLEELA SHYEQRQMLD RAIEALESAL RLAPNNTAIG NRLAKMYIRR GYLDKGIEAL VRVADLQRKE GQIKDAVASL QQAAEVHWTL GKHAEARALY DKIVHIAPND IEARQWLSFM YTLAGMTREA IAQKKQIIRI LLQRRDLDNA IAEMHQIYGL DQNDTDNLFQ LGDALMRRQE YEQAIRIYNR LAKLPDVEIE RVEALQAAAR RMLEQQQAEK R
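The GC content field can in principle be rechecked from the gene sequence. The sketch below assumes the sequence is pasted as the space-separated 10-base blocks shown above and that GC content means the fraction of G and C bases over the full length. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return gc_count / len(bases)


# First two blocks of the gene sequence, for illustration only.
print(gc_fraction("ATGCCAGGCA ATCGCGCATT"))
```

Passing the complete gene sequence instead of the two sample blocks gives a value that can be compared with the reported GC content.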