Gene RoseRS_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1146 
Symbol 
ID5208097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1429027 
End bp1432302 
Gene Length3276 bp 
Protein Length1091 aa 
Translation table11 
GC content59% 
IMG OID640594763 
ProductTPR repeat-containing protein 
Protein accessionYP_001275503 
Protein GI148655298 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000383652 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCAGGCA ATCGCGCATT GTTCGACCGT GCTATGGAGC AAAGCCGCGA GGCTGCGCGG 
CTCATGAACT GGGACGAGGC GCTGAAACAG GCGGTTCGCG CGCTTCAGGA GTTTCCCCAG
GATCTCGACG CACGCTCAGC CGCTGCGGTG GCGCTCTTCA ACACAGCAAA ATATGTACAG
GCGCTCCAGA TGTTCGATGA ACTGCGCCGC GCCGACGCCA GCAATCCGTT CTATCTGGAA
TACCTGGCGC GCACCCATGA GCGTCTCGGC GATCCAAAGG CGGCAACCAC TGCCTATGTG
CAACTCGCCG ATCTCCAGAT CAGTCGAAAA CTGGCAGCGC GTGCGATCGA TGCGTTGCGT
GAAGCGCTGC GCCTGCAACC CGATGCCGAT GATCAGCGGG CGCGCCTGGC ACAGTTGCTC
GCCGATCAGG GTGCACGGGC GGAAGCGGCA GCGCAATACC TCGATCTGGC GCGACGCGCG
CAGGCGCACG GGCGCCTCGA ACAGGCGGTG GATCTGGCGG AAACGGCGCT GCGCTATGAG
CCGGACAACC GCGAAGCCAA AGAATTGATT GCTGCACTTC ATGACGCCCT GGCGACAACA
CTTCAATCAG CAATCGAAGC GACGCCTGCT GCCACCGAGG CAATCCCGCT CCCGATCGCC
GGAACCGGCG GTCTGCGCAG CGCACAGATC GCAGTCGAAC GCATCGTTGC ACTGGCGCAC
GAACGGCAGG AGGCCGGTGA TATCGATGGC GCCATCGAAC AGTATGAGCG CGCACTGAAA
CTGGGCGCCG ACCGGAGTGA TGTCTTTTAC AGCCTGGGAC TGCTGTACCA GGAACGGGGC
GACTATCAGC GCGCTATCGA GTTGCTGCAG AGCGCCGCCG GTGATCAGGA GTACGCGCTC
TCGGCGCACT ATATGCTCGG TCAGGCGTAT CAGGAGTTGG GGAAACTTCC CGAAGCAGCG
CACGAGTATG AACAGACCAT TCGCCTCCTG CCGCTGGAGT CGATCGGGCG CGCCGAAGCC
GACGATATGA TCCAGATGTA CGAGAGCGCA GCGCAGATCT ATATCCAACT CAACGACATT
GCACGCGCGG CGACTCTCTA CTCAACGCTG GCAAATTTCC TCCAGAGCAA ACGCTGGGGG
CGCGAGCGCG CCGATGAGTT TCGTCAGAAA GCCAAAGATC TGACCGAACG AAACATGTTC
GCCAAGCTCC GCACGCTTGG CACCGGCGCA CTCTCACTCC AACCGCCCGC TCCAGAACCC
GAACCGCCGC CCGAAAGCCC CATGCCCGAA ACATGGGGCA AGATCCGACC GATCACCGAC
TTCTTGCGCG CGCCCGAAGA ACAAAAGAAG GATGAGACCC GATTCGAAAC CACTCCTGTT
GCCGTCGAAC CGATCGATCC GCTGGCGGCG CTCGAAGCGC TGCCACCTCC TGAACGCGCT
CCGACTGCGC CGGTTACTCC ACTCGATACG ACCGGATTGG ACGAACTGTG TGAGCGGTAT
GTGCAGGCAA GCGAAAAATA CATCGAGCAG GGACTGATGC TGGCAGCCAA CGATGCCTGC
ATGGAAGTCA TCCGGCTCAA CCCGGATTAC CTTCCGATCC ATCTGCGGTT GGGCGAGATT
TATGAACGTG ATGGCCGCAG AGATGAGGCA TTGATCAAGT ATCAGTTGCT GATCGACACG
TATGTGGCGC GTGGTGAACC GCGACGTGCC ATTGATGTCT ACTATCGCCT GATCGAGTTA
TCCCCCGATA CGATCCTGCC GCGTTCGCGG CTGGCAGAAT TATTACGCGC CGAAGGACGC
AACGAAGAAG CCGCTCAACA ACTTTCTGTG GTGGCTGGCG CCTATTTCCG CATGGGGCAG
ACCAATAAAG CGCTCGAAGA GTACCGTCGC GCGCTGCAAT GGTCGCCTGC CAGCGCCGAT
CTCCACGCGC AGTATGGCCA GGCGCTGCTG AAACTCGAAC GCACCGAGGC GGCGCTGGTT
GCCTTCCGCC GCGCCCTCGA ACTCGATCAG CAGAATCCGG TCAACATCGC GCGCATCAAC
CTGACGCTGG CGATCATGGG AGAACAATCG GTCGCCGTCT GGCAATCGCT GGCAACCCTG
CTCGAACAGA TCAAACAGCA TCCGCAGCGC CTGAATGACG TGCAGGCGGA ATATCGCGCT
GCATTTCTGA TCGCCGATCT ACCGGTTCTC CACTATATTC TCGGAATCAT CCAGCAAAAC
GCCGGTCAGC ATCAGTCGGC AATCCTTGAG TTCGAACAGG CGCTCGAACT GCTGCAGAAC
GAGCGCGATC CATTCCTGAC GTTGCATCTT GTTCATCAAG CGCTGGCAGA CAGTCACATC
GCGCTGGGTC AGGCTGGCGA GGCGCTGGAG CAACTCCAGC GCTGCCTTGC ACTGGAACCG
GCAGCGCCGC CTCCCGATAG CGCCCGCTAT CCCTTCTCGA CGCCACTCTC CCAGGGTGAA
ATTGTGCGTC GCATGGCGGA AGCGTATGCT GCAGTCGGCG ATCTGGCGGG CGCTGAACGC
GCCTTGCAGG AAGCCAAGCA GTTCCTCCCC TACGATCGGG CGATCTATAC GAAACTCTCT
GATATTTATT TCCGTCAGGG TCGCCTGAAC GAAGCGCTGG CGCAACTCGA AGAACTGGCA
AGCCACTATG AACAGCGTCA AATGCTGGAT CGCGCTATCG AAGCTCTTGA AAGCGCGCTG
CGCCTGGCGC CGAATAATAC TGCAATCGGC AACCGCCTGG CGAAAATGTA TATCCGACGC
GGTTACCTGG ATAAGGGCAT CGAAGCGCTG GTGCGTGTCG CCGACTTACA GCGCAAAGAG
GGGCAGATCA AGGATGCCGT CGCCAGCCTG CAACAGGCGG CTGAAGTTCA CTGGACGCTC
GGCAAACACG CCGAGGCGCG CGCATTGTAC GACAAAATCG TGCATATCGC GCCCAACGAT
ATCGAAGCGC GCCAGTGGCT TTCGTTCATG TATACGCTGG CAGGCATGAC GCGCGAAGCT
ATTGCGCAAA AGAAGCAGAT TATCCGTATT CTGCTCCAGC GCCGCGATCT GGATAATGCA
ATCGCCGAAA TGCACCAGAT CTACGGATTG GATCAGAACG ATACTGACAA TCTGTTCCAG
TTGGGCGATG CCCTTATGCG TCGGCAGGAG TATGAACAGG CTATTCGCAT CTACAACCGC
CTGGCAAAAC TGCCAGACGT GGAGATTGAG CGTGTCGAAG CATTGCAGGC AGCAGCCAGA
CGCATGCTCG AACAGCAACA GGCAGAGAAA CGGTGA
 
Protein sequence
MPGNRALFDR AMEQSREAAR LMNWDEALKQ AVRALQEFPQ DLDARSAAAV ALFNTAKYVQ 
ALQMFDELRR ADASNPFYLE YLARTHERLG DPKAATTAYV QLADLQISRK LAARAIDALR
EALRLQPDAD DQRARLAQLL ADQGARAEAA AQYLDLARRA QAHGRLEQAV DLAETALRYE
PDNREAKELI AALHDALATT LQSAIEATPA ATEAIPLPIA GTGGLRSAQI AVERIVALAH
ERQEAGDIDG AIEQYERALK LGADRSDVFY SLGLLYQERG DYQRAIELLQ SAAGDQEYAL
SAHYMLGQAY QELGKLPEAA HEYEQTIRLL PLESIGRAEA DDMIQMYESA AQIYIQLNDI
ARAATLYSTL ANFLQSKRWG RERADEFRQK AKDLTERNMF AKLRTLGTGA LSLQPPAPEP
EPPPESPMPE TWGKIRPITD FLRAPEEQKK DETRFETTPV AVEPIDPLAA LEALPPPERA
PTAPVTPLDT TGLDELCERY VQASEKYIEQ GLMLAANDAC MEVIRLNPDY LPIHLRLGEI
YERDGRRDEA LIKYQLLIDT YVARGEPRRA IDVYYRLIEL SPDTILPRSR LAELLRAEGR
NEEAAQQLSV VAGAYFRMGQ TNKALEEYRR ALQWSPASAD LHAQYGQALL KLERTEAALV
AFRRALELDQ QNPVNIARIN LTLAIMGEQS VAVWQSLATL LEQIKQHPQR LNDVQAEYRA
AFLIADLPVL HYILGIIQQN AGQHQSAILE FEQALELLQN ERDPFLTLHL VHQALADSHI
ALGQAGEALE QLQRCLALEP AAPPPDSARY PFSTPLSQGE IVRRMAEAYA AVGDLAGAER
ALQEAKQFLP YDRAIYTKLS DIYFRQGRLN EALAQLEELA SHYEQRQMLD RAIEALESAL
RLAPNNTAIG NRLAKMYIRR GYLDKGIEAL VRVADLQRKE GQIKDAVASL QQAAEVHWTL
GKHAEARALY DKIVHIAPND IEARQWLSFM YTLAGMTREA IAQKKQIIRI LLQRRDLDNA
IAEMHQIYGL DQNDTDNLFQ LGDALMRRQE YEQAIRIYNR LAKLPDVEIE RVEALQAAAR
RMLEQQQAEK R