Gene RoseRS_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0641 
Symbol 
ID5207579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp791447 
End bp794347 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content70% 
IMG OID640594258 
ProductTPR repeat-containing protein 
Protein accessionYP_001275011 
Protein GI148654806 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.249187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACC ATCGTCGACT CGCCATCCTC ACCAGGCTCC GCAAAGCGGC GCGCGTGACC 
CTGGCGCAGA TGGCCCGCGC CTGCGGTCTG GAGGGCAGCC GGGCCTATGA GTCGGTGAGC
GCCTGGGAGC GCGGCGAAGC CGTGCCGCGC GCTGCCGTTC GCACCGCATT TCTCGGCTAT
CTCGCGCACA CGCTCAACCT GGCCGACGAT CGTGAGACCC TTGCTGCGGT CTGGGAAACG
CTTGTGCACG AGTGGGGCTG GGAGCCGCTG CAGCCCGCCG ACTGGCAGTT GCTGGACTCA
GGGCAGCTTT TGGCCAGTGC GACGCCCGGG CGCTACCAAC TCACCAGGTC TGACTCGACG
AGCACCGGCG CGCCGCTCCC GGCGCACCTT TCGCCCGCGC CGGTTCCGCT CCCTCCCGGC
TCGCGCGCGC CCTTTGCGCC AAACCCGCAT TTTGTCGGCC GCGCCGCCGA GCTGCTGACC
ATCGCCGGGC ATCTGCACCG CCGCACGGCG CCGATCCACA TCGTCGTCTG CGGTCCCGGC
GGCATCGGCA AGACCCAGCT GGCGATTGAG ATCGTCCATC GTTGTGGTCA CGCCTTCCCC
GGCGGCGTCT TCTGGCTGGG CTGCGCCGAC CCCGCCGCTG TCCCGGCCGC GATTGCGGCC
TGCGGCGGGC AGGATGGCCT GCGCCTCCGT CCTGATTTCG ACCGGCTCAG TCTCGACGAG
CAGGTGCGAC TGGTGCTGGC GGCCTGGCAT GAGGAGACGC TGCGCCTGCT GGTCTTCGAC
AGCTGCGAGG ACCCGGCGCT GCTCCGCCGC TGGCTGCCGC GACGGGGCGG CTGCCGCGCG
CTGATCACCA GCCAGCGCCG CACCTGGGAC GCCGACCTGC CGGTGGCGAC GCTGCCGCTC
GGCTTGCTGA GCCGCGCCGA GAGCCTGGCG CTGCTGCGTC GCTACTGCCC GACGCCCCAG
ATCAGCGATG CGGAGCTGAT TGCGATCGCG GCCGAGCTCG ACGACCTGCC GCTGGCCCTG
CACCTGGCTG GCAGTCACCT GGCGCGCGGC GCCGATACGC TCACGCCGAC CTGCTACCTC
GCTGAGCTGC GTGCGCGGCT TGACCAGCAC CCGTCCCTGG CTGGCGTCGA CCCGCGCGGC
CGCCCGCTGC CAGCGCCGAC CGGTCACCAT AACCGGCTCG AACAGGTCGT CGCCCTCAGT
CTGTCGCGGC TCGATCCGGG CGGCGTGATC GACGCGCTCG CCTGCCAGCT CCTGTTGCGT
GCCGCGTGGT TCGCCCCCGG TGAGCCGATC CCAGGCGCGC TGCTCAGCGC CACGCTGGCG
CCCCGACCAG ACCCTGCGCT GCTTGCCGGC GCCCTCCAGC GGCTCGTGGA CATCGGCCTG
CTGCTCGATA GCGCCGGAGC GTACAGCGGT CGCCTGCACC GCCTGATAGC CGCCTTTCTC
CGCCGCCTCG ACAACGATGG GGCGGCGCAG CACGCCGTCG AGACGGCGCT GCTGGCCCAC
GCGCGGATGC TCAACGCCGC GCTCGATCAG CCGGCCCTGC TGCGCATCCA GGCGCATCTG
CGCGCGGTCA CCGACGCTGC GCTCGTGCGC GGCGATGCGA TCGCCGCCGA CCTGAGCTAC
GAACTGAGCC GCCATCTCGG CGAGATTGAC CAGTATGCAG CGGCGATTGC CTACAACCAG
CGCTCGCTGG ACCTGCGGAA TCAGATCTTC GGCGCCGACT CCGTGGCAAG CAGCGCGAAC
CTGCATTTTC ATGGGATGAT GCTGGATTGG CTGGGCGATT ACCCCGGCGC ACGCCCGTAC
CACGAGCGGG CGCTAGCGAT CCGGCGCGCG CGACTGGGGC CTGATCATCC TGATACCGCC
ACCAGCGTGC TGCACGCCGG CGAAATCGCC CATGCAGTCT GCGACTATGC GCAGGCGCGC
ACCTACTACG AGCAGGCGCT GGCGGCCCGT ATCAGGCATG ACGGCCCGGA TAGCCCAGGG
GCGGCGGAGC TCTACAACAA CCTGGGGCTC TTGCTTAACG CGATGGGCGA GTTTGATGCC
GCCCTACCCT ACGCGGAGCG CGCGGTGGCG ATCTGGGAAG CGCATGAGCA GCCCAATCGC
TCGCTCCAGT CAATGGCGAT CAACAACCTG GGCTACCTCC ACCGGGCCCG GGGCGAATAC
CGGCAGGCGC TGCCCCTCCT GCGCCGCGCG CTGGCGATCC GCGAGGAGGT CTACGGTCCG
ATGCATTCCT TCGTCGGCGT CACGCGCAAC CATATCGGGC GGGTCTACCA CTACCAGGGG
CGATTTGAGC AGGCCGGGGC GGAGCTTGAG GCAACTCTCC GGCTCTTCGA CGCCGCCATC
GGGCGCGAGC ACCCGATCAC GGCCTGTGCG CTCAGCAACC TGGGCATGCT GGCGCTGGAG
ATGGGCGACC GCGCCGGTGC GCGCCGGATG CTGGAAGCAG CGCTGGATAT CCATCGCCGG
ATCCTGGGCG CTGCGCATCG CCACACCGCG CGCGAGATGA ATCGGCTCGG GTTGCTCCAC
CAGGCGTTTG GGGATCGCGC GATGGCCACT CGTTACCTGC GCGCCGCGCT GGCTGTTCGC
CGACGCATCC TGGGCGATCT GCACCACGAT ACCGCCAATA CGCTGGGGCA CCTGGGCGTG
CTGCTGCTGA GCGCCGGACG ACCACGGCAG GCCCGGCCGC TGCTCGCTGC GGCGCTGCGC
CGCCATCTGG CCCGCCTGGG TGAGCGCCAC CCGTACACTG CGCGCAGCCT GCTGCGCATG
GGCCAGGTCT GCGCTGCACT TGGGGAACGA GAGATCAGTC GTGACTATCT GAGCCGGGCG
CTTGCTGTGT ACACCGCTGT GCTTGGCGCC GATCACCCGT ACACCCTGGC AGCGCGGCAG
GCTCAAGTTA CGGCGTGCTG A
 
Protein sequence
MPDHRRLAIL TRLRKAARVT LAQMARACGL EGSRAYESVS AWERGEAVPR AAVRTAFLGY 
LAHTLNLADD RETLAAVWET LVHEWGWEPL QPADWQLLDS GQLLASATPG RYQLTRSDST
STGAPLPAHL SPAPVPLPPG SRAPFAPNPH FVGRAAELLT IAGHLHRRTA PIHIVVCGPG
GIGKTQLAIE IVHRCGHAFP GGVFWLGCAD PAAVPAAIAA CGGQDGLRLR PDFDRLSLDE
QVRLVLAAWH EETLRLLVFD SCEDPALLRR WLPRRGGCRA LITSQRRTWD ADLPVATLPL
GLLSRAESLA LLRRYCPTPQ ISDAELIAIA AELDDLPLAL HLAGSHLARG ADTLTPTCYL
AELRARLDQH PSLAGVDPRG RPLPAPTGHH NRLEQVVALS LSRLDPGGVI DALACQLLLR
AAWFAPGEPI PGALLSATLA PRPDPALLAG ALQRLVDIGL LLDSAGAYSG RLHRLIAAFL
RRLDNDGAAQ HAVETALLAH ARMLNAALDQ PALLRIQAHL RAVTDAALVR GDAIAADLSY
ELSRHLGEID QYAAAIAYNQ RSLDLRNQIF GADSVASSAN LHFHGMMLDW LGDYPGARPY
HERALAIRRA RLGPDHPDTA TSVLHAGEIA HAVCDYAQAR TYYEQALAAR IRHDGPDSPG
AAELYNNLGL LLNAMGEFDA ALPYAERAVA IWEAHEQPNR SLQSMAINNL GYLHRARGEY
RQALPLLRRA LAIREEVYGP MHSFVGVTRN HIGRVYHYQG RFEQAGAELE ATLRLFDAAI
GREHPITACA LSNLGMLALE MGDRAGARRM LEAALDIHRR ILGAAHRHTA REMNRLGLLH
QAFGDRAMAT RYLRAALAVR RRILGDLHHD TANTLGHLGV LLLSAGRPRQ ARPLLAAALR
RHLARLGERH PYTARSLLRM GQVCAALGER EISRDYLSRA LAVYTAVLGA DHPYTLAARQ
AQVTAC