Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1097 |
Symbol | |
ID | 5208044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1365717 |
End bp | 1370963 |
Gene Length | 5247 bp |
Protein Length | 1748 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594711 |
Product | large extracellular alpha-helical protein-like protein |
Protein accession | YP_001275455 |
Protein GI | 148655250 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000683737 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | GTGCGTTTTC TGCGACAGGT CGGTTTGATC TGGACTGCCG CCCTGCTGGT CGTTGTTCTG GCGTCTCTGG CGCTGCCTGG AGCGCGCTTT CTTCTTCTCC CCTGGCTCTC CGACTCGCCT GTGGTTGTGG CGGTCTCGCC GCCTGATGGG GCGCGTGATG TTTCGCCGCG CACAGCGTTG ATCATTCAGT TCAACACGCC GATGAATCCG CCCGGCGTCG AACGTGCGTT GCGCATCGAA CCCGAGAGCG ACGTCGTCTA CGCCTGGGAT GATTCGCGTA CAACGCTGAC TGTCACACCA ACGAAGACGC TCCAGGCAGG CATGCGTTAT CGGGTCAGCA TCGATGAGAC GGCGCTGAGC CGGTTCTTCC GCCCGCTGGA GGAACCGTTC GTTTTCACAT TCGAAACAGC GCCGCCGCCA GCGGTGACTT CTCTGTGGCC CCGTGATGGC AGCGTTGAGG TTCCTGTCGA TACGCTCATC AGTGTGCGGT TCAGCCGATC CATTGTTCCT CCCGACCGGC TTGCCGTTCC TGAATTGTCA CCTGCGTTCC GCACCGATCC GCCGGTTTCC GGCAGTGTTG TCTGGATCGA TCCTGCCACG CTTCTCTTTC GCCCGGATCA ACCGCTTCGT CCCGGTGTTC GTTATACCTG CTCGCTATCG CCTGATCTGA CCGATCAGAG CGGTACGCCG CTTGGTCGCG CCTATTCCTG GTCGTTCACC ACGCTGGCGC CAACCGTGCT CAGTGTGTCG CCGCCGCCAA ATGCGCGTCA GGTTGCGCTC CGCGAGCCGC TGCGCATCGT TTTCTCGCAG CCGGTCGATC GGCAGGCGCT CGAAGCGGCG CTGTCGGTGA CACCGCCGAT GCCCGGCGCT CTCGAGGACG CCGTGTTGCC CGACGGAACG CAGGTTGTCA CGTACACCCC GACTGCTGAA TGGCAGGCTG GCGTTGTGTA TACGATTGCC CTTCCAGAAA AGACGGCGGA TGGAAGTCCG CTGCTGGTAA AGCCGTATCG ATGGAGTTTT ATGACGGCGC CAAAGCCAGC GCTGATCGGA AGGTTCCCCG GTGAGGGTCA ACTGCTTCCG CCGGGAGGCA GTGTGCGGTT GATCTTCAGC ACCCCAATCG ATGCTGGCGC CCTGCGCGAC AATCTGCGCG TCGAACCGCC GGTTGCACAT CTGCGGGTCG TCACGAACGA CGGTGAAGCG CGCATTGATG CGCAACTTCA GGCTGCCACC CTGTATACGA TAACAATACC GGCTTCGCTT TCAGATCGCG CCGGTGTTGC GCTGGAGCGT GACTATCAGG TTCGCTTTTT TACCGCACCG GCTGCACCAT CGCTCACGCT GCCGGAAGCG AATGGTCGCG TTATACGGTC GCTCCCTGAT CGAGCGATTG ACCTGCTGAC GCGGCGGACG AACCTCTCGG AATTGCGGTT GACGCTCTAT CCGCTCGATG AGGCGACGTT GTTGCGTGCG TTGAGTTTCA GCGATGCCGA GTGGACGTCG TTCGAGCCGG CGCGCTATGG TCTGTCACCT TTGCGTTTCT GGACACAGCC GCTGACCGAT CCGCTCAATA CAGTCGTCGA AGAGCGAGTG ACGGTGACCC TCGATGGTGG CGCGCCGCTG CCGCCGGGTT TCTACTTCCT GCGCATACGC TCGCCTGAGC GTGCTGGCGC TGGCGTTCTC CTGGCGGTGT CGCGTGTCAC GCTCTCGTTT CAGGTCGTCG GACAGCGCGC CATCGTATGG GTGACGGACA TCGCCAGTAC ATCGGTCATT TCCGACACGC CGCTGGCGCT CTACCGGCAG GGAACACTGA TCGCCGTCGG ACGCAGTGAT GAGCGCGGGG TGTGGGAAAC CGATCTGTCT GGCGTCAATC CGCGCGATCT TGTCGCTGTT GCTACGCTTC TGCCCGCCTT CGCCACACCG GAGGCGCCGG TGCAATCGGC GCCTGCGCCA CGCCTGCGAG TCATCCTGGC GACGGATCGA TCCATCTACT CTCCGGGCGA ATCGGTGGCG ATCCGTGGCT TCATTCGCCA GGAGGGATCA CAGGCATTCG AGATTCCTGA TCCGGGACAG TCGCTGGACC TCGACATCCA GGGTCCGTCG GGTGTCCGCC TGCGAAAGCG GATTGTTCTC GATGCCTCAG GAATGATCGA CGCGACGCTG GCGCTTCCCG CCAATGCGCC GTCTGGCGTC TATCGCCTCT TCACTCCTCG CGATGAGCGT GCGGCGCTCC AATTTTATGT GCATCCGCCT TCGTCGCCAT TACGTGCGTC AATCACGCGC ATAACGCAGG ATCAGGTCGT CGTCTCTGTC CGTACACCGG AAGATCTTCC GATTGCAGGC GCAACGATTA CCTGGACAAT CGATCCAGAA CCGATACTGC TGCCGGTCAG GGATGACTTC GTGTTCAGCC GACCGGAAAC GCCGCTTGCA TCTCTGTCAG GTGTCGGCGT CACTAATGAA CAGGGTATGC TGACACTCGC GCTCCCTTCA GACTGGTATC ACGTTCGCAT CCAGGCGCAG ATTGTCGAAG CTGGCGGACT GGCGGCGACG CTTGATCGAA CGATCTATAC AGCACCCGCA CCGGCAGTCG GTCTGCGGGC TGCGACATCG CTCGTGGGTG CAGGAGGTCA GACAAGCGTC GAGGTGGTGA CCCTGGCGGG CGATCAACCG CTTGCCGCGC AGCGCGTGCA GATCGATGCA GTGCGGCTCA ACGGCGAGAC TGCCAGTGAT ATGGGCGTGT CGCCATCCGA ATGGCAACTC CTGAGTCGTG TGATCACCAC CGATAACGAT GGACGGGCGA CATTCGCTGT GTCGCTTCCT GAACCGGGGG TATACCGGGT TCGTGCAGCG CTGGTCGGCG GCGGTCTGGC GTCTCCGCCG ACCGATATTG TTCTGCGTGC GTATCAGCCC GGTTTTACTG CCTGGAGCGA ACCTCGCACC AGCGTGTCGC TGGTTGCTGA TCGCGCGCGG TATCAACCTG GCGATACGGC ATTGCTTTTG CCGCTGGCGC CGATCCCGGA AGGTCTGGCA TTGCTGACGG TTCAACGGGC GTCAGGGGAT GTTGTGACCG AACTTCGCAC TGTGCGCGCT GGAGAACCGC TGACGCTCAC GTTGACCCCC GCCGATGCGC CGGTTGTCCG GGTTACGTTG ACGTCTGGGG TGCAGTCGCC AGCGTATCGG CGGTTACAGG TCGATGTGCC GGTGACTGCG ATCACTCCAT CGCTCCTGGC GACAGTGACG ACCGATGCAC AGACGTATGA TCCGGGAGCA ACCGCAGCGC TGACGATCAC CGTTACGGAT GCGCGTGGCG CTCCAGTGTC TGCGGATGTG CTGGTGCGAA TTACGGCAGG CGATGATGAT CGGCAGGAAC CCGTCGTCTG GCGCACCGGG CGAACGGACA GGAACGGCGT GATCCGCTTC GATGCACCGT TGCCCCAGAC CCCAGGCACA CATGAGGTGC GGGTATGGGT TGCGGGTGAA CGTGGCTTTG GTGTAACCGG AGCTGCGTTG CAGGCAAGGC AACCAATTGT TGGGCAGATT GTTGCGCCAC AGTTCGCGCG CGCCGGGGAT CGGTTCGTTG TCGGCGTGCG TCTCACCACG CAAGAGGATG TCCCGCGTCA GACGCGCATC ACAATGCGGA TGCCAGATGG AACCGCCGTT GTGCAGACAA CCGCAGTTCC CACAGAAGGC GCTGCGTTAG CGACGTTCAC GGTGCAGGCG CCGTCATCTG GCGCTGCCAT GGCGGTGCAG GCGATTGTCG AGGCCGATGA CGCTTTCAGT GAGACACTGC GAACTGATCT GTCTGTGTTA CCGCCAGCGA CGACGGTGCT CAGCACAGGC AGCGCGCTTG TGACCGACCG GTTCGAGGCG GCGATACCGG CGCCCCAGGC AAGGTGGGGA AGCCTGGATA TCGCCGTTGC GCCCTCCCTG GACGCGCTGG CGCTTGAACA GGCGCGCGCG CTTGCCGCAC TGGCGGATCG CCACGCGCTC GACAATGTGG CGATCATTCT GATGGCGGCA TCACTGGCGG ATGCTCGCCA GGAAACACAG ATAGCGGTCG ATCATCTGGC GAAATTGCAG GCAGCCGATG GCGGATGGAC ATGGAAGCGG CAGGGATCTT CGAGTCCGGT CGTTACCGCA GCGACGCTTG AGGTGCTCGC CAGGGCGAAG GAGTCTGGTT TTGCCGTTCC GGATACAACA CTGGAACGCG CAATCAACCT GGCATCCCGA CTGGCGAATG ATCCCGTCCT TTCGCTCGAA ACGCGCATCT GCCTGAGTTA TGCGCTGACG CAGCTTGACG CTCCTGTCCC GCGCGAATGG GACGAAAACG CTTTGAATGC GTCCGGGCTG GCGTGTCGCC TGTTGATGCT GCCGCCGGAT CAGGCGCGCA TCGACCCTGC GCTTCCCCGT CTGATCAGTC TGGCGCAACG CACACAGACG GAAGCGTGGT GGACGGCGCC AGACGGCGGC GCATTCCCGC ACGATGATGT TGCAACGACT GCAATTGCAG CGCGTGCAGT TCACCACGCA TCACCGCGGC ATCCGCTGAC TGCTAATGCC GCGCGCTGGC TGATCAGCCG TATGACGCCA GCGGGATGGG GCGACGCATA TACAACGGCG CGCGTGGTGC AGGCATTGCG CGCGATTGCG CCTGCCAGCA CACCGGCGAC CGTTGCGATG ACGCTCAACG GTGCGCCTGT CGTAGCACCT GCCGCGCCTG ACGCTGTGTT ACGGATCGTT CCTATTCCTC TCAGCGATCT GCGCCCGGTC AACACGCTCG TGGTGACCGG CGATGGCAAC CCGGCGCTCG TTGCCTGGCA GGTGACCTCT GCGGATCACG CAGCGCTTCC TGCGGAAGGC GTCGGTCTGA TCCGCGAGTT TCTCGATCCG CAAACCGGCG TTCTGCTGGA TCCGGCGCGT CTCCGGGTTG GACAACTGAT CAAAGTACGA CTGACCTGTG TTGCTCACAC CGAGCGACAT TTTGTGACAC TGCGCGATGC GTTCCCTGCC GGATTCGTGC CGGTCGATGC TGGATCGAGT CCGGTGTTTC GTCAAATCGA TCTGTTTTCA GACCGGATCG AACTAGCGGT CGAGGCGCTT GCACCCGGCA TCTATCAATA CACCTATCTG GTGCGCGCGG TAACGCCGGG ATCGTATGCT GTTCCGCCGC CGGAACTGAT CCTGCCCGGC GCTCGCGCGC TGACCGGAAC CGCCACAACG ACCGTCGTGC AAATTGTGGC GCCGTAG
|
Protein sequence | MRFLRQVGLI WTAALLVVVL ASLALPGARF LLLPWLSDSP VVVAVSPPDG ARDVSPRTAL IIQFNTPMNP PGVERALRIE PESDVVYAWD DSRTTLTVTP TKTLQAGMRY RVSIDETALS RFFRPLEEPF VFTFETAPPP AVTSLWPRDG SVEVPVDTLI SVRFSRSIVP PDRLAVPELS PAFRTDPPVS GSVVWIDPAT LLFRPDQPLR PGVRYTCSLS PDLTDQSGTP LGRAYSWSFT TLAPTVLSVS PPPNARQVAL REPLRIVFSQ PVDRQALEAA LSVTPPMPGA LEDAVLPDGT QVVTYTPTAE WQAGVVYTIA LPEKTADGSP LLVKPYRWSF MTAPKPALIG RFPGEGQLLP PGGSVRLIFS TPIDAGALRD NLRVEPPVAH LRVVTNDGEA RIDAQLQAAT LYTITIPASL SDRAGVALER DYQVRFFTAP AAPSLTLPEA NGRVIRSLPD RAIDLLTRRT NLSELRLTLY PLDEATLLRA LSFSDAEWTS FEPARYGLSP LRFWTQPLTD PLNTVVEERV TVTLDGGAPL PPGFYFLRIR SPERAGAGVL LAVSRVTLSF QVVGQRAIVW VTDIASTSVI SDTPLALYRQ GTLIAVGRSD ERGVWETDLS GVNPRDLVAV ATLLPAFATP EAPVQSAPAP RLRVILATDR SIYSPGESVA IRGFIRQEGS QAFEIPDPGQ SLDLDIQGPS GVRLRKRIVL DASGMIDATL ALPANAPSGV YRLFTPRDER AALQFYVHPP SSPLRASITR ITQDQVVVSV RTPEDLPIAG ATITWTIDPE PILLPVRDDF VFSRPETPLA SLSGVGVTNE QGMLTLALPS DWYHVRIQAQ IVEAGGLAAT LDRTIYTAPA PAVGLRAATS LVGAGGQTSV EVVTLAGDQP LAAQRVQIDA VRLNGETASD MGVSPSEWQL LSRVITTDND GRATFAVSLP EPGVYRVRAA LVGGGLASPP TDIVLRAYQP GFTAWSEPRT SVSLVADRAR YQPGDTALLL PLAPIPEGLA LLTVQRASGD VVTELRTVRA GEPLTLTLTP ADAPVVRVTL TSGVQSPAYR RLQVDVPVTA ITPSLLATVT TDAQTYDPGA TAALTITVTD ARGAPVSADV LVRITAGDDD RQEPVVWRTG RTDRNGVIRF DAPLPQTPGT HEVRVWVAGE RGFGVTGAAL QARQPIVGQI VAPQFARAGD RFVVGVRLTT QEDVPRQTRI TMRMPDGTAV VQTTAVPTEG AALATFTVQA PSSGAAMAVQ AIVEADDAFS ETLRTDLSVL PPATTVLSTG SALVTDRFEA AIPAPQARWG SLDIAVAPSL DALALEQARA LAALADRHAL DNVAIILMAA SLADARQETQ IAVDHLAKLQ AADGGWTWKR QGSSSPVVTA ATLEVLARAK ESGFAVPDTT LERAINLASR LANDPVLSLE TRICLSYALT QLDAPVPREW DENALNASGL ACRLLMLPPD QARIDPALPR LISLAQRTQT EAWWTAPDGG AFPHDDVATT AIAARAVHHA SPRHPLTANA ARWLISRMTP AGWGDAYTTA RVVQALRAIA PASTPATVAM TLNGAPVVAP AAPDAVLRIV PIPLSDLRPV NTLVVTGDGN PALVAWQVTS ADHAALPAEG VGLIREFLDP QTGVLLDPAR LRVGQLIKVR LTCVAHTERH FVTLRDAFPA GFVPVDAGSS PVFRQIDLFS DRIELAVEAL APGIYQYTYL VRAVTPGSYA VPPPELILPG ARALTGTATT TVVQIVAP
|
| |