Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2885 |
Symbol | |
ID | 5209854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3599559 |
End bp | 3602534 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640596481 |
Product | PKD domain-containing protein |
Protein accession | YP_001277203 |
Protein GI | 148656998 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATGCA ATCGTGCCCT TGTATGGCTT GGAATCCTCC TTATTGCGAG CATGCTGCCG GTTTCATCCC CCACCCACTC GACCGCCATT GCGCAGAGTG CGCCCGATGG ACCACAACCG GGTGACGTGT TTCGTGAGTA TGCATTTCCG CAACGCCTGA CACTCTGCTT CCGTCCTTCC AATTCTGCAT CGTGCAACAA CATTCCGGTA GAACGCTCTG CGGGTCTGAC ATTCAACCCG GCTGGCGCAG TGCGCGCTGA ACTGGCGGTC GAATACTGGG GCGGTCACAT CGGAACCCGC CAGCATTTTC GCGTCAACAG TCAGGATTAC TGGCAACCTT TGCCACCGAT CCAGAATGTT CCCGCAGGCG CTCGCCAGCA CTGCTATTTC CGCTCCGTGC TGGGACGCAC CCTGCCGCTT GATCTGAGTC AATTGCGAAA CGGCGAGAAT CGCTTCCGCT TTACAATCGG GCACAACAAT CAGTATGGTC AGGACATCAG TTGTCCAGGC AACTTTGGTT GGGGAACAAC GTATGTCTAC GGTTTCATCG CCCGCCTCTA CTACACACCC GGCAGTATTC CCGCTCCCGG CGGCACAATC GTATCGCCCG CCCAAAACGC GACCATCGGT GAAATGCCGC AGATCACCGT CCAGGCGACG CCTCCAACCG GACGGACAAT AGCGCGGGTT GATGTGATCG GTGAGTATCT CGATTACGAC TGGGATGGCA ACGGCGTGTT GCGCGAATGG CAGTACCAGA TGGCGCGCAC CCAGATGCAA CGCCATATCG GTCAGGCGGT GCGTGACGGA AATGTGTATC GCGTGGTGTG GAATACGACA TGGATTCCGG ATCAGGTCGA TGCGAACGGC AACCCGTTGC CGGTCCGTCT CATGGCGAAG ATCACCGATA GCGACGGGGT ATCGATTATG ACCGCAGCGC GCACCGTATG GTTTCAACGC AGCGGTCGTC AGGTGCGTAT GTACACCTCC TCACACGATC ACGGCGCTTC AGCGCAGCGC GTCTTTGGCG TGCCGCAGAA CTTTGGTCCG CGCAATTCGC AGACCGTGAC ATCGACCCTT ACCATCGCCG ATGATCCAAC CGGCGCCGCC GCCGCACGCA TGATCATCCA TACATGGGCT GGCGATCAGA CGAAAGAAGA GCCGACGGTT CGTGATCGTA TCACGATCAA CGATGTTCCA ATCGTTAATG TCAATGCCCG CGCTCCCGAA ACTCCCACCC CGCCGATTGG CGCCGATCAT CACTGGAGTT TCGACTGGCG CGATGTGCCG GTATCGGCAC TGATCCAGGG CGATAATCTC TTTCGTGTGT GGACGAATCG GCAGGGTCAT CACCTGGAGA TCGCGTGGCC CGGTCCGGCG CTGATGGTCG AATGGGTGGA CGCTCCTTAT GCCCATGACC AGGTGGTTGT CACTCCGGAA GATACGCCGA TAACAATCAC GCTGACCGGA TCGACGCCTT TTGTCAATCC ATTGACGATC GCCCCGATCA GCAATCCTTC ACGTGGAACG CTCAGCGGCG CAGGATCGAC CCGTATCTAT ACGCCGCAGG CACATTACAA CGGAACTGAC CGGTTCGACT TCCGCGTTTC CAGCAGTGAT GGGCAGGACA ACGGAACGAT CTATATCGTC GTTACACCGG TCAACGACGC ACCGGTGCTA AACCCTGCAC CACGAACATT GCCATCCATC GATGAAGATA TCCCTGACGC CAGCAATCCG GGCACAACGG TTGGCGCGCT GCTTACCGAC ATGGTTACCG ATGTCGATTC TGATGCTCTT CCGGCAGGAA TAGCGCTGGT GGGCGCCAGC GGCAACGGGA CATGGCAGTA CTCGCTCGAT GGCGCTGCAT GGCTGGATGT CGGCACGGTA TCGATAACCA GCGCGCTCAC GCTCGAACCA ACCGCCCGGA TCAGGTTCCG TCCCGCACCG AACTTTTTCG GGCAGGCGAG TGTCACCTAC CGCGCATGGG ATCGCAGTGA CGGCGCAGCC AATGGTCAGC GCGGCGTCAA TCCTGGCAAT GGCGGCGGAA CGAGCGCCTA CAGCAGCGCC ACGGCAACGG CAACGATCAC CGTGAATCCG GTCAACGACC CGCCGGAGAT CACACTCTCC AGCAACGTTA CCACGACCCA CATCCTGGCA TTGTTCAATG TGAGCGGATC GGTCGTCGAT ATCGAAGACG AGACATTGAA CGTGACGATC GATTTTGGCG ATGGCGTGTC CAATACCCTC ACGTCGCAGC GCACATTCGA TGTTACCCAT CAGTACCAAC GCAGTGGCGC CTATACTGTC ACCGTGACGG TGCGCGACTC AGCACAGGCA ACTGCCAGCG CGTCGTTCGT TGTGCGCGCT GTGAACGATC CGCCCGAAGT GACGCTCGAC GCGACCACCC CTTCTTCTGT GCATATCTTC ACGCCATTCA GCGGCGCCGG TTCGTTCTCC GACACCGAAA ATGACTCGAT CACACTCCAG ATCGACTTCG GCGATGGCAC GCCTGCGCAA ACGATAACTC CAGCGACGGA TCGGACGTTC ACCTTCAACC ATCGGTATGA ACGAAGCGGA CAGTACACGA TGACGGTCAC CGTGCGCGAC TCAGAATCGA CAACGCGCAC AACATACCCG GTGCGGGTGA TCAATCATCC GCCGGTTGTC ACCATCAGCG CACCAGCCCA GATCAAACGT GACGATCCCT TCGTCGGCGC CGGCTCATTC TCCGATGTCG AAGAGGTTCC GCTGAGCAAC TGGAATGCGA CCGTCGATTA TGGCGATGGG AGCGGACCAC AACCTCTCGC GCTGTCCGAC GAGAAGCGTT TTGTGCTGAA CCACTCCTAT GCGGAAGAGG GGACATACAC CGTCACTGTG CGCGTCTCCG ATCGTGAAGG GGCGATCGGA ACGGCGTCGA TCTCTGTTCG CGTGAGCCGA TACTTCGTCG TGTACGTCCC TATGGTCGTC CGCTAG
|
Protein sequence | MRCNRALVWL GILLIASMLP VSSPTHSTAI AQSAPDGPQP GDVFREYAFP QRLTLCFRPS NSASCNNIPV ERSAGLTFNP AGAVRAELAV EYWGGHIGTR QHFRVNSQDY WQPLPPIQNV PAGARQHCYF RSVLGRTLPL DLSQLRNGEN RFRFTIGHNN QYGQDISCPG NFGWGTTYVY GFIARLYYTP GSIPAPGGTI VSPAQNATIG EMPQITVQAT PPTGRTIARV DVIGEYLDYD WDGNGVLREW QYQMARTQMQ RHIGQAVRDG NVYRVVWNTT WIPDQVDANG NPLPVRLMAK ITDSDGVSIM TAARTVWFQR SGRQVRMYTS SHDHGASAQR VFGVPQNFGP RNSQTVTSTL TIADDPTGAA AARMIIHTWA GDQTKEEPTV RDRITINDVP IVNVNARAPE TPTPPIGADH HWSFDWRDVP VSALIQGDNL FRVWTNRQGH HLEIAWPGPA LMVEWVDAPY AHDQVVVTPE DTPITITLTG STPFVNPLTI APISNPSRGT LSGAGSTRIY TPQAHYNGTD RFDFRVSSSD GQDNGTIYIV VTPVNDAPVL NPAPRTLPSI DEDIPDASNP GTTVGALLTD MVTDVDSDAL PAGIALVGAS GNGTWQYSLD GAAWLDVGTV SITSALTLEP TARIRFRPAP NFFGQASVTY RAWDRSDGAA NGQRGVNPGN GGGTSAYSSA TATATITVNP VNDPPEITLS SNVTTTHILA LFNVSGSVVD IEDETLNVTI DFGDGVSNTL TSQRTFDVTH QYQRSGAYTV TVTVRDSAQA TASASFVVRA VNDPPEVTLD ATTPSSVHIF TPFSGAGSFS DTENDSITLQ IDFGDGTPAQ TITPATDRTF TFNHRYERSG QYTMTVTVRD SESTTRTTYP VRVINHPPVV TISAPAQIKR DDPFVGAGSF SDVEEVPLSN WNATVDYGDG SGPQPLALSD EKRFVLNHSY AEEGTYTVTV RVSDREGAIG TASISVRVSR YFVVYVPMVV R
|
| |