Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0471 |
Symbol | |
ID | 5537934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 608494 |
End bp | 611673 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892634 |
Product | hypothetical protein |
Protein accession | YP_001430620 |
Protein GI | 156740491 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.128084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGCA ATATTGCTGT TCTCTGGCTT CTTGGTTTCC TCGGTGGTCT TTTGGTATTC GCTCCCCCAA CAGGACGCAC ACCGGTCGCT CACGCGACAG TTCCGAACGA ACAGAACTGG ACGGTGACGC CTGGCACATG CGATCCGTCG TTTCAAGGCG CCGTCTTTGC TGCGTGGAAT CCGGCATCGG GCAATCCCAA CTGTGGCGTC TTCACCTCTG GCGCGCCCTC GCTTAATCAG TTCGATACCG GTCAAGTTTT TCCTCCGAAT GTGCTCCGGG ATGAAGCGAG TGTCACTGCA CCATGTGAAA ATGGTCGCAC CGCCGGGCTG TGTTACCGAA TGTGGTACGT CGGCACGCGC TCCGGCGAGC CGTATCGCCG GATTGGATAC GCTGTCTCGC CGGATGGCGT CTCCTGGTAT CGCGTGCAGG GTCCGCACAC CGGCGGCAGC GTCTTCGAGG GGTCGGGACA GCCGGGCAGT TTCGATGAGA ATGGCGCGAC CACGTTTCAC GTGATCAAAG ATGGCGGGGA GTACCGTATG TGGTACACCG GCGTCAACAG CAGTGGGACC TGGAGAGGTT TCGGCTATGC AACCTCGAAC AATGGCATTA CCTGGACGCG ACAAAACGAC GGGCTGCCGG TGTTGACTCG GCGCCTGGGA TCGGGTTTGT TCGACGATGA CCGCATTATC GGACCGTTTG TGTTAATTGA TGAGGCAAGC GCCACTGCGC CGTGTGAAAG CGGTCGCGCG AACGGGCGCT GCTTCCGCAT GTGGTATGAG GGATTCCGGG CGGATAATAA CTTTTACATC GGACATGCAT TGTCGCCTGA TGGCATTAAC TGGACGATTG TTAATGGACC TGACGAACTC GGCTCGGTTC TCTCCAATTC GGGCGGATTT ACCGCCTTTG ATTCTAATGA TGTCGGCTTG ACTGCGGTCA TCAAAGATGG CGCGATCTAC CGCATGTGGT ACCAGGCAAA AGATTACAAC ACGCCGGACA CCTTCAGAAT CGGGCATGTC ACTTCGGTGA ATGGGGTCAA CTGGGTGCGT CCCGATCCGA ATGATCCTGC GTTCTATGGC GGCTTAGACA CGATCAATCT TCCCGGAACC AACGATGATG TGTGGGTCGT CCGTCTTTTG AAGGAAGACC TGACCTACCG TATGTGGTAC GCCACGGCGG GCACGCCTAA CAGCACCCGT TTTGGTCTGG TCGAGATGAC GCAGGGCGTG CCGATTACGC CAACGGTGCG GCGCAGCGGT GATGAGTTCA GGATTGAGTT CAACACACAG CGAACGATAC CGGTTAGTGG CAGTGTGTTG ATCACCCTGC CGCCGGGCGT TTCGCTCGAC CAGTTTTCGG TCATCGAATT GCAGGGATTT GAACCTGGCG CGGTCCTTGC GCGCGAGCGC GGCGCGATTA CCGATGCGTA CAGCGGCTTC TCGGCGCGCG ATGCGCTGCT GCTGCGCCTG CCAAATGGCG CGGTTCCAGG ACCAAAGGTG ATCCGCTTTA GCCTTGGCGC AGAGGCGCCC AATCCTGCCT ATCTTCTGCT CCAGACCTTC GACACCCACA AGGTGCTCGA ACGGGCGCGC GTCAATCTGG GAGATCTGCG AATCACACAG AGTGTGGGAA CAGTCGTTGC GGGCGCATCG GTCGTTTACA CCGTCACGGT CAGTAATGTC GGTCCGAATG CGGTCTCGAA CGCCTTGCTG AACAGCGTCT TCCCAACGCA ACTGACGGGA ATCACCTGGA CCTGCGCTCC ATCGGGAGGC GCGTCGTGCG CTCCGGGAAG CGGCAGCGGC AACTGGAGCA GTAAGCCGCT CGATCTCCCA TCGGGCAGCA GTGTCACGTT TACGGTGACC GGCAATCTGC CGCCTGCGGA AACCGGTAAC CTGACGGATA CTGTCAGCGT TTCCACACCG GCTGTGCTCA ATGAACTGAC GCCTGGCGAT AATGTCTCGA CTCTGGTGAC GCCGATTGAG GTGCGCGGCG ATCTGTCGAT CACACGTGCC AGCAATCCGG TGGTTCCGCA GGCGGGTCAA CCGATCACCT ATACGCTGAC CGTGGCGAAC AGCGGACCGA GCACCGTCGT CGGCGCGAAT GTGGTCAATA TGTTCCCGAT CAGTGTGACG AACGTCATCT GGAATTGCAG CGCCACATCC GGTTCTGTCT GCCCGGCGCC GGGCAGCGGA AACATGAGTG CAGCGGTCAC CCTGGCACCT GGCGGCATTG CCACGTTCAC TGCAACCGGT ATGGTGTCGC CATCTGCCGT TGTGATGCCG CCGCACTCGG CGATGGTGAC CGTGCCGGGG AATGTGACAG ATCCCAACCC GGACAATAAT GTGTTCACCG ATGGCGGCGG ATTGGGGCGG TCCGCCGATC TGGCGATCAG CAAGGCGGTT GCGCCCGCAA CGGTTGTTCC CGGTCTTCCG GTCACCTACA CGATCACCGT GACCAACACC GGTGCTGCCG ACGCAGATGG CGCAACGGTG CTCGACCTGT TCCCGCCGAC GATCACGAAT GTGACGTGGA CGTGCAGCGG CGCGGCCGGC GCAGGTTGTG CGCAGGCAAG CGGCAGCGGC GATCTGATGG TGACCCTGTC ATCATTCCCC GTGGGCGGAT CGGCAACGAT CACGATGACC GGCATCGTAG CGGCGCAGGC GACCGGGAAC CTGATCAACA CAGCGCAGGT GCTGCCGCCG GTTGGGGTTG AAGACCCGGC GTTTGCCAAT AACAGCGCGT CGATTTCGAG TGTGCTCCAA CCGCGCACCG ACCTGTCCAT TGCGCAGACG ACGCCATCCC ACGCAGTTGT GGGGCAGACC ATCACCTATA CCATCACGGT GCAGAACAAC GGTCCAAGTG TTGCTGCCGG CGCGCGCGTC AGCACGACGA CCCCGGCGCA TGTTGTGGTG ACGGGATGGG TGTGCGCGGC GTCGGCGGGG TCGCAGTGTG GCGCAGCGAG CGGCGCCGCG CCGGTAGACG ACGTCGTAAC GCTGGCGCCC GGCGGCGCGA TCACCTACAC GGTCACCGGC ACGGTTTTCA ATCGGGCTGT GGGACAACTT CCCTTCTCCG GCGCCGTCGT TGCGCCGGCA AGCGCCGAAG ACCCGGTGCT GACGAACAAC CAGGCGCAGA GTTCAACCCA GGCGCTGTAT GTGGTGACGC TGCCGGTAGT TGTGCGGTGA
|
Protein sequence | MRRNIAVLWL LGFLGGLLVF APPTGRTPVA HATVPNEQNW TVTPGTCDPS FQGAVFAAWN PASGNPNCGV FTSGAPSLNQ FDTGQVFPPN VLRDEASVTA PCENGRTAGL CYRMWYVGTR SGEPYRRIGY AVSPDGVSWY RVQGPHTGGS VFEGSGQPGS FDENGATTFH VIKDGGEYRM WYTGVNSSGT WRGFGYATSN NGITWTRQND GLPVLTRRLG SGLFDDDRII GPFVLIDEAS ATAPCESGRA NGRCFRMWYE GFRADNNFYI GHALSPDGIN WTIVNGPDEL GSVLSNSGGF TAFDSNDVGL TAVIKDGAIY RMWYQAKDYN TPDTFRIGHV TSVNGVNWVR PDPNDPAFYG GLDTINLPGT NDDVWVVRLL KEDLTYRMWY ATAGTPNSTR FGLVEMTQGV PITPTVRRSG DEFRIEFNTQ RTIPVSGSVL ITLPPGVSLD QFSVIELQGF EPGAVLARER GAITDAYSGF SARDALLLRL PNGAVPGPKV IRFSLGAEAP NPAYLLLQTF DTHKVLERAR VNLGDLRITQ SVGTVVAGAS VVYTVTVSNV GPNAVSNALL NSVFPTQLTG ITWTCAPSGG ASCAPGSGSG NWSSKPLDLP SGSSVTFTVT GNLPPAETGN LTDTVSVSTP AVLNELTPGD NVSTLVTPIE VRGDLSITRA SNPVVPQAGQ PITYTLTVAN SGPSTVVGAN VVNMFPISVT NVIWNCSATS GSVCPAPGSG NMSAAVTLAP GGIATFTATG MVSPSAVVMP PHSAMVTVPG NVTDPNPDNN VFTDGGGLGR SADLAISKAV APATVVPGLP VTYTITVTNT GAADADGATV LDLFPPTITN VTWTCSGAAG AGCAQASGSG DLMVTLSSFP VGGSATITMT GIVAAQATGN LINTAQVLPP VGVEDPAFAN NSASISSVLQ PRTDLSIAQT TPSHAVVGQT ITYTITVQNN GPSVAAGARV STTTPAHVVV TGWVCAASAG SQCGAASGAA PVDDVVTLAP GGAITYTVTG TVFNRAVGQL PFSGAVVAPA SAEDPVLTNN QAQSSTQALY VVTLPVVVR
|
| |