Gene Information |
Locus tag | Rcas_3157 |
Symbol | |
ID | 5540655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4097716 |
End bp | 4098864 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640895278 |
Product | peptidase M50 |
Protein accession | YP_001433229 |
Protein GI | 156743100 |
COG category | [R] General function prediction only |
COG ID | [COG0517] FOG: CBS domain; [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
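The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which the short sketch below checks. The function names are illustrative only. The sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the record does not state either convention explicitly, although both are consistent with the listed values.

```python
# Values copied from the Gene Information fields above.
START_BP = 4097716
END_BP = 4098864


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```

Both printed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields of the record.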
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATGGT CATTTCGCAT TGCACAGGTC GCCGGCATCG ACATCAAGAT TCATCTGACG TTCTTTTTGA TCGTCATTCT TGGCGCTATC GCTGGCGGAG CATCATATGG CGCAGTCGGC GCAGCATTCG GCGCATTGCT GATCCTGTTG CTGTTTCTCT GTGTGACACT CCACGAATTG GGGCACGGTA TTGCGGCGCG CGCCTTCGGC ATCCCGGTGC GCGAGATCAT TCTGTTGCCG CTCGGCGGTC TGGCATTGCT GGGGCGCAAT CCGTCGAAGG CATGGCACGA ACTGGTCATC GCCGCCGCCG GACCACTGGT GAATGTCATC ATCGCCGCTG TGCTGCTGCT GGTGACGGGA ACGGCGCTGG CCTTCGGCAT TTTTGACCTG AACACACTGG AGATTGGGCG TGGTGCGTTT CCGGCGCCGT CGATCCAGGG GTTAACGCTC TGGCTGTTGC AGGCAAATGT GTTGCTGGTG CTCTTCAACA TGATCCCGGC GTTTCCGCTC GATGGCGGGC GCATTCTGCG CTCGGTACTG GCGATGATCA TCGGTTTTCG CCGCGCTACG CGCATTGCGA CGTTCCTCGG TCAGGGGATT GCTATTGTTC TTGGCATTCT GGGTATTCTC AGCGGCAACT TTTTGCTGGC GCTCGTCGCC GTGTTCATCT TTCTCGGCGC CGGGCAGGAA AATGCCGAGG GGCAGGCGCG CACAATGCTC GACACTATGC GCGTCGGTGA TGCATACAAT CGGCATGCCC TCACGCTCGA TATTGGCGAC CGTGTGAGCA AGGTGGTCGA TTATATTCTG ACCAGTTATC AACCCGACTT CGCCGTTATG CAGAATAGTC GTCTGATCGG TATTGTGACG CGCGAAGATG TGCTGCGTGC CCTGGCGAGC GACACGCGCG ATCTGTACGT CACCGGCATT ATGCAACGTG AGTTTGTGCG CGTCCCAGCG AGCGCCACTC TCGATGAGGT GCGCCAGGTG ATGAGCGCGC AGGGTACGCG CGTTGTGGCA GTGTATGAAG GAGAAGTCTA CCTGGGGCTA GTCAGTATCG AGGACATTTC CGAGGCTTAC GCCGTCCTAT CGTATCTGGA ACGCCAACAA GAAGCGCGCC GCGCTCAAAT GGCGCGTGAC GCAACGTAG
|
Protein sequence | MRWSFRIAQV AGIDIKIHLT FFLIVILGAI AGGASYGAVG AAFGALLILL LFLCVTLHEL GHGIAARAFG IPVREIILLP LGGLALLGRN PSKAWHELVI AAAGPLVNVI IAAVLLLVTG TALAFGIFDL NTLEIGRGAF PAPSIQGLTL WLLQANVLLV LFNMIPAFPL DGGRILRSVL AMIIGFRRAT RIATFLGQGI AIVLGILGIL SGNFLLALVA VFIFLGAGQE NAEGQARTML DTMRVGDAYN RHALTLDIGD RVSKVVDYIL TSYQPDFAVM QNSRLIGIVT REDVLRALAS DTRDLYVTGI MQREFVRVPA SATLDEVRQV MSAQGTRVVA VYEGEVYLGL VSIEDISEAY AVLSYLERQQ EARRAQMARD AT
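The GC content and protein sequence listed in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: `clean`, `gc_fraction`, and `translate` are hypothetical helper names, and the codon table is built from the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons).

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGATGGT CATTTCGCAT TGCACAGGTC"
print(translate(first_blocks))  # MRWSFRIAQV
```

The printed peptide matches the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence fields.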