Gene Information |
Locus tag | Rcas_2274 |
Symbol | |
ID | 5539755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2927860 |
End bp | 2931207 |
Gene Length | 3348 bp |
Protein Length | 1115 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894407 |
Product | protease domain-containing protein |
Protein accession | YP_001432375 |
Protein GI | 156742246 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
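The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon while Protein Length does not. The record does not state these conventions explicitly, though the numbers agree with them. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2927860
end_bp = 2931207
gene_length_bp = 3348
protein_length_aa = 1115


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the ones listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, the 3348 bp span corresponds to 1116 codons, the last of which is the stop codon, leaving the 1115 residues listed as the protein length.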
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0667485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000204116 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAACGCT ACACGACACG CCTGAGTCTG GCAGCGATGA CTTTGTTGCT GTCGCTCATT CTGGCAGCGC TGTCGCCGCT GCGCGCCGTT GAGGCGCAAA CCCGTCCAAA TCGCCCAACA CCCGGAACGC CGCTGACAAT CGAACCGGCG ACCGTGACGA ACCGCGAAGA CCTGGCAGTC AACAAGAACA TGGCTGTCTC TCGTGATGGC TCGATGGCGA GTGTCTTTCT CAAGATCGAC TCGCCCTCGC TGGCTACGTT TATGGCGCAA AATGGCATAA CGGATATGAA TGCGCTGGCT GCACAAAACT ATCTGCGCCA GTTGAACGCC GAACTCGATG CGCTCGTCGC GCGGGCGAAG CAATTGGTTC CCGGTCTGAG TGTCACCCAT CGTTTTGACC TGATCATCGG CGGCGTTGCG GTGGTGGCGC CGGTCGGTGA GATCGATAAA CTGCGGCGCC TGCCGAACGT GGTGGACGTG ATCAACGACC GGATCGAGAA AATCGAAACC TACCGCACTC CCGCTTTCAT TGGCGCAACC ACTGCATGGG GCAGAGGCGG CGGCTCGGCT TTTGCCGGCG AAGGGGTCAT TTTCGGTGTG CTCGATAGCG GCGTCTGGCC CGAACACCCT TCGTTCTCCG ATCCCGATCC GCTCGGCAAG CCATACGCGC CGCCGCCACC TGCTCCGGGC AACCCCGGCG GTGTGCGCGC GTGCGATTTC GGCAGTGCGA CCCCCGGCGA CGCGCCGTTT GCCTGCAACA ACAAACTGAT CGGCTCCTAC CGCTTTATGA CCGCCTATGA CTTCTTCGTC GGCACCGAGC CATACGAATT CCGATCCGGT CGTGATGATG ACGGCCACGG CACCCACACG GCCTCGACCG CAGCCGGCAA TCGCGGCGTG GCAGCCAGTG ATGGCAGCCG GGTGTTCGGC GTGATCTCCG GCATCGCCCC GCGCGCTTAT GTCGTCAACT ACAAAGTCTG CGGTGAATTG GGCTGCTTCT CGACCGACTC GGCGGCAGCG GTGCAGCAGG CGATCCGCGA TGGCGTGCAT GTGATTAACT TCTCGATCAG CGGCGGTACC AACCCCTACA GCGATATTGC TTCGCTCGCC TTCCTCGACG CCTACAACGC CGGTGTTTTG GTCTCTGCCT CGGCTGGCAA CTCGGGTCCC GCAGCCGACA CGGTCAACCA TCGTGAGCCG TGGGTGGCCA CGGTTGGCGC CAGCACGTCG GACCGGTCGT ACCTGAGTAC GCTCACCGTG CAGGGCAGCA GCGGTACGTT CACCGCCGTC GGCGCATCGA GCGGCGCTGG TATTGCCACA CCAACACAAA TTGTGGTCAA CAACGCCGAT CCGCTCTGTC TGAACCCGGC GGCGCCGGGT AGCTTTACCG GGCAGATCGT TGTCTGTCAG CGTGGCGTGA TTGCGCGCGT CGCCAAGAGC GCCAATGTCG CCGCCGGCGG CGCCGTCGGC ATGATCCTGT ACAATCCGTC GCCGAGCAGC CTCGATGCCG ATTTCCATGT CATTCCCACG GTTCACATCC AGAATACCGA CGGTACGGCG CTGCTGGCAT TCCTGACCGC CAACCCCGGC GCGACGGCGA CCTTCACTGC CGGCGCCCCC GGCTCGATTC AGGGTGATGT GATGGCCGGC TTCAGCTCGC GCGGCGGACC GGGCCAGACG CTTGGTATCA GTAAGCCCGA CGTTACTGCG CCCGGCGTCA ATATTCTGGC CGGCTACACT GCGATTGAAT ATGGCACGCC GGTGCCGCAA TTCGCCTTCC TCAGCGGCAC CTCGATGTCC 
AGCCCGCATA ACGCCGGCGC CGCCATTTTG CTGAAGTGGC TGCACCCGAC GTGGACCCCA GGGCAGATCA AGTCGGCGTT GATGACGACA GCCAAAGCGG CCGGAGTGTT CAAGGAGGAT GGTGTGACGC CGTTCACCCC GTTTGATGCT GGCTCTGGGC GCATTAATCT GCGCAAGGCG TGGGATCCGG GGTTGACGTT CGATGAAACC GGTGCAAATT ACGTGACGTT GCAGAACGAA CTGTGGAAGG CAAACTATCC CAGCCTGTAC GTCCCGAAGA TGCCGGGGTT GATCACGGTC AGCCGCACGG CGCGTGAGGT GTCGGGCTAC GATAGTTTCT ACAAGAGCAC TGTCTCGTAT CAGGCCGGTC AGCCACGTGA TTTTACGGTA ACGGCGCCAA AAGAGTTCTT CGTTCCGGCC AATGGCACGT ACACCTTCGA CATTACGGTC GATGCGCGCA ATGTTCCGGT AGGTCAGGTG CGCCACGCGG TCGTTATCTT CACCGAACGC AACGGTTGTC AGGTCCGCTT CCCGATCACC ATCGTGCGCG GCGAGCCGGA CATTCGCATG GAGAAGCGGT GCAATCCGGC GACGCTGGCG CTGCGCGGCA CAACTGACTG CACGATCACG ATCTCGAACA CGACCTTCAG CAATGCAAGC GTGACGCTCA ATGACACGAT GCCCAGGCAG TTGAAACTGG TAAGCGGCAG CGTGACCGGT GGGGCAACTG AGATGGGCAA CGGTCTCACC TATAGCGGGA CCTTGACCGG CGCAGCGCCG CCGGATGTGA GCGCCGGCCG GCTGACCTTT AACGGGTATC TCTCGCTGGC GAGCCTCGGC GCTTCTCCCA ATGTCTCCCT TGGCGACGAG TCGCTCGTCA ATCTTACCCT CTCGCGTCCG TTCCTGTTTG GCGGGCAAAG CTACAACACC GTCGCTATGG TCAGCAACGG TTATGCCGTG GTTGGCGGCG GCACTGCGGC GGATATTTCA TTCGTCAACC GCACTTTCCC CAACTCGGCG CGCCCGAACA ACACGCTCGG CGCGTTCTGG ACTGACCTCG ACGGCAGCGC TGGCGGCAAC TACTACGCTT ACCTTGTCGG CTTCGGTCCG TGTAGCAATC CGGCCAATGC CTGCTGGCTG ATTCTGGAAT GGGAGAATGC GCCAAACTGG AGTGATAGTC AAGTCAATAC GTTCCAGATC TGGATCGGAC TGAACGGCGT CGAGGACATC ACCTACTCGT ATGGTCCGGT TCTTTCAAGC GGTGATGGCG GGTTCGTCAC GGTCGGCGCT GAGAATGCGT TTGGCAATCG CGGCGCGAAT ATCTACAGTA ACGACGGCTT CGGACCGGTT ATCGGCACGA TTCCGACGGC AGGGTCGAAC GATGTCTACA TCGCAACGAC CCCTGGCGCT CCCGGTCAGA CGGTGACCAT CGGGTTCCGG GCGCAGGGCG TGCGTGTCGG TCGCTGGACG AACTACGCCG AGTTGACCAG TCCGGCGTTC TTCGGCACGT ACATTAAGTC GTTCAGCGGC GAAGTTGTGC GACCGTAA
|
Protein sequence | MKRYTTRLSL AAMTLLLSLI LAALSPLRAV EAQTRPNRPT PGTPLTIEPA TVTNREDLAV NKNMAVSRDG SMASVFLKID SPSLATFMAQ NGITDMNALA AQNYLRQLNA ELDALVARAK QLVPGLSVTH RFDLIIGGVA VVAPVGEIDK LRRLPNVVDV INDRIEKIET YRTPAFIGAT TAWGRGGGSA FAGEGVIFGV LDSGVWPEHP SFSDPDPLGK PYAPPPPAPG NPGGVRACDF GSATPGDAPF ACNNKLIGSY RFMTAYDFFV GTEPYEFRSG RDDDGHGTHT ASTAAGNRGV AASDGSRVFG VISGIAPRAY VVNYKVCGEL GCFSTDSAAA VQQAIRDGVH VINFSISGGT NPYSDIASLA FLDAYNAGVL VSASAGNSGP AADTVNHREP WVATVGASTS DRSYLSTLTV QGSSGTFTAV GASSGAGIAT PTQIVVNNAD PLCLNPAAPG SFTGQIVVCQ RGVIARVAKS ANVAAGGAVG MILYNPSPSS LDADFHVIPT VHIQNTDGTA LLAFLTANPG ATATFTAGAP GSIQGDVMAG FSSRGGPGQT LGISKPDVTA PGVNILAGYT AIEYGTPVPQ FAFLSGTSMS SPHNAGAAIL LKWLHPTWTP GQIKSALMTT AKAAGVFKED GVTPFTPFDA GSGRINLRKA WDPGLTFDET GANYVTLQNE LWKANYPSLY VPKMPGLITV SRTAREVSGY DSFYKSTVSY QAGQPRDFTV TAPKEFFVPA NGTYTFDITV DARNVPVGQV RHAVVIFTER NGCQVRFPIT IVRGEPDIRM EKRCNPATLA LRGTTDCTIT ISNTTFSNAS VTLNDTMPRQ LKLVSGSVTG GATEMGNGLT YSGTLTGAAP PDVSAGRLTF NGYLSLASLG ASPNVSLGDE SLVNLTLSRP FLFGGQSYNT VAMVSNGYAV VGGGTAADIS FVNRTFPNSA RPNNTLGAFW TDLDGSAGGN YYAYLVGFGP CSNPANACWL ILEWENAPNW SDSQVNTFQI WIGLNGVEDI TYSYGPVLSS GDGGFVTVGA ENAFGNRGAN IYSNDGFGPV IGTIPTAGSN DVYIATTPGA PGQTVTIGFR AQGVRVGRWT NYAELTSPAF FGTYIKSFSG EVVRP
|
| |