Gene Rcas_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2274 
Symbol 
ID5539755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2927860 
End bp2931207 
Gene Length3348 bp 
Protein Length1115 aa 
Translation table11 
GC content62% 
IMG OID640894407 
Productprotease domain-containing protein 
Protein accessionYP_001432375 
Protein GI156742246 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0667485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000204116 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAACGCT ACACGACACG CCTGAGTCTG GCAGCGATGA CTTTGTTGCT GTCGCTCATT 
CTGGCAGCGC TGTCGCCGCT GCGCGCCGTT GAGGCGCAAA CCCGTCCAAA TCGCCCAACA
CCCGGAACGC CGCTGACAAT CGAACCGGCG ACCGTGACGA ACCGCGAAGA CCTGGCAGTC
AACAAGAACA TGGCTGTCTC TCGTGATGGC TCGATGGCGA GTGTCTTTCT CAAGATCGAC
TCGCCCTCGC TGGCTACGTT TATGGCGCAA AATGGCATAA CGGATATGAA TGCGCTGGCT
GCACAAAACT ATCTGCGCCA GTTGAACGCC GAACTCGATG CGCTCGTCGC GCGGGCGAAG
CAATTGGTTC CCGGTCTGAG TGTCACCCAT CGTTTTGACC TGATCATCGG CGGCGTTGCG
GTGGTGGCGC CGGTCGGTGA GATCGATAAA CTGCGGCGCC TGCCGAACGT GGTGGACGTG
ATCAACGACC GGATCGAGAA AATCGAAACC TACCGCACTC CCGCTTTCAT TGGCGCAACC
ACTGCATGGG GCAGAGGCGG CGGCTCGGCT TTTGCCGGCG AAGGGGTCAT TTTCGGTGTG
CTCGATAGCG GCGTCTGGCC CGAACACCCT TCGTTCTCCG ATCCCGATCC GCTCGGCAAG
CCATACGCGC CGCCGCCACC TGCTCCGGGC AACCCCGGCG GTGTGCGCGC GTGCGATTTC
GGCAGTGCGA CCCCCGGCGA CGCGCCGTTT GCCTGCAACA ACAAACTGAT CGGCTCCTAC
CGCTTTATGA CCGCCTATGA CTTCTTCGTC GGCACCGAGC CATACGAATT CCGATCCGGT
CGTGATGATG ACGGCCACGG CACCCACACG GCCTCGACCG CAGCCGGCAA TCGCGGCGTG
GCAGCCAGTG ATGGCAGCCG GGTGTTCGGC GTGATCTCCG GCATCGCCCC GCGCGCTTAT
GTCGTCAACT ACAAAGTCTG CGGTGAATTG GGCTGCTTCT CGACCGACTC GGCGGCAGCG
GTGCAGCAGG CGATCCGCGA TGGCGTGCAT GTGATTAACT TCTCGATCAG CGGCGGTACC
AACCCCTACA GCGATATTGC TTCGCTCGCC TTCCTCGACG CCTACAACGC CGGTGTTTTG
GTCTCTGCCT CGGCTGGCAA CTCGGGTCCC GCAGCCGACA CGGTCAACCA TCGTGAGCCG
TGGGTGGCCA CGGTTGGCGC CAGCACGTCG GACCGGTCGT ACCTGAGTAC GCTCACCGTG
CAGGGCAGCA GCGGTACGTT CACCGCCGTC GGCGCATCGA GCGGCGCTGG TATTGCCACA
CCAACACAAA TTGTGGTCAA CAACGCCGAT CCGCTCTGTC TGAACCCGGC GGCGCCGGGT
AGCTTTACCG GGCAGATCGT TGTCTGTCAG CGTGGCGTGA TTGCGCGCGT CGCCAAGAGC
GCCAATGTCG CCGCCGGCGG CGCCGTCGGC ATGATCCTGT ACAATCCGTC GCCGAGCAGC
CTCGATGCCG ATTTCCATGT CATTCCCACG GTTCACATCC AGAATACCGA CGGTACGGCG
CTGCTGGCAT TCCTGACCGC CAACCCCGGC GCGACGGCGA CCTTCACTGC CGGCGCCCCC
GGCTCGATTC AGGGTGATGT GATGGCCGGC TTCAGCTCGC GCGGCGGACC GGGCCAGACG
CTTGGTATCA GTAAGCCCGA CGTTACTGCG CCCGGCGTCA ATATTCTGGC CGGCTACACT
GCGATTGAAT ATGGCACGCC GGTGCCGCAA TTCGCCTTCC TCAGCGGCAC CTCGATGTCC
AGCCCGCATA ACGCCGGCGC CGCCATTTTG CTGAAGTGGC TGCACCCGAC GTGGACCCCA
GGGCAGATCA AGTCGGCGTT GATGACGACA GCCAAAGCGG CCGGAGTGTT CAAGGAGGAT
GGTGTGACGC CGTTCACCCC GTTTGATGCT GGCTCTGGGC GCATTAATCT GCGCAAGGCG
TGGGATCCGG GGTTGACGTT CGATGAAACC GGTGCAAATT ACGTGACGTT GCAGAACGAA
CTGTGGAAGG CAAACTATCC CAGCCTGTAC GTCCCGAAGA TGCCGGGGTT GATCACGGTC
AGCCGCACGG CGCGTGAGGT GTCGGGCTAC GATAGTTTCT ACAAGAGCAC TGTCTCGTAT
CAGGCCGGTC AGCCACGTGA TTTTACGGTA ACGGCGCCAA AAGAGTTCTT CGTTCCGGCC
AATGGCACGT ACACCTTCGA CATTACGGTC GATGCGCGCA ATGTTCCGGT AGGTCAGGTG
CGCCACGCGG TCGTTATCTT CACCGAACGC AACGGTTGTC AGGTCCGCTT CCCGATCACC
ATCGTGCGCG GCGAGCCGGA CATTCGCATG GAGAAGCGGT GCAATCCGGC GACGCTGGCG
CTGCGCGGCA CAACTGACTG CACGATCACG ATCTCGAACA CGACCTTCAG CAATGCAAGC
GTGACGCTCA ATGACACGAT GCCCAGGCAG TTGAAACTGG TAAGCGGCAG CGTGACCGGT
GGGGCAACTG AGATGGGCAA CGGTCTCACC TATAGCGGGA CCTTGACCGG CGCAGCGCCG
CCGGATGTGA GCGCCGGCCG GCTGACCTTT AACGGGTATC TCTCGCTGGC GAGCCTCGGC
GCTTCTCCCA ATGTCTCCCT TGGCGACGAG TCGCTCGTCA ATCTTACCCT CTCGCGTCCG
TTCCTGTTTG GCGGGCAAAG CTACAACACC GTCGCTATGG TCAGCAACGG TTATGCCGTG
GTTGGCGGCG GCACTGCGGC GGATATTTCA TTCGTCAACC GCACTTTCCC CAACTCGGCG
CGCCCGAACA ACACGCTCGG CGCGTTCTGG ACTGACCTCG ACGGCAGCGC TGGCGGCAAC
TACTACGCTT ACCTTGTCGG CTTCGGTCCG TGTAGCAATC CGGCCAATGC CTGCTGGCTG
ATTCTGGAAT GGGAGAATGC GCCAAACTGG AGTGATAGTC AAGTCAATAC GTTCCAGATC
TGGATCGGAC TGAACGGCGT CGAGGACATC ACCTACTCGT ATGGTCCGGT TCTTTCAAGC
GGTGATGGCG GGTTCGTCAC GGTCGGCGCT GAGAATGCGT TTGGCAATCG CGGCGCGAAT
ATCTACAGTA ACGACGGCTT CGGACCGGTT ATCGGCACGA TTCCGACGGC AGGGTCGAAC
GATGTCTACA TCGCAACGAC CCCTGGCGCT CCCGGTCAGA CGGTGACCAT CGGGTTCCGG
GCGCAGGGCG TGCGTGTCGG TCGCTGGACG AACTACGCCG AGTTGACCAG TCCGGCGTTC
TTCGGCACGT ACATTAAGTC GTTCAGCGGC GAAGTTGTGC GACCGTAA
 
Protein sequence
MKRYTTRLSL AAMTLLLSLI LAALSPLRAV EAQTRPNRPT PGTPLTIEPA TVTNREDLAV 
NKNMAVSRDG SMASVFLKID SPSLATFMAQ NGITDMNALA AQNYLRQLNA ELDALVARAK
QLVPGLSVTH RFDLIIGGVA VVAPVGEIDK LRRLPNVVDV INDRIEKIET YRTPAFIGAT
TAWGRGGGSA FAGEGVIFGV LDSGVWPEHP SFSDPDPLGK PYAPPPPAPG NPGGVRACDF
GSATPGDAPF ACNNKLIGSY RFMTAYDFFV GTEPYEFRSG RDDDGHGTHT ASTAAGNRGV
AASDGSRVFG VISGIAPRAY VVNYKVCGEL GCFSTDSAAA VQQAIRDGVH VINFSISGGT
NPYSDIASLA FLDAYNAGVL VSASAGNSGP AADTVNHREP WVATVGASTS DRSYLSTLTV
QGSSGTFTAV GASSGAGIAT PTQIVVNNAD PLCLNPAAPG SFTGQIVVCQ RGVIARVAKS
ANVAAGGAVG MILYNPSPSS LDADFHVIPT VHIQNTDGTA LLAFLTANPG ATATFTAGAP
GSIQGDVMAG FSSRGGPGQT LGISKPDVTA PGVNILAGYT AIEYGTPVPQ FAFLSGTSMS
SPHNAGAAIL LKWLHPTWTP GQIKSALMTT AKAAGVFKED GVTPFTPFDA GSGRINLRKA
WDPGLTFDET GANYVTLQNE LWKANYPSLY VPKMPGLITV SRTAREVSGY DSFYKSTVSY
QAGQPRDFTV TAPKEFFVPA NGTYTFDITV DARNVPVGQV RHAVVIFTER NGCQVRFPIT
IVRGEPDIRM EKRCNPATLA LRGTTDCTIT ISNTTFSNAS VTLNDTMPRQ LKLVSGSVTG
GATEMGNGLT YSGTLTGAAP PDVSAGRLTF NGYLSLASLG ASPNVSLGDE SLVNLTLSRP
FLFGGQSYNT VAMVSNGYAV VGGGTAADIS FVNRTFPNSA RPNNTLGAFW TDLDGSAGGN
YYAYLVGFGP CSNPANACWL ILEWENAPNW SDSQVNTFQI WIGLNGVEDI TYSYGPVLSS
GDGGFVTVGA ENAFGNRGAN IYSNDGFGPV IGTIPTAGSN DVYIATTPGA PGQTVTIGFR
AQGVRVGRWT NYAELTSPAF FGTYIKSFSG EVVRP