Gene Rcas_3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3157 
Symbol 
ID5540655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4097716 
End bp4098864 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content58% 
IMG OID640895278 
Productpeptidase M50 
Protein accessionYP_001433229 
Protein GI156743100 
COG category[R] General function prediction only 
COG ID[COG0517] FOG: CBS domain
[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATGGT CATTTCGCAT TGCACAGGTC GCCGGCATCG ACATCAAGAT TCATCTGACG 
TTCTTTTTGA TCGTCATTCT TGGCGCTATC GCTGGCGGAG CATCATATGG CGCAGTCGGC
GCAGCATTCG GCGCATTGCT GATCCTGTTG CTGTTTCTCT GTGTGACACT CCACGAATTG
GGGCACGGTA TTGCGGCGCG CGCCTTCGGC ATCCCGGTGC GCGAGATCAT TCTGTTGCCG
CTCGGCGGTC TGGCATTGCT GGGGCGCAAT CCGTCGAAGG CATGGCACGA ACTGGTCATC
GCCGCCGCCG GACCACTGGT GAATGTCATC ATCGCCGCTG TGCTGCTGCT GGTGACGGGA
ACGGCGCTGG CCTTCGGCAT TTTTGACCTG AACACACTGG AGATTGGGCG TGGTGCGTTT
CCGGCGCCGT CGATCCAGGG GTTAACGCTC TGGCTGTTGC AGGCAAATGT GTTGCTGGTG
CTCTTCAACA TGATCCCGGC GTTTCCGCTC GATGGCGGGC GCATTCTGCG CTCGGTACTG
GCGATGATCA TCGGTTTTCG CCGCGCTACG CGCATTGCGA CGTTCCTCGG TCAGGGGATT
GCTATTGTTC TTGGCATTCT GGGTATTCTC AGCGGCAACT TTTTGCTGGC GCTCGTCGCC
GTGTTCATCT TTCTCGGCGC CGGGCAGGAA AATGCCGAGG GGCAGGCGCG CACAATGCTC
GACACTATGC GCGTCGGTGA TGCATACAAT CGGCATGCCC TCACGCTCGA TATTGGCGAC
CGTGTGAGCA AGGTGGTCGA TTATATTCTG ACCAGTTATC AACCCGACTT CGCCGTTATG
CAGAATAGTC GTCTGATCGG TATTGTGACG CGCGAAGATG TGCTGCGTGC CCTGGCGAGC
GACACGCGCG ATCTGTACGT CACCGGCATT ATGCAACGTG AGTTTGTGCG CGTCCCAGCG
AGCGCCACTC TCGATGAGGT GCGCCAGGTG ATGAGCGCGC AGGGTACGCG CGTTGTGGCA
GTGTATGAAG GAGAAGTCTA CCTGGGGCTA GTCAGTATCG AGGACATTTC CGAGGCTTAC
GCCGTCCTAT CGTATCTGGA ACGCCAACAA GAAGCGCGCC GCGCTCAAAT GGCGCGTGAC
GCAACGTAG
 
Protein sequence
MRWSFRIAQV AGIDIKIHLT FFLIVILGAI AGGASYGAVG AAFGALLILL LFLCVTLHEL 
GHGIAARAFG IPVREIILLP LGGLALLGRN PSKAWHELVI AAAGPLVNVI IAAVLLLVTG
TALAFGIFDL NTLEIGRGAF PAPSIQGLTL WLLQANVLLV LFNMIPAFPL DGGRILRSVL
AMIIGFRRAT RIATFLGQGI AIVLGILGIL SGNFLLALVA VFIFLGAGQE NAEGQARTML
DTMRVGDAYN RHALTLDIGD RVSKVVDYIL TSYQPDFAVM QNSRLIGIVT REDVLRALAS
DTRDLYVTGI MQREFVRVPA SATLDEVRQV MSAQGTRVVA VYEGEVYLGL VSIEDISEAY
AVLSYLERQQ EARRAQMARD AT