Gene Rcas_3546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3546 
Symbol 
ID5541047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4624698 
End bp4626008 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content62% 
IMG OID640895665 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001433613 
Protein GI156743484 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.984037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.12163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAT TTCCTGCCAA CCGCGCCAGC ACCATTGCAA CTGCAATTCG CTATACGCTG 
CCGAACGGCA TGGTAGCGCT GGTGCAGCGC AACCCGACCG CTCCAACGGT GAGTGTCTAC
GGCGAGGTGC GTGTTGGAGC AGTGCATGAG CCTGCCGCAC AGAATGGTGT GGCTGCATTC
ACCGGCGCCG CATTGATCCG TGGCACACAG CGACGCAGCT TCCAGGAGAT TGTCGCCACC
ACGGAAGCGG TTGGCGCAAG CGTCAACGCC GGCGGCGGTC TGCACGCCAC CCATTTCGGC
GGGCGATCAT TGAGCGAAGA CCTGGCGCTG ATCCTCGATC TTCTGGCAGA TATGCTGCGC
ACGCCCTCCT TTCCCGACGA AGAAGTCGAG CGCCTGCGCG GTCAGTTTCT GATGATGCTA
CGCGAATATG AGCAGGATAC CTCGGTGCGC GCCTCACGCG CGCTGCGGTC GCTGATGTTT
CCGCCAGCGC ATCCCTACAG TCGCCTGAGC AGCGGCACGA CCGAGACGAT CTCGGCGTTG
ACGCGCGATG ACCTGGTGCG TTTCCACACT CGCTACCACC CGGCAGTCAC AACGATTGCC
GTGGTCGGCG ATATCGAACC GGCTGACGTC ATCGATCTGA TCGAACGGTT CTTCGGCGAC
TGGCAGGCGC CTGGAAATCC GCCCCACATG ACGCTGCCCG ACCTGCAACC GTTGCCCGAT
CAGCGGCGTG TCCACGTCGC CCTCGAAGGA AAGAGTCAGA CGGACGTTAT CTGGGCGGTC
CATGGACTCG ACCGCTGTTC GCCGGATTAC TACGCCGCCA GCGTTGCCAA TATGATCCTG
GGACGCATCG GCATTGGCGG GCGTCTCGGC GAGCGGGTGC GCGAAGAACA GGGGCTTGCC
TATTCCTGCG GCAGCAGCCT CGACGCCGAC CTCGGCGCCG GTCCGTGGGC AGCGATGGCA
GGGGTCAACC CCACACACGT CGAGCGAGCA ATCGCGGCGA TCATTGCCGA AATTAAACAG
TTTGCCGCTG AAGGACCGAC GGAACAGGAA CTTGCCGATG TGCACGACTT TATGACCGGC
AGCCTGGCGA TCAGCCTCGA AACGAATGAC AGCATCGCCG GGACGCTGCT CGGCATCGAA
CGGTATCACC TTGGTCTCGA TTATGTCGAG CGCTATCCGT CGATCATTCG GCGCATCGAC
CGTGAGCAGG TTATGGATGT GGCACGTCGC TATCTGGCGA CCGACAATTA TGTCGTGGTG
ACTGCCGGAC CGGCGGTGGG AGAGGAACAC AATGAGCATA GTAACGGATG A
 
Protein sequence
MNQFPANRAS TIATAIRYTL PNGMVALVQR NPTAPTVSVY GEVRVGAVHE PAAQNGVAAF 
TGAALIRGTQ RRSFQEIVAT TEAVGASVNA GGGLHATHFG GRSLSEDLAL ILDLLADMLR
TPSFPDEEVE RLRGQFLMML REYEQDTSVR ASRALRSLMF PPAHPYSRLS SGTTETISAL
TRDDLVRFHT RYHPAVTTIA VVGDIEPADV IDLIERFFGD WQAPGNPPHM TLPDLQPLPD
QRRVHVALEG KSQTDVIWAV HGLDRCSPDY YAASVANMIL GRIGIGGRLG ERVREEQGLA
YSCGSSLDAD LGAGPWAAMA GVNPTHVERA IAAIIAEIKQ FAAEGPTEQE LADVHDFMTG
SLAISLETND SIAGTLLGIE RYHLGLDYVE RYPSIIRRID REQVMDVARR YLATDNYVVV
TAGPAVGEEH NEHSNG