Gene Rcas_2641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2641 
Symbol 
ID5540123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3403522 
End bp3405582 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content59% 
IMG OID640894764 
Productoligopeptidase B 
Protein accessionYP_001432731 
Protein GI156742602 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACAC CACCTCTACC GCTCAGGCGA CCACACGTGG TGTCGCTCCA TGGTGATACG 
CTTATCGACG ATTACTTCTG GATGCGTGAG CGTGATAATC CTGCGGTGAT TGCCCATCTC
GAAGCCGAAA ACCGCTACAC GGAGGAGATG ATGGCGCACA CCGCCGCGCT GCGTGAGCGC
CTGTACAACG AAATGCGCAG CCGCGTGAAG GAAGACGATG AAAGCGTCCC AGAGCGTTTT
GGGGCATTTG TGTACTATAG CCGCACTCAG ACAGGTCAAC AGTATCCGGT GGTCTACCGC
CGCGCTATCG GCGCAGACGA TGAAGAACTG CTACTCGATA TCAATGCGCT TGCCGGGGGA
TACCCATTCA CCCGAATTGG GGTCTTTTTG CCAACATACG ACGGACGCCT GCTCGCCTAT
TCGGTCGATG TAGAAGGTTC GGAAACATAC ACGCTCTACC TCAAAGACCT GACGACCGGC
ATATTGCTCG ACCAGCCCAT TGTGAACACA TACTACGGCG CCGCCTGGAG CAGCGATGGA
CGGTGCCTGT TCTACACCAC GCTCGATGAT GCCAGGCGTC CATATCGGGT CTATCGCCAC
GTTATTGGGA GCGACTCAGC GGATGATGTG CTGGTGTACG AAGAGACGGA CCCGCTCTTC
CATGTCAGTC TGTCGCTCAC GCGCAGTCGC GCATACATCC TGATTGCGTC ACACAGCAAC
ACCACGTCGG AAGTGCGCGC TCTTCCCGCC GATGCGCCGA TGAATGCACC CCGTCTATTG
TTGCCGCGTC GTCATAGAAT CGAGTATACG GCACATCACC ACGGCGATCA TTTCTACTTT
CTGACGAATG ATGGTGCGCT CAACTTCCGC GTGGTGCGCG CGCTGGCGAA CGACCTACAC
CCTGACCGTC TGGAGGAGGT CGTGCCGCAC CGCGCTGATG TGATGATTGA CGCTATCGAC
CTGTTTGCGG ATCATCTGGT CGCATACGAG CGGTCCAATG CGCAAGAGCG TGTCGCGATC
ATCGATCTGC ACAGCGGAGA AACCCACCTG CTGACCTTCC CGGAACCGGT GTACACCCTG
CAACCGTGGG ACAGGGATGC TCTTTGGGCG CCGAACCTGG AGTTCGATAC GACCGTATTG
CGCCTGCACG TGATGTCGCT CACACAACCG CGCACCATCT ATGAGTACGA TATGACCGCG
CAAACGCTGA CGCTCCTGAA GCGTGACGAT GTGCCAGGCT ACGATTCATC GCGCTACCGC
AGCGAACGGT TGTGGGCAAC CGCCAACGAC GGTGCGCGCG TACCGATTTC GCTCGTCTAT
CGCGCTGATG TGAACTGCCC GGCGCCGCTG CTCCTGTATG GCTACGGTTC ATACGGCGCC
ACTGCCGACC CGCGTTTCTC GATTGAGCGG ATCAGCCTGC TGGATCGGGG CGTGATTTTT
GCAATTGCGC ACATACGCGG TGGTGGGGAA TTGGGGCGTG CCTGGTACGA AGCGGGCAAG
ATGCTGCACA AGCGCAACAC CTTCACCGAT TTCATCGCCT GCGCTGAATA CCTGATCGCC
GAAGGCTACA CCACCCCCAA GCAACTGGCG ATCATGGGGC GCAGCGCCGG CGGGCTGCTG
GTCGGCGCGA TTGTCACCAT GCGCCCTGAT CTGATGCGGT GCGCGGTCGC CGATGTGCCG
TTCGTCGATG TTGTGAACAC CATGCTCGAT CCATCGATCC CGCTGACCGC CATCGAATTC
GAGGAGTGGG GGAATCCTGC GAATGCAGAA CAATACGCAT ATATGCAATC ATATTCGCCG
TATGATAACA CGACTCCACG CGCCTATCCG GCAATCCTGG CGACTGCCGG CCTGCACGAT
CCACGCGTGC AGTATTGGGA ACCCGCAAAA TGGGTTGCGA AACTGCGCGA TGTCAAAACC
AACGATGCGC CGGTGCTGTT GAAAACGGAC ATGACCGCCG GGCACGCCGG TCCTTCAGGG
CGCTACGACC GCCTGCGCGA AACCGCGTTC GAGTATGCCT TCCTGCTCGA TTGCCTGGGT
CTGGCATCAG AAATGTGCTG A
 
Protein sequence
MPTPPLPLRR PHVVSLHGDT LIDDYFWMRE RDNPAVIAHL EAENRYTEEM MAHTAALRER 
LYNEMRSRVK EDDESVPERF GAFVYYSRTQ TGQQYPVVYR RAIGADDEEL LLDINALAGG
YPFTRIGVFL PTYDGRLLAY SVDVEGSETY TLYLKDLTTG ILLDQPIVNT YYGAAWSSDG
RCLFYTTLDD ARRPYRVYRH VIGSDSADDV LVYEETDPLF HVSLSLTRSR AYILIASHSN
TTSEVRALPA DAPMNAPRLL LPRRHRIEYT AHHHGDHFYF LTNDGALNFR VVRALANDLH
PDRLEEVVPH RADVMIDAID LFADHLVAYE RSNAQERVAI IDLHSGETHL LTFPEPVYTL
QPWDRDALWA PNLEFDTTVL RLHVMSLTQP RTIYEYDMTA QTLTLLKRDD VPGYDSSRYR
SERLWATAND GARVPISLVY RADVNCPAPL LLYGYGSYGA TADPRFSIER ISLLDRGVIF
AIAHIRGGGE LGRAWYEAGK MLHKRNTFTD FIACAEYLIA EGYTTPKQLA IMGRSAGGLL
VGAIVTMRPD LMRCAVADVP FVDVVNTMLD PSIPLTAIEF EEWGNPANAE QYAYMQSYSP
YDNTTPRAYP AILATAGLHD PRVQYWEPAK WVAKLRDVKT NDAPVLLKTD MTAGHAGPSG
RYDRLRETAF EYAFLLDCLG LASEMC