Gene Rcas_2306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2306 
Symbol 
ID5539787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2973974 
End bp2975470 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content61% 
IMG OID640894439 
Productcarboxypeptidase Taq 
Protein accessionYP_001432407 
Protein GI156742278 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.601407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.828222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACGC CTCCTCAACT CACCGAACTC AAAGCGCGTC TCCGCGAGAT CGACGATCTG 
GAGATGGCGG CTGCGCTCCT GAACTGGGAT CAGACGACCT ACATGCCTCC CGGTGGCGCC
GCGGCTCGCG GTCGCCAACT GGCGACACTG GGGCGCATTA TTCACGAGAA GCGGATCGAT
CCGGCGATTG GGCGTTTGCT CGATGCGCTG CGCTCCTACG AAGAGTCGCT CCCGCCTGAT
TCGCCCGATG CTGCGCTCAT TCGGGTGACG CGACGCGACT ATGAGCGCGC GATGCGCGTT
CCCGCCGCGT TCACTGCTGA ACTCTACGAG CACACCGCTG CCAGTTACGA TGTCTGGTCG
CGTGCGCGTC CGGCGAACGA CTTTATGGCG GTTTTGCCCT ATCTGGAGCG CACCCTTGAT
CTCAGCCGCC GCTTTGCGGA GTTCTTTCCC GGCTATGAGC ACATTGCCGA TCCGCTGATC
GATATGGCGG ATTATGGCAT GCGCGCAGCA ACGATCAAAC AGGTCTTCGC GGAACTCCGC
CAGGGGTTGA TCCCGCTGGT CGAACAGATC ACCGTCCAGC CGCCGGTCGA TGACTCTTGC
CTGCGCCAGT TCTTCCCCGA AGCGCAACAG TGGGCGTTTG GCGTTGAAGT CATCACGGCA
TTGGGGTACG ACTTCAGCCG TGGAAGGCAG GATAAGACGT TGCACCCGTT TATGACGAAG
TTCTCGCTGA ACGATGTGCG CATCACCACG CGAGTCGATG AATATGACCT CGGTTCGGCG
CTCTTCAGCA CCATCCACGA AGCCGGGCAC GCAATGTACG AGCAGGGTAT TGCGCAGGCA
TTCGAGGGTA CGCCGCTCGC GTCTGGCACA TCCGCCGGCA TGCACGAGAG TCAGTCGCGT
TTGTGGGAGA ATATCGTTGG GCGCAGCCTC CCCTTCTGGG AGTATTTCTA TCCGCGCCTC
CAGGCGACTT TTCCCGATCA GTTGGGAAAC GTGCCGCTTG AAACGTTCTA TCGGGCGATT
AACAAAGTGC AGCGCTCCCT CATCCGCACT GAAGCCGATG AAGTGACCTA CAACCTGCAC
GTCATTCTGC GTTTCGACCT GGAACTGGCA TTGCTCGAAG GAACGCTCGC CGTGCGCGAC
CTGCCCGAAG CCTGGCGTGA ACGCTATCGC AGCGATCTTG GCGTGGCGCC GCCGGACGAC
CGCGACGGTG TGTTGCAAGA TGTTCACTGG TACGGTGGTC TCATCGGCGG CGCGTTTCAG
GGATATACAC TGGGGAATAT TATGAGCGTG CAACTGTTCG ATGCCGCGCT GCGCGACCAT
CCCGACATCC CGCAGCAGAT TGGCAGCGGC AGGTTCGACA CGCTCCGCGA ATGGATGCGC
GAACATGTCT ACCGTCATGG GCGCGCCCTC GACGCCGACG ACATCCTGCG ACGCGCCACC
GGCAGATCAC TCGATGTGCA GCCGTATCTG GCATACCTGT GGCGCAAATA CGGATGA
 
Protein sequence
MHTPPQLTEL KARLREIDDL EMAAALLNWD QTTYMPPGGA AARGRQLATL GRIIHEKRID 
PAIGRLLDAL RSYEESLPPD SPDAALIRVT RRDYERAMRV PAAFTAELYE HTAASYDVWS
RARPANDFMA VLPYLERTLD LSRRFAEFFP GYEHIADPLI DMADYGMRAA TIKQVFAELR
QGLIPLVEQI TVQPPVDDSC LRQFFPEAQQ WAFGVEVITA LGYDFSRGRQ DKTLHPFMTK
FSLNDVRITT RVDEYDLGSA LFSTIHEAGH AMYEQGIAQA FEGTPLASGT SAGMHESQSR
LWENIVGRSL PFWEYFYPRL QATFPDQLGN VPLETFYRAI NKVQRSLIRT EADEVTYNLH
VILRFDLELA LLEGTLAVRD LPEAWRERYR SDLGVAPPDD RDGVLQDVHW YGGLIGGAFQ
GYTLGNIMSV QLFDAALRDH PDIPQQIGSG RFDTLREWMR EHVYRHGRAL DADDILRRAT
GRSLDVQPYL AYLWRKYG