Gene Rcas_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1469 
Symbol 
ID5538943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1875883 
End bp1878294 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content62% 
IMG OID640893607 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_001431582 
Protein GI156741453 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.169908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTGC TGAAACGCCT GCTTCTCTGG ATGGTGCTGG TCCTGGTCAT CGTGACCGCG 
CTTGGCGCTG GCGGCGGGTA TCTATGGTTG CGCCGCTCGC TGCCACAGAC AAGCGGTGAG
ATTCGGGTCA GCGGGATCAG CGGCCCGGTG ACGATTGTGC GCGATAGCGA TGGCGTGGCG
CACATTACCG GCGCAACTGA GACGGATGCT GCGTTCGGGT TGGGGTTCGT TCATGCACAG
GAACGGCTCT GGCAGATGGA GGTGCAGCGC CGTATCGGTC ATGGTCGACT TTCCGAGGTC
TTCGGCGCAA CCACGCTCAA CACCGACCGC TTCCTGCGCG CGCTTGGCGT GGCGCGCGCT
GCGCGCCGCG CCCTTGAACG GCTCGATGCC AACACTATTG CCATCCTTGA AGCCTACGCA
GCTGGTGTCA ACGCCTTTCT GTCCACCAAT CCGACACTGC CGCCGGAATT CCTCATCCTC
GGCGTTCAGC CTGAACCGTG GCAACCGGTC GACTCGCTCG TCTGGGCGAA AATGATGGCG
TGGGACCTCG GCGGCAACTG GAACGATGAG GTGATGCGCG CGCTGCTCAT TGCGCAGATT
GGACCGGAGG ACGCCGATTT TCTGATGCCC GCCTACACGA CTGATGGACC GTTGATCCTG
CCGGACGCGG CGTTGGTCCG TCCCGCAGCA GCATCCACCC CACCGGACGC TGCGATCCAG
CCCGACACGG CGCGCGCGAT GCTCGATCTT TGGGAAACAG TGCATGCGAC GACTGGATTG
GGTGACCGGC TGGCCGGCTC AAACAACTGG GTAATCGGCG GCGCGCGCAC CGCGAGCGGC
AAGCCGCTGC TGGTCAACGA TCCGCATCTC GGCAACCGCA TCCCTTCGAT CTGGTATCTG
GCGCATATGC AGGGCGGCGC GATCAATTCC ATCGGCGCCA CCTTCCCCGG TCTGCCGGCA
GTCGTTATTG GCTACAATGA GCGGATCGCC TGGGGTGTGA CCAATACCGG TCCTGATGTG
CAGGACCTCT ATATCGAACG GATTGACGCC CAAAATTATG TCGAATACAA TGGCAAGCGC
GAACCGGTGA CGCTGATCGA CGAGACGATC AATGTCAAAG ACGCAGAGCC GGTGACGCTG
ACGGTGCGCA TCACCCGCCA TGGTCCAATC ATCAGCGATG TAACCTCCGG CACCGGCGAG
ACGCTGGCAT TTCGCTGGAC ATCGCTCGAC GAGGAGGATG CCACGATCCG CGCCTTTCTC
AATATCAACC GGGCGCACAA CTGGGAAGAG TTCACGACGG CGCTGCGCGA TTACAAAGCG
CCAATGCAGA ACTTCGTCTA CGCTGATGTC GAGGGCAATA TTGGCTACTA TGCCCCAGGT
GCGCTCCCCA TCCGTCGCAA TGGCGATGGG AGGTTGCCTG TGCCGGGATG GACCGACGAA
TACGAATGGG CTGGGTACGT GCCGTTCGAG GAGTTGCCCC ACGTCTACAA TCCGCCACAA
GGCTATATCG TCACTGCCAA CAATCGGGTG ATTGGCGACG ACTATCCGTA TCTCCTGGGA
ACTTCGTGGG CAGCGCCATA TCGGGCGCAG CGCATCGTTG AGATGATCGA TCAGGGTAAT
CGGTTGACCG TCGCCGATAT GCGCGCGATG CTTGGTGACG TTGTCTCGGT GCAGGCGCGT
GAGTTATTGC CGACGCTCCT CGCCGTTTCG CCAACCGGAC CGCGTGAAGC CGCTGCGCTC
GAACTGCTGC GCGGCTGGGA CGGCGCTATG CGCGGCGATA GCGCGGCTGC CTCCGTCTAC
CAGGGGTACT ACTATGCCGC CATTGAAGCG ATCTTTGCCG ATGAACTGCG TGATCTCTTC
ACCCGCCAGT ACCAATCCAG ACGCGACTTT CCGGCGATGG CGCTGCGCGC CGTGTTGCTC
GACGGTCACA ACGAGTGGTG CGACAACGTA ACGACGGTTG CTGTCACAGA AGACTGCGCA
ACCACGCTGG CAAAGGCGCT GACGCAGGGG CTGGAGGCGA TGGCAAAAGC GCAGGGCGAA
AACGATCCGG CGCAGTGGCG TTGGGATCGC GTTCACCAGG CAGTCTTTCC TCATAATCCA
TTCAGCCAGG TGGAGGCGCT GCGCAATGTC TTCGAGCGAC GCATCCCCAA CGGCGGCGAC
AATTTCACCG TCAATGTCGC GCCGGTGCGC ATCACCGAGC CATACTTGCA ATACAACGGA
CCGTCATACC GCCAGATTAT CGACCTTAGC GATCTGAGCG CGTCGCGGTT CATGCACACG
ACCGGACAAT CGGGCAATGT GTTGAGCAGC CGCTACAGCG ACTACCTGGA ACGCTGGCAG
CGAGTGGAAG ACGCGCCCAT GCGGTTGAGC GGCGCCATCG ACGGTGATCG ACTCGTCCTG
GTGGGGGAGT GA
 
Protein sequence
MRLLKRLLLW MVLVLVIVTA LGAGGGYLWL RRSLPQTSGE IRVSGISGPV TIVRDSDGVA 
HITGATETDA AFGLGFVHAQ ERLWQMEVQR RIGHGRLSEV FGATTLNTDR FLRALGVARA
ARRALERLDA NTIAILEAYA AGVNAFLSTN PTLPPEFLIL GVQPEPWQPV DSLVWAKMMA
WDLGGNWNDE VMRALLIAQI GPEDADFLMP AYTTDGPLIL PDAALVRPAA ASTPPDAAIQ
PDTARAMLDL WETVHATTGL GDRLAGSNNW VIGGARTASG KPLLVNDPHL GNRIPSIWYL
AHMQGGAINS IGATFPGLPA VVIGYNERIA WGVTNTGPDV QDLYIERIDA QNYVEYNGKR
EPVTLIDETI NVKDAEPVTL TVRITRHGPI ISDVTSGTGE TLAFRWTSLD EEDATIRAFL
NINRAHNWEE FTTALRDYKA PMQNFVYADV EGNIGYYAPG ALPIRRNGDG RLPVPGWTDE
YEWAGYVPFE ELPHVYNPPQ GYIVTANNRV IGDDYPYLLG TSWAAPYRAQ RIVEMIDQGN
RLTVADMRAM LGDVVSVQAR ELLPTLLAVS PTGPREAAAL ELLRGWDGAM RGDSAAASVY
QGYYYAAIEA IFADELRDLF TRQYQSRRDF PAMALRAVLL DGHNEWCDNV TTVAVTEDCA
TTLAKALTQG LEAMAKAQGE NDPAQWRWDR VHQAVFPHNP FSQVEALRNV FERRIPNGGD
NFTVNVAPVR ITEPYLQYNG PSYRQIIDLS DLSASRFMHT TGQSGNVLSS RYSDYLERWQ
RVEDAPMRLS GAIDGDRLVL VGE