Gene Rcas_0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0547 
Symbol 
ID5538010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp727273 
End bp728598 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content61% 
IMG OID640892709 
Productpeptidase M24 
Protein accessionYP_001430695 
Protein GI156740566 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGCCG ATGGGACGCG CAGCCGTTCC CGGTTCTCGG TTCTCAGTTC TTCACTCTCT 
TCACCCCTGA CACAGGAGGA CATCTTGAAA CACGATCTTG ACCGGTTGAT GGCGGAACGG
AACCTGGACG CCATCGTTGT CGAAGGACCT GACGGACTGG AAAGCGCTAA TCCCGACTAC
AACTACTTTG TAGGAGGGCG GCACATTCCC GGACTGATCA TCAAAAAGCG CGGCGAACCC
ACGATGCTGC TCCACAGTCC GTGGGAGCAG AATGAAGCCG AACAGACAGG ACTCGCACTT
GTATCGCTCA ATCGCTGGAA TCTGCGTGAG ATACTCCAGG AGTTTCCCGA CCGCCTGGAA
GCACGGGTAG AACATCGCCG CCGTATCTTC ACCGACCTTG GCGTGCGCGG GCGCGTCGGC
ATCTACGGCA CAGTCAAAGC CGGACCGTTC TTTGCGCTGA TGGCGCGCCT GGCGCAACAG
ATCGATGGGC TGGAGATCGT TGCCGAACTC GACCGCGATG TCATTTCAAT GGCGCGCCTG
ACCAAAGACG CCGACGAGGT CGAGCGCATG CGCGCCGTCG GGCGCAAGAC GTGCGCCGTC
CTCCAAGTGG CGGTTGATTT CATTTGCACA GGTTGGATCG ATGGCGGGAG CGTGCGTGGC
GTCGACGGCG CGCCACTCAC AATCGGCGAT GTGCGGCGGG TGATGCTGCG AGAAATCGCG
GCGCAGGGAT TGGAAACGCC GGCCGGGATG ATCATCTCCC AGGGACGCGA CGCTGGTCTG
CCGCATGCGC GCGGCGATGA CGCGATGCCG CTGCGTCCGG GGCAGGCGAT CGTGATCGAC
ATCTTCCCCC GCGAGGCAGG CGGAGGATAT TTTCACGACA TGACGCGCAC CTTTGCCATC
GGATATGCGC CGCCGGAACT GCAACAGGCG TATAATGACG TGCTCGGCGC GTTCGAGATG
GTGACCGCCG CATTCGAGGC GGGTGCGCCA ACCAGGAAGT ATCAGGATAT GGTGTGCGAT
TACTTCGAGG CGCGCGGTCA CGACACCATT CGCCGCACCT ACCCGATTGA GGAAGGATAC
ATCCACTCGC TTGGTCACGG GCTGGGGCTG GAAGTCCATG AAGACCTGAG TTTTTCGTCG
CTCGTGGATC GCGGCGATAC CATCGAACCC GGCGCAGTGT TCACTGTCGA GCCGGGGCTT
TATTACCCGA GCCGCGGCTT CGGCGTCCGG ATCGAGGACA CATACTACTG CGCGCCGGAT
GGTCACTTCG AGAGCCTGAC GCCATTCCCG AAGGAACTGG TGATCCGCCC TTATGAAAGG
GATTGA
 
Protein sequence
MRADGTRSRS RFSVLSSSLS SPLTQEDILK HDLDRLMAER NLDAIVVEGP DGLESANPDY 
NYFVGGRHIP GLIIKKRGEP TMLLHSPWEQ NEAEQTGLAL VSLNRWNLRE ILQEFPDRLE
ARVEHRRRIF TDLGVRGRVG IYGTVKAGPF FALMARLAQQ IDGLEIVAEL DRDVISMARL
TKDADEVERM RAVGRKTCAV LQVAVDFICT GWIDGGSVRG VDGAPLTIGD VRRVMLREIA
AQGLETPAGM IISQGRDAGL PHARGDDAMP LRPGQAIVID IFPREAGGGY FHDMTRTFAI
GYAPPELQQA YNDVLGAFEM VTAAFEAGAP TRKYQDMVCD YFEARGHDTI RRTYPIEEGY
IHSLGHGLGL EVHEDLSFSS LVDRGDTIEP GAVFTVEPGL YYPSRGFGVR IEDTYYCAPD
GHFESLTPFP KELVIRPYER D