Gene Rcas_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0937 
Symbol 
ID5538403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1243854 
End bp1245008 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content62% 
IMG OID640893086 
Productpeptidase C60 sortase A and B 
Protein accessionYP_001431069 
Protein GI156740940 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.1124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGGAA GTTGCGGTCG TTCCTGGTTC TCGAAGCAGC ATAGTATGAT CGCGCGTCTC 
CTGATGATCA TCATCGTCTG CGCCGGGGCA TTGCTTCCGT CAAGCGCGCA TGCGCAGCGC
GGGGAGCGCT GCTTCACCGA AACAGGTTAC TGTATTGGCG GGCGCATTCG CGAGTTCTGG
GAGCAAAACG GCGGCTTGCG CGTGTTTGGT TACCCTATCA CGCCATTGCA AACAGAGACG
ATCGAAGGGC GAACATTGCA GGTGCAATGG TTCGAACGCG CGCGCCTGGA ATTGCATCCG
GCAAACCGGC GCCCCTACGA CGTGCAACTG GGTCGTCTCG GCGCCGAACT GCTCGCGCGC
AGTGATCGTG GCGGAACGCC GGTGAATGTC GCCGCGTCGG GTGAGTGTCG CCTGTTCCCT
CAGACCGGCG TCAGCGCCTG TGGTCCGATC CTGGCGGCAT GGCGCTCCGT CGGTTTGCAA
CTCGATGGGA AACCTGGCGT GAGCGAGGCG GAAAGTCTGG CGCTCTTCGG CGTTCCGTTG
ACCGACGCGC GCCTGGAGAC CCTTGCCGAT GGCAAAGCGT ATGTGGTGCA GTGGTTCGAG
CGAGGGCGTT TCGAGGTGCA TCCCGAAAAC ATGCCTCCAG CGAATGTCTT GCTCGGTCTG
CTGGGGCGCG AGTATGGACC GGTGGCGCGC GCGGGGCGAA CCGCAGGCGA CGTTCCGGCG
CGTATTGTCG CCGGCGATGT CGGCATGGAT GCCCGCATTG TCGCTGTTGG ACTGGATGCA
CAGGGAATGC CAATCGTGCC CGACCACGAC GTTGGCTGGT ACAACCGCAG CGCATTGCCG
GGGCAGGGCG AGAATGTGGT GCTATGGGGT CATGTGCTGC GTTTCAGCCA TGCACCGCGC
ATTCCTGCGC CATTTGCGCG TCTGAAAGAG TTGCGTCCCG GTGCGCGTCT GACAGTGTAT
GACTCAAACG GAACCGCCTT TGACTATGTG GTGACGCGCC AGGTGTGGGT GCGCCCGACC
GATGTGGAGT GGATGCTGCC GCAGGGTCGT GAGCGCCTGA CCCTTATCTC CTGCATCGGC
GATAAGGTGA TCGTCGGACG AGAAGTGGTG GATATGAGTC ACCGCCTGAT CACGATTGCC
GAACCGGTGC GCTGA
 
Protein sequence
MDGSCGRSWF SKQHSMIARL LMIIIVCAGA LLPSSAHAQR GERCFTETGY CIGGRIREFW 
EQNGGLRVFG YPITPLQTET IEGRTLQVQW FERARLELHP ANRRPYDVQL GRLGAELLAR
SDRGGTPVNV AASGECRLFP QTGVSACGPI LAAWRSVGLQ LDGKPGVSEA ESLALFGVPL
TDARLETLAD GKAYVVQWFE RGRFEVHPEN MPPANVLLGL LGREYGPVAR AGRTAGDVPA
RIVAGDVGMD ARIVAVGLDA QGMPIVPDHD VGWYNRSALP GQGENVVLWG HVLRFSHAPR
IPAPFARLKE LRPGARLTVY DSNGTAFDYV VTRQVWVRPT DVEWMLPQGR ERLTLISCIG
DKVIVGREVV DMSHRLITIA EPVR