Gene Rcas_2799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2799 
Symbol 
ID5540286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3616849 
End bp3620202 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content61% 
IMG OID640894926 
Productpeptidase S41 
Protein accessionYP_001432888 
Protein GI156742759 
COG category[S] Function unknown 
COG ID[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.38727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCAC AAGGATACTA CCGCTGGCCA ACCATTCACA ACGATACCGT TGTCTTCGTC 
TGCGAAGACG ATCTCTGGAC GGTTCCGGCG TCGGGCGGCG TGGCGCGGCG GCTGACCGCC
AATCCCGGCA GCGTTCAGTC GCCGGCGCTG TCGCCGGATG GCGCCCTTCT GGCATTCGTC
GGACGCGACG AAGGACCCGG CGAGGTATTT GTGATGCCCG CCGAAGGCGG CGAGGCGCGT
CGTCTGACGT TCCTCGGCGC AACCATGCGT GTTTGCGGCT GGGGTCGCAA TGGTCGTGAG
ATACTGTTTG CCAGTTCCGC CAGCCTTCCC TTTTCACGGA TGGCATTGCT CTATGCTATT
CGTGCCGACG GTGGCGAGCC TCGCCTCCTG CCGACGGGTC CGGCGGTTAC GATCTCGTAT
GGTCCAACCG GAGGTGCTGT GATTGGGCGC AACGAAAGCG ACCCGGCGCG CTGGAAACGC
TATCGCGGCG GGCGTACCGG CGATGTGTGG ATCGATCCAG ACGGAACCGG CGAATGGCGG
CGCCTGATTG CGCTGCCCGG CAATATTGCC ATTCCCCTGT GGGTGGGCGA GCGTATCTAC
TTTGTTTCCG ACCACGAAGG AGTCGGCAAT CTCTACTCCT GCCTGCCGAC GGGCGACGAC
CTGCAACGCC ACACCTGGCA CCGCGAGTAC TACGCTCGTT TTCCTTCCAC CGACGGACGG
CGGATTGTGT ATCACGCCGG CGCCGATCTG TATCTGTTCG ATCCCGCAAC CAACACATCG
CGCAGGATCG AGGTTGAACT GCGCAGCCCG CGAACGCAAC GGAAGCGTCG TTTTGTCGAT
CCGGCGCGTT TCCTTCAGAG CGTTGCGTTG CATCCCGAAG GTCATTCGCT TGCCGCAATC
GTGCGTGGCA AACCGTTTAC CTTCGGCAAT TGGGAAGGCG CCGTCTTGCA GCACGGCGAT
CCCGGCGCTG TGCGCTATCG CCTGGCTGAC TGGCTGCCCG ATGGCCGACG CATTGTGGTG
GTCAGCGATG CAGTTGGCGA GGAGACGCTC GAAATCCATT CAGTTGCGTT CAGTAATGGC
GGACAAAGAG CGGCAGCGTC AGATGCCGCT GCTGTGGATG CGGCTCTTTC CCCGTTTGAT
GCACCGGTGC GCCTGGAAGG TCTCGACATT GGGCGTCCTA TGGCACTTGC GGTTTCTCCC
AAAGCGCCGC TCGTCGCGGT TGCGAACAAT CGAAATGAAT TGCTGCTGGT CGATCTGACT
GAACGCACCG TGCGGTTGCT CGATCGGAGT CGATATGCCT CGATGCTGGG CATTGCCTGG
TCGCCGGATG GACGCTGGCT GGCGTATGGC TTTTGGGAGA CGGGGCAGAC CTCGATCATT
AAGGTGTGCG AGGTTGCGAC CGGGACCATC ACGCCGGTCA CCCGACCGGT TCTGGTTGAT
CGGTCGCCCG CGTTCGATCC AGAAGGCAAA TATCTTTACT TCATCTCGTC CCGCGATTTC
GATCCGGTCT ATGACGACAT GCACTTCGAT CTTGGCTTCC CGCGCGGCAC GCGCCCATTC
CTGGTGACGC TGCGCGCCAA CCTGCGCTCG CCCTTCGTGC CGAGACCACA TCCGCTCGAT
CAGGCGACAC CCAAGCCATC GGTTGGCGAG GCGAAACCTG CCGGTGAGAC GGGCGGCGAT
GCGCCGGGTG CTGAGGGAGC AGCACAAGGA ACGACAAAAT CCGAGCCGAT GATGACCATT
GATCTGGAGG GAATTGCCAA TCGGATTGTG GCATTTCCGC TGCCGGTCGG ACGATACCGG
CAGATAGCGG GTATCCCCGG CAAGGCGCTC TTCACGGTGT TTCCGGTCGA AAGCGCGCTC
GGTCTATCCC GGATGCCGGG CGACTCGGCG GTTGCGCGCG GGCGTCTCGA TGTGTACGAT
TTCGAGACGC TCAGTAGCGA AACGTTGATC GATGGCGTGT CTGCATTCTC TCTTTCCCAT
GACTCAAAGA CGCTGATGTA TCGTGCCGGT AACCGGGTGC GCGTGGTCAA GGCAGGTGAA
AGGCCAAAGG ACAATAGTAG CGAGCCGGGG CGGAAGAGCG GTTGGGTCGA TCTGGCGCGC
ATCAAATTGA TGATTTCGCC GCCTGCTGAG TGGGTGCAGA TGTATCGCGA AGCCTGGCGG
TTGCAGCGTG ACCACTTCTG GACGCCGGAT ATGTCGGGGG TGAACTGGCT CATCGTCTAC
CATCGCTACC TGCCGCTGCT TGATCGAGTC GCCACACGCG GCGAGTTCTC CGATCTTCTG
TGGGAGATGC AGGGAGAACT GGGCACCTCA CACACCTACG AGTATGGCGG CGATTATCGC
CCGGAACCGC AGTACAGTCT GGGGAAATTG GGCGCCGATC TGCGCTATGA CGCTGAAACC
GACAGTTATA TCGTCGAGCG CATTGTCGCC GGCGATGTGT GGAACGAACG CGCCAGTTCG
CCGCTGGCGC GCCCCGGCGT CAATATTGCG CCAGGTGATC GCTTGATTGC GATCGGCAGT
TGGCGCGTCG GGCGTGACGT CTCGCCGCAT GAGGTGCTGG TCAATCAGGC AGGGTGCGAT
GTGTTGTTGA CGTTCAGGAA AGCCGATGGA ACACTCCGCG CAGTGACAGT AAAGGCGCTC
CATGACGATA CGCAGGCGCG CTACCGCGAG TGGGTGGAGC GCAATCGCGC GATCGTGCAC
GAGGCAACGA ATGGGCGTGT CGGGTATATT CACATTCCCG ATATGCAGGC GTTCGGGTAC
GCCGAGTTCC ATCGCGGCTT CCTTGCCGAA GTCGCACGCG AGGGGTTAAT CGTCGATGTG
CGGTATAATG CAGGCGGCTT TGTGTCGCCG CTGGTCGCCG AGAAACTGGC GCGCAAACGC
CTGGGGTACG ATGTCTCGCG CTGGGGTGAA CCGGCGCCCT ATCCGCCCGA GTCGATTATG
GGACCAATGG TGGCAATTAT CAATGAAGCG GCCGGTTCCG ATGGCGACAT CATCAGCCAT
GTCTTCAAGA TGATGAAACT CGGGCCGTTG ATTGGCAAGC GCACCTGGGG CGGCGTGATC
GGTATTCATC CACGTGACAC GCTCATCGAT GGCGGGGTGA CAACCCAGCC AGAGTTTTCT
TTCTGGTCGG CGGAGGTTGG CTGGCAGTTG GAGAACCATG GCGTCGAGCC GGATATCGAA
GTCGAGATGC GGCCACAGGA TTATGTGGCC GGCGCCGACC CGCAACTCGA CCGCGCGATA
GCCGAAGTGT TGCGTCTGAT GAACGACAAT CCACCCCGGC TGCCGGAATT TGGTGAACGA
CCGCGTTTGC CGTTGCCGGA AGAAGAGTGT GATGAGGAAC GATCTGGACG CTGA
 
Protein sequence
MAPQGYYRWP TIHNDTVVFV CEDDLWTVPA SGGVARRLTA NPGSVQSPAL SPDGALLAFV 
GRDEGPGEVF VMPAEGGEAR RLTFLGATMR VCGWGRNGRE ILFASSASLP FSRMALLYAI
RADGGEPRLL PTGPAVTISY GPTGGAVIGR NESDPARWKR YRGGRTGDVW IDPDGTGEWR
RLIALPGNIA IPLWVGERIY FVSDHEGVGN LYSCLPTGDD LQRHTWHREY YARFPSTDGR
RIVYHAGADL YLFDPATNTS RRIEVELRSP RTQRKRRFVD PARFLQSVAL HPEGHSLAAI
VRGKPFTFGN WEGAVLQHGD PGAVRYRLAD WLPDGRRIVV VSDAVGEETL EIHSVAFSNG
GQRAAASDAA AVDAALSPFD APVRLEGLDI GRPMALAVSP KAPLVAVANN RNELLLVDLT
ERTVRLLDRS RYASMLGIAW SPDGRWLAYG FWETGQTSII KVCEVATGTI TPVTRPVLVD
RSPAFDPEGK YLYFISSRDF DPVYDDMHFD LGFPRGTRPF LVTLRANLRS PFVPRPHPLD
QATPKPSVGE AKPAGETGGD APGAEGAAQG TTKSEPMMTI DLEGIANRIV AFPLPVGRYR
QIAGIPGKAL FTVFPVESAL GLSRMPGDSA VARGRLDVYD FETLSSETLI DGVSAFSLSH
DSKTLMYRAG NRVRVVKAGE RPKDNSSEPG RKSGWVDLAR IKLMISPPAE WVQMYREAWR
LQRDHFWTPD MSGVNWLIVY HRYLPLLDRV ATRGEFSDLL WEMQGELGTS HTYEYGGDYR
PEPQYSLGKL GADLRYDAET DSYIVERIVA GDVWNERASS PLARPGVNIA PGDRLIAIGS
WRVGRDVSPH EVLVNQAGCD VLLTFRKADG TLRAVTVKAL HDDTQARYRE WVERNRAIVH
EATNGRVGYI HIPDMQAFGY AEFHRGFLAE VAREGLIVDV RYNAGGFVSP LVAEKLARKR
LGYDVSRWGE PAPYPPESIM GPMVAIINEA AGSDGDIISH VFKMMKLGPL IGKRTWGGVI
GIHPRDTLID GGVTTQPEFS FWSAEVGWQL ENHGVEPDIE VEMRPQDYVA GADPQLDRAI
AEVLRLMNDN PPRLPEFGER PRLPLPEEEC DEERSGR