Gene Rcas_1834 details


Gene Information

Locus tag: Rcas_1834
Symbol:
ID: 5539312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2342051
End bp: 2343832
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 64%
IMG OID: 640893972
Product: TPR repeat-containing protein
Protein accession: YP_001431943
Protein GI: 156741814
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
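
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2342051, 2343832)
print(length)                     # 1782
print(protein_length_aa(length))  # 593
```

Both results agree with the Gene Length and Protein Length values listed in the record.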


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAT CCTTGACTCC CCGGCAACGC GCAGAGCGCC TGATCGACGA GGGAATGGCG 
GCAATCCGTG CCGGCGATCA GGCGCGCGCC CGCCAGTTAC TCAGTCAGGC AGTACAACTC
GATCCCCAGA ACGAACGCGG CTGGCTGTGG CTCTCTGGTG CGCTGACCGA TCCTGTGCAG
CGCCGCTACT GTCTCGAACG GGTCCTGGCG ATCAACCCGA AGAACGAGAT TGCGCAGCGC
GGGCTGGCAA GTCTTAAGCC TGCCGCTTCA TCGCCTGCTG CTCTGTCGCC ATCCCCCGCC
GCATCGGCGC CGAAACCGAC TGCTCCGTCG CAACCGGCGC AGCGGCAGCA ACCGGTGTTC
GACCCACTTG CGCCCGATGC GACCACGACA CGACGCACAG CGACCGCCAC GCCGCTGCCG
CGCCCGCCGA TACCTGCCAC CATCGCTCCG CCAGCACCCC AAACCGTTGC TCCTGACGCT
GCTGCGACGA CGACCGGGGT GCAGCAGCGA GCGCCGGACG ATCTCCGAAA GCCTCCCCCG
GCGATGCCTG CCGAATCAGC GCCGGTTGAT TCCCTGGCAT CACTGCGTCC CGGCGCAGCT
GCGAAGCGCA GCGGTTTGTT GCGACGACCG GGCGCTGCCG GCAACGCCGC CGCAGATGCT
CCACCAACCG CCATACCTGT CGAGCGGTCT AAAAAGCGCA CACGGACCGT TATCGTTGCG
CTCGTGGGCA TTCTTGCGGT CGTCGTGCTG TTGCTCGTCG GACTGATGTA TCTGTATCCA
GATATTCTCA ATCCGCCGCA ACCAACCGAG GAAGCGGCGA TTCCAACCGT TGAGCCGGAA
GCGCCGACGC CGGTTCCAAC GCCGACGAGT ATTCCAACTG CTACTCCGGC GCCAACGCCG
ACGCCGGTTG ATGTTCAAGC GCTGGTGCGT GAGGCGGAAG AACTTGCCGC TAGTGGCAGC
CTGGGACAGG CGGCAGAACG CTATACCGAA GCGATCCGCG CCGATCCGTC ATCGTTCGAG
GCATACTTTG GGCGCGCTCA GGTCAACTTC AATCTGTCGC TCTTCCAAAA TGCTGTCGAT
GATTTCACCA GAGCATTGGC GCTCGATCCG GAGAACGCCG AGGCGTACCA CCAGCGCGCG
CGCGCCTTCT ATCGCCTGCA ACAGTACGAC GAAGCCATTC GTGATTTCAC CGAGGCGCTT
GCGCGTGATC CCAACAACGA TGTTCTCCTG ATGCGCCGCG GCGTCGCCTA CCGCGACAAG
GGCCAGTACG ACGAGGCGCT GGCGGATTTC GATCAGTCGC TCCAGTTGAA CCCGGACGTC
AGTTTCACCT ATTACCACCG CGCATTGTTG TTCCAGGCGA CCGGCAGGCT TGAGCGGGCG
CGCGCCGATT TTGATCGGGC GCTGACGATT GCGCCGGAGT ATCGTCTGGC ATATGTCGGG
CGTGGCGGGT TGCGCTTAGA GCAGGGCGAT GCGCGTGGCG CGCTCAGGGA CTGCACCCGC
GCCATCGAAC TCGACGCTAC CGAGATCGAT GCGTATTTCT GCCGCGCCAG AGCCGCTATT
GCGCTCCGTG ATTACCGCGC CGCCGTCGCC GACCTCGATA CCGTCATTGC GCGCGATCCC
GATAGCGCCG ATGCTTACCG CGAGCGTGGG AGGGCGCACC AGGCGCTGCG CGACACCGAT
GAGGCGCGGG CGGATTATCA GCGCGCCATC GAACTCTACC GCCTTCAGGG ACGCGACAAG
GACCTGGCGG AGGTCGAAAA GTTGCTCGCT GCGCTGAAAT AG
 
Protein sequence
MNESLTPRQR AERLIDEGMA AIRAGDQARA RQLLSQAVQL DPQNERGWLW LSGALTDPVQ 
RRYCLERVLA INPKNEIAQR GLASLKPAAS SPAALSPSPA ASAPKPTAPS QPAQRQQPVF
DPLAPDATTT RRTATATPLP RPPIPATIAP PAPQTVAPDA AATTTGVQQR APDDLRKPPP
AMPAESAPVD SLASLRPGAA AKRSGLLRRP GAAGNAAADA PPTAIPVERS KKRTRTVIVA
LVGILAVVVL LLVGLMYLYP DILNPPQPTE EAAIPTVEPE APTPVPTPTS IPTATPAPTP
TPVDVQALVR EAEELAASGS LGQAAERYTE AIRADPSSFE AYFGRAQVNF NLSLFQNAVD
DFTRALALDP ENAEAYHQRA RAFYRLQQYD EAIRDFTEAL ARDPNNDVLL MRRGVAYRDK
GQYDEALADF DQSLQLNPDV SFTYYHRALL FQATGRLERA RADFDRALTI APEYRLAYVG
RGGLRLEQGD ARGALRDCTR AIELDATEID AYFCRARAAI ALRDYRAAVA DLDTVIARDP
DSADAYRERG RAHQALRDTD EARADYQRAI ELYRLQGRDK DLAEVEKLLA ALK
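
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned and translated. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1, and it does not handle the alternative start codons that table 11 permits. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing above.
first_row = clean("ATGAACGAAT CCTTGACTCC CCGGCAACGC")
print(translate(first_row))  # MNESLTPRQR
```

The translated fragment matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content reported in the record.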