Gene Rcas_2538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2538 
Symbol 
ID5540020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3275358 
End bp3277439 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content61% 
IMG OID640894668 
ProductTAP domain-containing protein 
Protein accessionYP_001432635 
Protein GI156742506 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00112565 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGGACGG GCGGATTGTG TCGCCACGTT CGCCATGATG TCGGCGTCCC CACTTCACAG 
AAGGAGCATT TCTTGACTCT CGCAGGACGA CGTGGCATAG CGCTGTTGAT CATCCTCCTG
ATGGCGACGG TATGGTTCCC TGCACCTCTC GCTCGCGCCG CCAGCCCGAC AGGTACGTTT
CAACCTGATG CGTGTATGTT TGAACTTCCT GCCGGCGCCG TCGAGGGACG CGACCTTGAG
TGCGGTTGGC TCCAGGTTCC AGAACGTCAC GCGCAACCGG AAGGTCCCGT CATACGCCTG
GCGGTTGCAA TTGTCAAATC GCGCGCCAGC AACCGCAAGC CCGATCCCCT GGTGATGCTC
CAGGGCGGTC CGGGCGGTTC GACAATCGAC ACCTACACCA GAATTCTCTT CAGCCCCGGC
AGCCGCCTGC GCGACCTGAT CGACCGCGAC ATCGTACTGT TCGATCAGCG CGGCGCGTTG
TACTCCGAAC CGTCGCTCGT CTGTCGGGAG CAGCTGGAAC TGGTCGAACG CACGATCGAG
CAGCGATTGA CCTACGAAGA GTCGGATCGG CTGTCGCTCG ATGCGACCGC TGCGTGTCAT
CGGCGGCTGA AGGATGAGGG GATCGACCTT TCGGCATTCA ACAGTATCGA GAATGCGGCA
GATGTCGCTG CGCTGGCGAG CGCGTTGGGA TATGCGCAGA TCAATCTCTA TGGCGTCTCC
TACGGCACAT TGCTGGCGCA GCACGTCATG CGCGATCATC CTGGCATGCT GCGCAGTGTG
ACGCTCGATG CAGTTGTGCC GACCAGTGTC AATTATCTGC TCGAGACCCC TCGCTCACAG
AACCGCGCCT ACACCGAATT GTTCACTGCC TGCGCTGCCG ACGCTGGGTG CCGGACAGCC
TATCCGAACC TGGAACAATC GCTGATTACG ACCATCGAGC GGCTCAACCG TGAACCGGCG
CGCGTGCCGA TCACCGACAA CGAAACCGGC AGAACGTACA ATGCCGTGCT CGACGGCGAC
AGTTTCGCCA GCGTCGTTTT TCAGATCATG TATTCGTCGA GTTTCATCCC GGCAGTACCG
CGGATCATCG ATGATGCAAC GCGCAACGAT TTCCGCGTGC TGGAGCGAGT GCTGCCACTC
ATCGTCTTCG ACCGCACCTT CAGTCTGGGG ATGCACTATG CCGTGATCTG CGCCGAGGAT
GCCGACTTCA CCCCTGAAGA TGCGCCGCTC CAGGGTGTGC GCCCGTTTAT TGCCCTTGAT
GGCCGACGCA GCCTTGAAGG ATACCTGGAA CGCTGCAATA TCTGGCAGGT CGATCAACTT
GCGCCGGTGG TCGATGAGCC GGTGTCCAGC GACATCCCGA CGCTTGTCTT GTCGGGCAGG
TTCGATCCGA TTACGCCGCC GGAGTTCGGC GATGTGGTCG CACGCACGTT GAGCCGGGCA
TATGTCTACA CATTTCCCGA CACCGGGCAT GGTGCAGTTG GTTCGAGCGC CTGTGCGGAT
ACCATACTCA AAGCGTTTCT CGATAATCCG TCACAGCCGC CAGATTCTTC CTGTATCGCA
GAGAGCGCAG GACCACGCTT TATTTCGCCG GACACGATCA TATTCACTCC AATAACGCTT
GCTCTGGCTA CGCTCGACGT TGCCGGTCTG GCGCCGTTTG GCGCGTTTGT CGGAGCGCTG
CTCCTGCTCC TCTCGGCATG GATCGTCTGG CCTCTTGCCT GGTTGTTCCG TCTGATCACC
GGCGGAAAAG CGCCAGAGCC GTCCCCTGGC GCGACCATCG CGCGCTGGCT TGTCGTGCTC
ACCGGCGCTG CCGGGGCGGC ATTCGTCGCC ATTACGCTGA TCAACATCGT GCAAATGGCG
GTTGCCAATG ATGCGACGAT CTTCTACGGG TTGCCGCGTT CTCTGACGCT GGTTGCGCTG
ATCTGGGGAA TGCCCGCACT GGCGTTAGCA ATTGTGGTCT GTACCGCGCT CTCCTGGACG
CGCGGGTGGT GGTCGGGCAT CGGGCGCGTC TACTACGCGC TGCTCTCATT GTCTGCTATC
GGATGCGCAT CCGCTCTGGC GTGGATGGGG GTATTCCGTT GA
 
Protein sequence
MRTGGLCRHV RHDVGVPTSQ KEHFLTLAGR RGIALLIILL MATVWFPAPL ARAASPTGTF 
QPDACMFELP AGAVEGRDLE CGWLQVPERH AQPEGPVIRL AVAIVKSRAS NRKPDPLVML
QGGPGGSTID TYTRILFSPG SRLRDLIDRD IVLFDQRGAL YSEPSLVCRE QLELVERTIE
QRLTYEESDR LSLDATAACH RRLKDEGIDL SAFNSIENAA DVAALASALG YAQINLYGVS
YGTLLAQHVM RDHPGMLRSV TLDAVVPTSV NYLLETPRSQ NRAYTELFTA CAADAGCRTA
YPNLEQSLIT TIERLNREPA RVPITDNETG RTYNAVLDGD SFASVVFQIM YSSSFIPAVP
RIIDDATRND FRVLERVLPL IVFDRTFSLG MHYAVICAED ADFTPEDAPL QGVRPFIALD
GRRSLEGYLE RCNIWQVDQL APVVDEPVSS DIPTLVLSGR FDPITPPEFG DVVARTLSRA
YVYTFPDTGH GAVGSSACAD TILKAFLDNP SQPPDSSCIA ESAGPRFISP DTIIFTPITL
ALATLDVAGL APFGAFVGAL LLLLSAWIVW PLAWLFRLIT GGKAPEPSPG ATIARWLVVL
TGAAGAAFVA ITLINIVQMA VANDATIFYG LPRSLTLVAL IWGMPALALA IVVCTALSWT
RGWWSGIGRV YYALLSLSAI GCASALAWMG VFR