Gene Rcas_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2021 
Symbol 
ID5539499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2591656 
End bp2593830 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content60% 
IMG OID640894156 
Productglycoside hydrolase family protein 
Protein accessionYP_001432127 
Protein GI156741998 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TTCCCGCCGT TCTGTCGCTT GATGGCGCAT GGAAACTACT GCCGGTCAAT 
GCGTTTCGTC AGGGATTCTA CCCAACCGAC GATGACACAT GGCTGACCCA GGAGATTCCG
GCGCATTGGC AACAGCATCC GCTGCTGGAG CGCTATGCGG GATGTATGGT CTACCGCCGG
CGCTTCCGCC TGCCAGATGC GCTGGCAGGC ACAACCGCGC AATCACTGCC AATCCGGTAT
TTCCTGCGCT TCAATGGTGT GTTTTACTGG GCGCAGCCCT ATTTCAACGG CGTTGATCTT
GGGCGTCACG AGGGGTATTT CGAGCCGTAT GAGCGTGAGG TGACCGCCTG GATCGCGTCG
GAGAACGAGA TCGTCGTTGA GGTCGAGTGC CCCGATGAAG CCAATAAATC GGGCAAGCGC
CTGATCACCG GCATTTTCTC GCACTGGGAC TGCATGGACC CGCTCGCCAA CCCCGGTGGC
ATCTGGTTGC CGGTCGAACT GATCGCCGCC GGTCCGGTAC GGGTGCGCAA TCTGCTGCTG
CACACCGAAG CGGCTGGCGA GACGCTGGCG GAATTGCGCT TCCGCGCCGT CTTCGACGCA
GACGTGGCGC GTGACGTGAC GCTGCGCTGG ACGTTTGCGC CGGTCAACTT TGCGGGAGCG
GTGCAGACGA CTGAGCAACG CCGCGCGCTG GCAGCCGGGG AACACGAGAT TGTCGGGGTG
TTGTTGGTGC GCGACCCGCA TCTTTGGTGG CCCCACGATA TGGGAGCGCC GGATTTGTAT
GCGGTCACGC TCGAGGTTCT CTGTGACGGT GTGGTTTCGG ACTCGCGCAC CGTTCGTTTC
GGCATCCGCA CGTTCGAGTT GCACGACTGG ATACCCTATC TTAACGGCGC CCGCTTTTTC
ATCAAGGGGA ACAATTATCC GCCAACCGAT GTGCGCATTG CAACGACGAC GCGCGAACGG
TGTCTCGAGG ACCTGCGCCT GGCGCGTGCG TGCCATATGA ACATGCTGCG GGTGCATGGA
CATATTGCGC ATCCGGCGCT CTACGACGCC GCCGATGAGA TGGGCATGCT GCTCTGGCAG
GATTTCCCGC TCCACTGGCT CTACCGCCGT GATGTGCTGC CCGAAGCCCG CCGTCAGGCG
CGCGCTATGG TGCGTCTGTT GGGGAACCAT CCCTCGGTGG CGCTCTGGTG TATGCACAAC
GAACCGGTGT ACGTCGTTGA TACCCGCGAT GAACGGATGC TGACGCGCGT GCGCACGTAT
GCATCAATGT TCGTCTTCAG CTGGAACCGC GATGTGATGG ACAGCGAGTT GAAGCGGGTG
GTCGAACGGG AAGACGGCAG ACGACCGGTG GTGCGTTCGT CGAACGAGTT TCCCATTCCG
GGCATTCGCG CCGGCGCCAG TACACATTTC TACTATGGTT GGTACACAAT CTATGGGCGC
CTCGAAGCGT GGGAGCCGTT GATCCGGCGC TTCCCGCAGT TGGTGCGCTT TGTCACCGAG
TTTGGCGCGC AGAGTTTCCC GAATGTCGAG AGTTGCGTCA GGTTCATGGA CGCCGACATC
AGGCGCATCG ACTGGCAGCG TCTGGTTGAG CGTCATCAGT TTCAGGCGGA CATTATGGCG
CACTGGTACG ATTGGCGCGC GGCACGTTCG CTCGAAGAAC TGGTGCAGAT GTCGCAGGAG
TATCAGATTA AGGTCAACCG CCACTATATT GATCGGTTGC GTTTGCGCAA GTATCGCCCA
ACTGGCGGCA TGATGCCATT CATGTTCCAT GATGCCAATC CTTCGGTCTC CTGGTCGATC
ATCGACTACT GGCGCGTTCC GAAACGGTCA TACGAGGCAA TGCGCCTGGC GTTCAGTCCG
CACTATATCT TCACCGTGCT GGAAAAAGAA GCCATCGCTC TCGATGAAAC GCTCGATTTA
CCGGTTTATG TGGTCAACGA CGCTCACCGC GATCTGGCGG TGACGGCGAC CGCGCGCCTG
GTTGGTCCTG GCGATGCAGT GCTGGCGACT GTCGAGCGAT CCTTCACATT GCCCGCCGAC
TGTATGACAA TGGAGATCGA GCGGTTGCGC CTGATCCCGT CGGAACCCGG AACGTATCGT
CTGGAATTGA CGCTGGATGG CGATCTGGCG GAAGTTATTG TCAACGCCTA TGCGATTCAT
GTCTCGGTTG GTTAG
 
Protein sequence
MTNLPAVLSL DGAWKLLPVN AFRQGFYPTD DDTWLTQEIP AHWQQHPLLE RYAGCMVYRR 
RFRLPDALAG TTAQSLPIRY FLRFNGVFYW AQPYFNGVDL GRHEGYFEPY EREVTAWIAS
ENEIVVEVEC PDEANKSGKR LITGIFSHWD CMDPLANPGG IWLPVELIAA GPVRVRNLLL
HTEAAGETLA ELRFRAVFDA DVARDVTLRW TFAPVNFAGA VQTTEQRRAL AAGEHEIVGV
LLVRDPHLWW PHDMGAPDLY AVTLEVLCDG VVSDSRTVRF GIRTFELHDW IPYLNGARFF
IKGNNYPPTD VRIATTTRER CLEDLRLARA CHMNMLRVHG HIAHPALYDA ADEMGMLLWQ
DFPLHWLYRR DVLPEARRQA RAMVRLLGNH PSVALWCMHN EPVYVVDTRD ERMLTRVRTY
ASMFVFSWNR DVMDSELKRV VEREDGRRPV VRSSNEFPIP GIRAGASTHF YYGWYTIYGR
LEAWEPLIRR FPQLVRFVTE FGAQSFPNVE SCVRFMDADI RRIDWQRLVE RHQFQADIMA
HWYDWRAARS LEELVQMSQE YQIKVNRHYI DRLRLRKYRP TGGMMPFMFH DANPSVSWSI
IDYWRVPKRS YEAMRLAFSP HYIFTVLEKE AIALDETLDL PVYVVNDAHR DLAVTATARL
VGPGDAVLAT VERSFTLPAD CMTMEIERLR LIPSEPGTYR LELTLDGDLA EVIVNAYAIH
VSVG