Gene Information |
Locus tag | Rcas_2021 |
Symbol | |
ID | 5539499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2591656 |
End bp | 2593830 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640894156 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001432127 |
Protein GI | 156741998 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC TTCCCGCCGT TCTGTCGCTT GATGGCGCAT GGAAACTACT GCCGGTCAAT GCGTTTCGTC AGGGATTCTA CCCAACCGAC GATGACACAT GGCTGACCCA GGAGATTCCG GCGCATTGGC AACAGCATCC GCTGCTGGAG CGCTATGCGG GATGTATGGT CTACCGCCGG CGCTTCCGCC TGCCAGATGC GCTGGCAGGC ACAACCGCGC AATCACTGCC AATCCGGTAT TTCCTGCGCT TCAATGGTGT GTTTTACTGG GCGCAGCCCT ATTTCAACGG CGTTGATCTT GGGCGTCACG AGGGGTATTT CGAGCCGTAT GAGCGTGAGG TGACCGCCTG GATCGCGTCG GAGAACGAGA TCGTCGTTGA GGTCGAGTGC CCCGATGAAG CCAATAAATC GGGCAAGCGC CTGATCACCG GCATTTTCTC GCACTGGGAC TGCATGGACC CGCTCGCCAA CCCCGGTGGC ATCTGGTTGC CGGTCGAACT GATCGCCGCC GGTCCGGTAC GGGTGCGCAA TCTGCTGCTG CACACCGAAG CGGCTGGCGA GACGCTGGCG GAATTGCGCT TCCGCGCCGT CTTCGACGCA GACGTGGCGC GTGACGTGAC GCTGCGCTGG ACGTTTGCGC CGGTCAACTT TGCGGGAGCG GTGCAGACGA CTGAGCAACG CCGCGCGCTG GCAGCCGGGG AACACGAGAT TGTCGGGGTG TTGTTGGTGC GCGACCCGCA TCTTTGGTGG CCCCACGATA TGGGAGCGCC GGATTTGTAT GCGGTCACGC TCGAGGTTCT CTGTGACGGT GTGGTTTCGG ACTCGCGCAC CGTTCGTTTC GGCATCCGCA CGTTCGAGTT GCACGACTGG ATACCCTATC TTAACGGCGC CCGCTTTTTC ATCAAGGGGA ACAATTATCC GCCAACCGAT GTGCGCATTG CAACGACGAC GCGCGAACGG TGTCTCGAGG ACCTGCGCCT GGCGCGTGCG TGCCATATGA ACATGCTGCG GGTGCATGGA CATATTGCGC ATCCGGCGCT CTACGACGCC GCCGATGAGA TGGGCATGCT GCTCTGGCAG GATTTCCCGC TCCACTGGCT CTACCGCCGT GATGTGCTGC CCGAAGCCCG CCGTCAGGCG CGCGCTATGG TGCGTCTGTT GGGGAACCAT CCCTCGGTGG CGCTCTGGTG TATGCACAAC GAACCGGTGT ACGTCGTTGA TACCCGCGAT GAACGGATGC TGACGCGCGT GCGCACGTAT GCATCAATGT TCGTCTTCAG CTGGAACCGC GATGTGATGG ACAGCGAGTT GAAGCGGGTG GTCGAACGGG AAGACGGCAG ACGACCGGTG GTGCGTTCGT CGAACGAGTT TCCCATTCCG GGCATTCGCG CCGGCGCCAG TACACATTTC TACTATGGTT GGTACACAAT CTATGGGCGC CTCGAAGCGT GGGAGCCGTT GATCCGGCGC TTCCCGCAGT TGGTGCGCTT TGTCACCGAG TTTGGCGCGC AGAGTTTCCC GAATGTCGAG AGTTGCGTCA GGTTCATGGA CGCCGACATC AGGCGCATCG ACTGGCAGCG TCTGGTTGAG CGTCATCAGT TTCAGGCGGA CATTATGGCG CACTGGTACG ATTGGCGCGC GGCACGTTCG CTCGAAGAAC TGGTGCAGAT GTCGCAGGAG TATCAGATTA AGGTCAACCG CCACTATATT GATCGGTTGC GTTTGCGCAA GTATCGCCCA ACTGGCGGCA TGATGCCATT CATGTTCCAT GATGCCAATC CTTCGGTCTC CTGGTCGATC 
ATCGACTACT GGCGCGTTCC GAAACGGTCA TACGAGGCAA TGCGCCTGGC GTTCAGTCCG CACTATATCT TCACCGTGCT GGAAAAAGAA GCCATCGCTC TCGATGAAAC GCTCGATTTA CCGGTTTATG TGGTCAACGA CGCTCACCGC GATCTGGCGG TGACGGCGAC CGCGCGCCTG GTTGGTCCTG GCGATGCAGT GCTGGCGACT GTCGAGCGAT CCTTCACATT GCCCGCCGAC TGTATGACAA TGGAGATCGA GCGGTTGCGC CTGATCCCGT CGGAACCCGG AACGTATCGT CTGGAATTGA CGCTGGATGG CGATCTGGCG GAAGTTATTG TCAACGCCTA TGCGATTCAT GTCTCGGTTG GTTAG
|
Protein sequence | MTNLPAVLSL DGAWKLLPVN AFRQGFYPTD DDTWLTQEIP AHWQQHPLLE RYAGCMVYRR RFRLPDALAG TTAQSLPIRY FLRFNGVFYW AQPYFNGVDL GRHEGYFEPY EREVTAWIAS ENEIVVEVEC PDEANKSGKR LITGIFSHWD CMDPLANPGG IWLPVELIAA GPVRVRNLLL HTEAAGETLA ELRFRAVFDA DVARDVTLRW TFAPVNFAGA VQTTEQRRAL AAGEHEIVGV LLVRDPHLWW PHDMGAPDLY AVTLEVLCDG VVSDSRTVRF GIRTFELHDW IPYLNGARFF IKGNNYPPTD VRIATTTRER CLEDLRLARA CHMNMLRVHG HIAHPALYDA ADEMGMLLWQ DFPLHWLYRR DVLPEARRQA RAMVRLLGNH PSVALWCMHN EPVYVVDTRD ERMLTRVRTY ASMFVFSWNR DVMDSELKRV VEREDGRRPV VRSSNEFPIP GIRAGASTHF YYGWYTIYGR LEAWEPLIRR FPQLVRFVTE FGAQSFPNVE SCVRFMDADI RRIDWQRLVE RHQFQADIMA HWYDWRAARS LEELVQMSQE YQIKVNRHYI DRLRLRKYRP TGGMMPFMFH DANPSVSWSI IDYWRVPKRS YEAMRLAFSP HYIFTVLEKE AIALDETLDL PVYVVNDAHR DLAVTATARL VGPGDAVLAT VERSFTLPAD CMTMEIERLR LIPSEPGTYR LELTLDGDLA EVIVNAYAIH VSVG
|
| |
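The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length counts the terminal stop codon (the gene sequence above ends in TAG). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2591656
END_BP = 2593830
GENE_LENGTH_BP = 2175
PROTEIN_LENGTH_AA = 724


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold: the interval spans 2175 bp, and 2175 / 3 = 725 codons, one of which is the stop codon, leaving 724 residues.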