Gene Information |
Locus tag | Rcas_3244 |
Symbol | |
ID | 5540742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4216223 |
End bp | 4218973 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895365 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001433316 |
Protein GI | 156743187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
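The coordinate and length fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions match the values listed in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue; the stop codon encodes none.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4216223, 4218973)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 2751 bp and 916 aa, matching the Gene Length and Protein Length fields.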
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0368186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCT ATCACATGCC GCTCCCTGGT CCCTGGGAGT TTCGTTTTGA CGATCAATCC GACTGGCGCC CCATTGCGGT TCCCGGATGC TGGGAAGACG CTGGTTTTGT AAAAGACCGC TCTGGTCCGG CATGGTATCG CACCTCATTC GTAATTCCGC ACGAACTGGA AGGACGGCGC CTCTTTCTGC ACTTTGGCGC CGTCAGTTAT CACTGTGAAG CATTCATCGT GCGCGATTCG GGTGATATGT GCAGCATCGG CACGCACACC GGCATGTGGG ATGCGTTCGA CCTCGAAATT GGCGATGCCG CAGCGCCGGG TGAGCATGTC ACGTTGCTGG TGCGGGTCGA GAAACCGGCA AGTCTGAGCG ACGGACCGGA GTCGGCATCG CTGCCAGGGC GCTTCCCGCT GCGCGAAACG CTCGCCGGAT TTCTCCCGTA TGTCTGGGGG CATAGCCACG GCGGAATCTG GCAGGAAGTC GCCCTGATCG CCTGCGGTGC GACGCGCTTT CTCGATGCCT GGGTGCATGG CGCGCCTGAT GGTCACATCT TCGTGGAAGC GGAACTCGAT GGTCCGGCAC GTGTCACGCT GGAACTCTAC CGTCCAGATG GCAGTTTCAT CCTCTCGGCA GAAGAAGATG CTGTGCGTGT GGAAACCGCG AGCGGCATTC GCTACCGGCT GCACATGGAC GGCCCGACCT CCGACGCACG CCTCTGGTCG CCGGACGATC CGGCGCTTTA CCGGGCTGTG CTGCGCGCCG GTTACGATGA TCGCATCGAA CTGCGCTTTG GGTTGCGCTC ATTCGACGCG GATGGCGCAA CGCTGCGACT TAACGGCGCA CCGATCTACC CCCGCATGAT CCTGTCGTGG GGCTGGCGCT GGGAGACCTT CGCCCCCAAT CCGGGACCGG AACGGGTGCG CGCCGATTTC GAGCGCCTGA AACGCATGGG GTACAACGGC ATCAAGTGTT GTCTCTGGTT CCCGCCTCGC TACTACTTCG ACCTGGCGGA CGAACTGGGC ATGCTGTTGT GGATCGAGTT GCCGATGTGG TTGCCTCGCG TAACCGACCA TTTCCGTCGT CAGACGCCGG TTGAATACGA ACGCCTGGTT CGTCAGGCGC GGCGACATCC ATCGGTAGTC CTCTACAGCC TGGGATGCGA ACTGGGAAAG GACGTCGGCG CCGATATTCT CGGCTCGCTG TACGCGATGA CGCGCAGTAT GTGCGGCGAC GCGCTTGTGC GCGATAACAG CGGCTCCGGC GAAGCCTACG GCGGTCTGCT CAACGAGTTC GCTCAGTACT ACGACTACCA CTTCTATGCC GACCTGCACT TCTTTCGTGG TCTTCTCGAC GCCTTTTCGC CGCGCTGGCG CCCGGCACAA CCCTGGTTGT TCGGTGAATA CTGCGATTAC GATACCTTCC GTGATCTGCG GCGCTACCGC CGGGCGGATG GTTCACGCCC CTGGTGGTTG AGCGCCGATC TGGCGATCAA TCCAACCGGC GCGCGCTGGA CGAACGAAGC GCCCTTTCTC GAAGAGCGGC TGCGCGCACA GGGGTTGTGG GAGCGCAGTG CTGAACTCGA AGCATTCTCG TATGCGCATG GCTTGCTTCA CCGCAAATGG ACAATCGAGA CGACGCGCAT GTATCGTGAG GTTTCCGGCT ATGTCATCAC CGGCGAAGCC GACACGCCGA TCACCAGCGC TGGCATGTGG GATGCAACCG GCGCCCTCAA GTACGATCCC ACCGAGTTCC GGCGTTTCAA CAACGATCTG GTCGCACTTA TCGGATGGGA CCGACGACGC 
GACTGGGTGC GCGGCGGCGA CCGCGCGGCA TTCTGGGATG TCTGGAGTTA TAACGCCGGA ACGCTCGTCC GTCCCCACGT GATTGTATCG CACTATGGGG CGCAGGGCGG ACCGGCGCGC GCTGCCTGGA GCATCGCCTT CGATGACGAA GCGCCCTTTG CGTCTGGTGA TATCGTCGCC AGCCACGATG TTATGCCCGG CGACGTGCGC GAAATCGGCG TAGCGGAATT TACCGCACCC GATGTGAGCG CACCACTGCG CGCCATGTTC CGGGTGTCGC TTAATGTCGG AACGCAGCGG ACGGACAATG CCTGGCCAAT CTGGTTCTTT CCTGCGAACC CGTGGGCAAC CATGCGCAAT GTGGCGATCT ACGATCCGCT GGGACGGTTG CGCGATCTCA CCCGACTGGC GCCACAGGTG GTTGAAGTGA CGCACGCCGA TCTGCGGAGC GAAGCCGGAG CGCGCGTCGA TCTGCATGCA GCGCCGTTCG TCGTTGTCGC CAGCGCCTGG ACACGCGCAC TATCGATGTA TACTCGCGGA GGCGGGCGTG TCGTGCTGCT CCAGGACGGC GACGGTCCGC CGGGACCGGT TGCAACGGCG GCTATGCCGT TCTGGCGCGA GGCGCTGCGG GTCTGTGAGC CGCACCCGGC GTGGGGTGAT TTTCCGCACG ATGGATGGGC TGGTCTCCAA TTCTTTGGCT GCGCCACCGA CTGCGCGCTC GATACGCAGC CACTCGACGG ACTGACGCGC CCTATTCTGC GCCGTATCGA CACGCGCACC ACCGCCGTTC ACGACTATGC CGCCGAGGTT GTATGGGGCG AGGGACGATT GATCGTCAGC ACTCTGCGCA TCTACGGCGG CGCCGGTGAG CAGCCTTCCG GCATCGGGCG CAATACGTCG GCTGCGTATT TACTGCTGTG CTGGGTGAAG TATCTGGGGA GCAAGAGGTG A
|
Protein sequence | MLSYHMPLPG PWEFRFDDQS DWRPIAVPGC WEDAGFVKDR SGPAWYRTSF VIPHELEGRR LFLHFGAVSY HCEAFIVRDS GDMCSIGTHT GMWDAFDLEI GDAAAPGEHV TLLVRVEKPA SLSDGPESAS LPGRFPLRET LAGFLPYVWG HSHGGIWQEV ALIACGATRF LDAWVHGAPD GHIFVEAELD GPARVTLELY RPDGSFILSA EEDAVRVETA SGIRYRLHMD GPTSDARLWS PDDPALYRAV LRAGYDDRIE LRFGLRSFDA DGATLRLNGA PIYPRMILSW GWRWETFAPN PGPERVRADF ERLKRMGYNG IKCCLWFPPR YYFDLADELG MLLWIELPMW LPRVTDHFRR QTPVEYERLV RQARRHPSVV LYSLGCELGK DVGADILGSL YAMTRSMCGD ALVRDNSGSG EAYGGLLNEF AQYYDYHFYA DLHFFRGLLD AFSPRWRPAQ PWLFGEYCDY DTFRDLRRYR RADGSRPWWL SADLAINPTG ARWTNEAPFL EERLRAQGLW ERSAELEAFS YAHGLLHRKW TIETTRMYRE VSGYVITGEA DTPITSAGMW DATGALKYDP TEFRRFNNDL VALIGWDRRR DWVRGGDRAA FWDVWSYNAG TLVRPHVIVS HYGAQGGPAR AAWSIAFDDE APFASGDIVA SHDVMPGDVR EIGVAEFTAP DVSAPLRAMF RVSLNVGTQR TDNAWPIWFF PANPWATMRN VAIYDPLGRL RDLTRLAPQV VEVTHADLRS EAGARVDLHA APFVVVASAW TRALSMYTRG GGRVVLLQDG DGPPGPVATA AMPFWREALR VCEPHPAWGD FPHDGWAGLQ FFGCATDCAL DTQPLDGLTR PILRRIDTRT TAVHDYAAEV VWGEGRLIVS TLRIYGGAGE QPSGIGRNTS AAYLLLCWVK YLGSKR
|
| |