Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2493 |
Symbol | |
ID | 5539974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3210643 |
End bp | 3213774 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894623 |
Product | Beta-galactosidase |
Protein accession | YP_001432591 |
Protein GI | 156742462 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCTA CCGACGACTC GCCGCTCCCT ATGCTTTTCA TCGCGGGCAT GAAAACCTGG GAGATGCCCG AATTGACGGC GCTCAATACG TTGCCACCAC ACGCACTGAC CGTTCCATTT CCAGCAGATG CCGACATGTC CGTCGATCCA TCGGCATCGC CATGGTATCA GAGTCTGAGC GGTGTGTGGG AGTTCCGCCT GCTGCCGCGC CCTGATGCCG TGACCGCCGC TGCATTGGCA GCAGGCGACT GGTGTCCGAT CCAGGTCCCC GGCAATTGGA CGATGCAGGG CTTCGGGACG CCGCACTATA CCAATGTGCA GATGCCCTTC CCGCATTTGC CGCCGTTCGT GCCAGACGAT AATCCCACCG GCGTTTACCG CCGTCAGTTT ACGCTTCCTC CACAATGGCA CAGACGCCGG ATCGTCCTCC ACATCGCCGG GTGTGAGGGC GCGTGCTATG TGTACCTCAA TGGCGCACCA ATCGGGCTGC ATAAGGACTC GCGCACGCCC GCCGAGTATG ATGTGACCGG CGCCGTGCGC TTCGATGCGC CGAATGAGTT GATCGCCGTC GTGTTGCGCT GGTCCGACGC CAGTTTCGTG GAAGATCAGG ACCACTGGTG GCAGTCGGGC ATTCACCGTG ACGTGTTTCT CTATGCGACC GATACCGTCT ACCTGGCGGA TCTGTCGGCG CGCGGGGATG TGAGCGATGA TCTACGGGAA GGAACGCTCC AGGTGCGTTG CACACTCGAC GCCATCGCCG AAGCCGAAGA GCATACCCGC GTCGAGGTGC AACTCTATGA CGCAAGCGGC GCACCGGTCT TCGCAGAGCC GCTGCGCGCA ACGTATACGC AAACCCATCC GCGCTTTGGC GTGCGTCGCT TCGTGCGCCC GGAACTGTGC CTGGAAGGAC AGGTTGAGTC GCCGCATCTA TGGTCAGCAG AAACGCCGTA TCTGTACATG CTCGTGGTTA CGCTCCATAG ACCGGCGGGA CCGGAACGTC ACACCTGCTA TGTCGGGTTT CGCTCGATTG CCATTCGCAA TCGGCAGTTG CTGGTGAACG GCAGAGTAAT CACGATCAAA GGTGTCAATC GCCACGACCA TTCCGACACA ACTGGCAAGG CGGTCAGCCG CGCCTTGATG GAACTCGATA TTCAGCGCAT GAAGCAGTTC AACATCAACG CCGTGCGCAC GTCGCATTAC CCAAATGACC CATACTGGCT CGATCTGTGC GATCGCTACG GGTTGTACGT CATCGACGAG GCGAATATCG AGTCGCACGC CTTCTATTTC GACATCTGCC GCGATGCACG CTACACTCGC GCATTCGTCG AGCGCGTGCG CAACATGATC GAGCGCGACA AGAATCATCC CTCGGTTATC TTCTGGTCAT TAGGGAACGA AAGCGGGTAT GGTCCCAACC ACGATGCTGC CGCCGGTCTG GCGCGTCGCC TCGATCCGTC ACGACCGCTG CACTATGAGG GCGCCATCTC GCGGTGGATG GGAGAGTCGT GGCAGGATGG GCGCACAGTC ACCGATGTGA TCTGCCCGAT GTATGCGCCA ATTGATGAGA TTGTCGCCTG GGCGGAACAA GAGACCGATG ATCCGCGTCC CCTGATCCTG TGCGAATACT CGCACGCCAT GGGGAACAGC AACGGTAGTC TGGCAGATTA CTGGGAAGCG TTCGAGCGCC ACCCGACGTT GCAGGGCGGG TTCATCTGGG AATGGCTCGA CCATGGCATC CGTGTCGCCG ACGATCAGGG GCGTGTCTAT TGGGCGTATG GCGGCGATTT CGGTGATGTT CCCAACGATG CCAACTTCGT TTGCGACGGT CTGGTGTGGC CCGACCGTTC ACCTCACCCG GCATTGTACG AATACAAATA TCTGATCCAG CCGGTGCGGG GTGAACTGGT CGATCCGGCG GGCGTAACGG TGCGAATCGT CAACCGGCAA GATTTTGCCG ATCTCGACTG GCTGTATGGC GTGTGGGAAG TGACGGTCGA TGGTCTGCCG GTGGCGTCGG GCGAGTTGCC GGAACTGTAC GCTGCGCCGG GTGAGGCGCA GGTGGTGAGT CTTGACCTCG GCGCAGCGAG CAGCGCTCCC GGAGAGCGGT TCCTGACGCT GCGCTTCTAT CAGCGCAATG CGACACTCTG GGCGCCATCC GGGCACGAGG TGTCCTGGCA ACAACTGCCG CTGCCGACGA TTGCCGTTGC GCCTGAACCG GAAGTGACAT CGGCATCGGT TGCTGTCGAA GAGATCGCCG GGCAGATCGT CTTGCGCGCC GGCGCCGTGC GCGCTGCTTT CGATACGACG ACCGGTCTTT TGACTTCATT CGGCAGCGGA TCAGAGAACC TGATTGTTCG CGGACCGCTG CTCAATGTCT GGCGCGCCGC CACAGACAAC GACGGGCTAA AGGTGTGGAA TGAGCCGGAC AAACCGCTGG CGCGCTGGAA AGCACTGGGA TTGCACCAGG TGCAACATCG CCTGCGCAGG ATACGCCTGA TGGCTGCCAG CGATGAAGCG GCAACCGTCG AAATCGAGCA TGGCGCCTCT GGCCGTGGGG AGTGGCGCGA TTTCACCCAT ATCCATCGCT ACACACTGGA CGCCAGCGGT GAGCTGCTGG TCGAGAACAC CGTCCTCATC GGTAGTGCGA TCAGCGACAT CCCGCGCGTC GGAGTACGCC TGACGCTGAT TCCGGGGCTT GAACATCTCG AATGGCACGG ACGCGGACCG TGGGATAACT ATAGTGACCG CAAAGCAAGC GCCATTGTGG GGCGTTGGCG CTCGACCGTG ACCGACCAGT ACGTGCCCTA TATTATGCCG CAGGAACACG GGCATAAAAC CGATGTGCGC TCCCTGCGCC TGACCGACGC TGATGGGCGC GGATTGTTTG TCGCCGGGCG TCCGACCTTC GAGTTTTCGG CGCTCCACCA CAGCGACGAC GACCTGTTCC GCGCTCTGCA CACGATCGAT CTGACTCCGC GCGCCGAAGT ATTCCTCAAC CTCGATGCCG CGCATCGCGG GCTTGGAACG TTGAGTTGCG GACCCGACAC GCTCGAACGC TACCGCCTGA TGGAGAGTGA ATACCAATTC GTGTACCGGA TGCGTATTCT GGGAGAGCCG GTTGATGAAT GA
|
Protein sequence | MHPTDDSPLP MLFIAGMKTW EMPELTALNT LPPHALTVPF PADADMSVDP SASPWYQSLS GVWEFRLLPR PDAVTAAALA AGDWCPIQVP GNWTMQGFGT PHYTNVQMPF PHLPPFVPDD NPTGVYRRQF TLPPQWHRRR IVLHIAGCEG ACYVYLNGAP IGLHKDSRTP AEYDVTGAVR FDAPNELIAV VLRWSDASFV EDQDHWWQSG IHRDVFLYAT DTVYLADLSA RGDVSDDLRE GTLQVRCTLD AIAEAEEHTR VEVQLYDASG APVFAEPLRA TYTQTHPRFG VRRFVRPELC LEGQVESPHL WSAETPYLYM LVVTLHRPAG PERHTCYVGF RSIAIRNRQL LVNGRVITIK GVNRHDHSDT TGKAVSRALM ELDIQRMKQF NINAVRTSHY PNDPYWLDLC DRYGLYVIDE ANIESHAFYF DICRDARYTR AFVERVRNMI ERDKNHPSVI FWSLGNESGY GPNHDAAAGL ARRLDPSRPL HYEGAISRWM GESWQDGRTV TDVICPMYAP IDEIVAWAEQ ETDDPRPLIL CEYSHAMGNS NGSLADYWEA FERHPTLQGG FIWEWLDHGI RVADDQGRVY WAYGGDFGDV PNDANFVCDG LVWPDRSPHP ALYEYKYLIQ PVRGELVDPA GVTVRIVNRQ DFADLDWLYG VWEVTVDGLP VASGELPELY AAPGEAQVVS LDLGAASSAP GERFLTLRFY QRNATLWAPS GHEVSWQQLP LPTIAVAPEP EVTSASVAVE EIAGQIVLRA GAVRAAFDTT TGLLTSFGSG SENLIVRGPL LNVWRAATDN DGLKVWNEPD KPLARWKALG LHQVQHRLRR IRLMAASDEA ATVEIEHGAS GRGEWRDFTH IHRYTLDASG ELLVENTVLI GSAISDIPRV GVRLTLIPGL EHLEWHGRGP WDNYSDRKAS AIVGRWRSTV TDQYVPYIMP QEHGHKTDVR SLRLTDADGR GLFVAGRPTF EFSALHHSDD DLFRALHTID LTPRAEVFLN LDAAHRGLGT LSCGPDTLER YRLMESEYQF VYRMRILGEP VDE
|
| |