Gene Information |
Locus tag | RoseRS_2714 |
Symbol | |
ID | 5209683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3371823 |
End bp | 3374885 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640596315 |
Product | Beta-galactosidase |
Protein accession | YP_001277037 |
Protein GI | 148656832 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.204643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAT TAACGGCGCT CAATACGCTG CCGCCGCATG CTCTGACCAT TCCTTTTCCA GCCCACGGTC CGATTGATGC CGATCCACTG GCGTCGCCGT GGCGCCAGAG TCTGAACGGC GTCTGGGAGT TCCTGCTCTT GCCGCGCCCC GACCATGTGA CGGGTGCAGC GCTGGCAGGC GGCGATTGGA AGCCGATCCA GGTTCCCGGC AACTGGACAA TGCAGGGGTT CGGAACGCCG CACTATACCA ACGTGCAGAT GCCCTTTCCC CAGATGCCGC CATCGGTTCC CGACGATAAT CCGACCGGCG TCTATCGCCG TCGGTTTACG CTTGCTCCCG ACTGGCGCGG ACGGCGGATC GTGCTGCACA TTGCCGGGTG CGAGGGCGCG TGTTATGTGT ATCTCAACGA TCAACCCGTC GGTTTTCATA AGGACTCACG CACCCCCGCA GAGTACGATG TGACCGGCGT GGTGCGCTTC GATGCGCCGA ATGAACTGAT TGCCGTTGTG CTGCGCTGGT CCGACGCCAG TTTCATCGAA GATCAGGATC ACTGGTGGCA ATCGGGCATT CACCGTGATG TGTTTCTCTA TGCCACCGAT ACCGTCTATC TGGCGGATCT GTCAGTACGC GCGGATGTGA GCGATGATCT GCAGGAGGGC ATACTTCGGG TGCGCTGCAC CCTTGATGCT ATCGGTGAAG CCGGGGAGCA TACCCGCGTC GAAGCGCAAC TCTATGATGC GCACGGGACG CAGATGTTTG CTGAGCCGCT CGGCGCCACC TATACGCAGA CCCATCCACG CTTCGGGGTG CGTCGCTTTG TGCGCCCGGA ACTGCTTCTG GAGGGACATG TACCGTCGCC ACATCTGTGG TCGGCGGAAA CGCCATACCT GTACACATTT GTCGTGACCG TGTATGGACC GGCTGGACCA GAGAGAAGTG CGTGCCGGGT TGGTTTCCGC TCGATAGCCA TTCGTCACCG TCAACTGCTG GTGAATGGTC GGGCGATCAC CATCAAAGGG GTCAACCGTC ACGATCATTC CGATACGACC GGCAAAGCAG TCAGTCGGGA ATTGATGGAA CTCGACATTC AACGCATGAA GCAGTTCAAC ATCAACGCTG TGCGTGCGTC GCACTACCCG AATGACCCAT ACTGGCTCGA TCTATGCGAC CGTTATGGTT TGTATGTGAT CGATGAGGCG AATATCGAGG CGCACGCATT CTACTTCGAC CTTTGCCGCG ATGCGCGCTA CACGCGGGCG TTTGTCGAAC GGGTGCGGAA CATGATCGAG CGCGACAAAA ATCATCCCTC GATCATCCTC TGGTCGCTGG GGAATGAGAG CGGATACGGT CCGAATCACG ATGCTGCCGC CGGTCTTGCG CGTCGTCTCG ATCCGTCACG ACCGCTGCAC TACGAGGGCG CCATCTCACG CTGGATGGGC GAGTCGTGGC ATGGTGGACG CACTGTGACC GATGTGATCT GCCCGATGTA TGCCTCCATC GAGGAGATTG TTGCGTGGGC TGAGCAGGAA ACCGACGATC CACGCCCGTT GATCCTCTGT GAGTATTCCC ATGCGATGGG AAACAGTAAC GGCAGTCTGG CAGATTACTG GGAAGCGTTC GAGCGCTATC CAGCGCTACA AGGCGGTTTC ATCTGGGAAT GGGTCGATCA CGGCATCCGT GCGACCGATG CGCAGGGGCG CGTCTACTGG GCATACGGCG GCGATTTTGG CGATGTCCCC AACGATGCCA ACTTTGTGTG CGATGGTCTG GTCTGGCCCG ACCGCACACC CCATCCGGCG 
TTGTACGAGT ACAAGTATCT GATCCAGCCG GTGCGCGTCG AACTGGTCGA TCCGTCTGGA ACGATGCTGC GGATCGTCAA TCGCCACGAT TTTGCCAGCA TCGATTGGTT GGACGGGGTG TGGGAAGTGA TTGCTGACGG CGTGCCGGTG GCATCTGGCA GGTTGCCCGA ACTTCATGCC GCACCGGGCG AAGCGCAGGT GGTGAAACTG GATCTCGACG CAGCGCATGG AGCGGGCGAA CGTTTCCTGA CGGTGCGCTT CTACCAGCGT GAAGCGACTC TCTGGGCGCC TCCAGGGCAC GAGGTTGCCT GGCAGCAACT CCCGCTTCCA ACGGTCGCCG CGATGCCTGA ACCGGTTATT GCGGGCGAAT CTGTGGTGGT GGAGCAGCGT CCGGATCGTA TCACGCTGCG CGCTGGCGCC ACGCACGCCG TGTTCGACGT CAGGAGCGGG ACTCTGGCAT CGTTTGGGCG CGATGAGCAA AACCTGATCG TTCGTGGTCC GTTGCTCAAC GTCTGGCGGG CGGCAACCGA TAATGACGGC TTGAAACTGC GGGACGAACC GGAGAAGCCG CTGGCGCGCT GGAAGGCGTT GGGTCTGCAC CGGTTGCACC ATCGCCTGAA CCACATACGA GTGGTTGCCG TTGACAACGG GGCGGCGTCG GTTGAAATCG AGCACGCCGC CACCGGTCGC GACCGTTGGG GCGATTTTAT CCATATCCAT CGCTACACCC TGCACGCCGA CGGCGAACTA TCGGTAGAGA ACACCGTCAT CATCGGCAAT GCCATCAGCG ATCTCCCGCG CGTCGGGGTA TGCATGCTAC TGACGCCTGG TTTGGAACAT CTCGAATGGT ATGGACGCGG TCCGTGGGAC AACTACAGCG ATCGCAAGGC AAGCGCCTTA ATGGGGCGCT GGCGTTCGAC CGTGACCGAC CAGTACGTGC CGTACATTAT GCCGCAAGAG CATGGGCACA AAACTGATGT TCGCTTCCTG CTGCTGACCG ATCAGGACAG GCGTGGGTTG TTCATCGGCG GACAGCCGAC CTTCGAGTTT TCGGCGCTAC ACCACAGCGA CGATGACCTG TTTCGCGCCC TGCACACTAT CGACCTGACG CCGCGTGCTG AGGTCTTTCT CAATCTCGAT GCAGCGCATC GCGGTTTGGG AACCCTGAGT TGCGGACCTG ACACGCTCGA ACAGCACCGT TTGATGGACT CAGTGTATCG GTTTGGGTAT CGGATGCGGG CAGTGTCGTC GGATGTTGGA TAG
|
Protein sequence | MPELTALNTL PPHALTIPFP AHGPIDADPL ASPWRQSLNG VWEFLLLPRP DHVTGAALAG GDWKPIQVPG NWTMQGFGTP HYTNVQMPFP QMPPSVPDDN PTGVYRRRFT LAPDWRGRRI VLHIAGCEGA CYVYLNDQPV GFHKDSRTPA EYDVTGVVRF DAPNELIAVV LRWSDASFIE DQDHWWQSGI HRDVFLYATD TVYLADLSVR ADVSDDLQEG ILRVRCTLDA IGEAGEHTRV EAQLYDAHGT QMFAEPLGAT YTQTHPRFGV RRFVRPELLL EGHVPSPHLW SAETPYLYTF VVTVYGPAGP ERSACRVGFR SIAIRHRQLL VNGRAITIKG VNRHDHSDTT GKAVSRELME LDIQRMKQFN INAVRASHYP NDPYWLDLCD RYGLYVIDEA NIEAHAFYFD LCRDARYTRA FVERVRNMIE RDKNHPSIIL WSLGNESGYG PNHDAAAGLA RRLDPSRPLH YEGAISRWMG ESWHGGRTVT DVICPMYASI EEIVAWAEQE TDDPRPLILC EYSHAMGNSN GSLADYWEAF ERYPALQGGF IWEWVDHGIR ATDAQGRVYW AYGGDFGDVP NDANFVCDGL VWPDRTPHPA LYEYKYLIQP VRVELVDPSG TMLRIVNRHD FASIDWLDGV WEVIADGVPV ASGRLPELHA APGEAQVVKL DLDAAHGAGE RFLTVRFYQR EATLWAPPGH EVAWQQLPLP TVAAMPEPVI AGESVVVEQR PDRITLRAGA THAVFDVRSG TLASFGRDEQ NLIVRGPLLN VWRAATDNDG LKLRDEPEKP LARWKALGLH RLHHRLNHIR VVAVDNGAAS VEIEHAATGR DRWGDFIHIH RYTLHADGEL SVENTVIIGN AISDLPRVGV CMLLTPGLEH LEWYGRGPWD NYSDRKASAL MGRWRSTVTD QYVPYIMPQE HGHKTDVRFL LLTDQDRRGL FIGGQPTFEF SALHHSDDDL FRALHTIDLT PRAEVFLNLD AAHRGLGTLS CGPDTLEQHR LMDSVYRFGY RMRAVSSDVG
|
| |
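The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon (TAG in the gene sequence above) is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 3371823
end_bp = 3374885
reported_gene_length_bp = 3063
reported_protein_length_aa = 1020

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("protein length (aa):", protein_length_aa)
print("matches record:",
      gene_length_bp == reported_gene_length_bp
      and protein_length_aa == reported_protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields, and the gene length is a whole number of codons, which is consistent with the record's "Is gene spliced | No" and "Is pseudo gene | No" entries.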