Gene Information |
Locus tag | Arth_3325 |
Symbol | |
ID | 4443967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3733230 |
End bp | 3735245 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691148 |
Product | Beta-galactosidase |
Protein accession | YP_832800 |
Protein GI | 116671867 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
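The coordinates and lengths in the Gene Information table are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3733230
end_bp = 3735245

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2016 0 671
```

Under these assumptions the result matches the listed Gene Length (2016 bp) and Protein Length (671 aa).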
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0011475 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCCC AGGAAAATCC CACTCCACCC GCAGTGTGGA GCAGCCTCGA AGGTATCGCC TACGGGGGCG ACTACAACCC CGAGCAGTGG CCTGTCAGCA CCCGGCAGGA AGACCTGGAC CTCATGAAGG AAGCCGGCGT CACGTTCCTC AGCGTGGGCA TTTTTTCCTG GGGCCTGCTG GAACCGTCCG AGGGCGACTA CGATTTTGGC TGGCTGGACG ACGCCCTGGA CAACCTGGCA GCAGCAGGCA TCAAGGTGGC CCTCGCGACC GCAACCGCCG CGCCCCCGGC ATGGTTGGTG CGCAAGCACC CGGAAATCCT CCCGGTCACC GCTGACGGAA CCGTGCTGGG GCCGGGCTCC CGGCGGCACT ACACGCCGTC GTCGGCCGTT TACCGGCGGT ACGCAACCGG CATCACCCGG GTGCTGGCAG AGCGCTACAA GGACCACCCG GCACTGGCGC TGTGGCACGT GGACAACGAA CTCGGCTGCC ACGTTTCGGA GTTCTACGGC GACGAGGACG CCACTGCCTT CCGCCGTTGG CTTGAGCGTC GGTACGGGAG CATCGATGCC CTGAACGCGG CGTGGGGAAC GGCGTTCTGG TCCCAGCACT ACTCCTCGTT CGAGGAGGTC CTGCCGCCAG CGGCGGCGCC GTCCACGCTT AATCCGGGGC AGCAGCTGGA CTTCCAGCGC TTCAGCTCCT GGGCCCTGAT GGACTATTAC CGCGCCCTCC TCGAGGTGCT GCGCGACGTA ACGCCCGGGG TGCCTGCCAC CACCAACCTC ATGGTTTCCA GCGCCACCAA ATCCATGGAC TACTTCGACT GGGCCAAGGA CCTGGACGTG ATCGCCAACG ACCACTACCT CGTGGCCGCC GATCCGGAGC GTCAGATCGA ACTTGCGTTC AGTGCGGACC TCACGCGCGG CGTTGCCGGC GGTGACCCGT GGATACTGAT GGAACATTCG ACGTCGGCAG TGAACTGGCA GCCCCGCAAC CAGCCCAAGA TGCCCGGCGA AATGCTGCGC AACTCCCTGG CGCACGTGGC CCGCGGCGCT GATGCCGTTA TGTTCTTCCA GTGGCGGCAG AGCTTCGCCG GTTCGGAGAA GTTCCACTCC GCTATGGTGC CCCACGGCGG ACGGGACACC CGCATCTTCC ACGAGGTGGT GGGGCTCGGC GCCGCACTCC AGAAGCTGGC CGCGGTGCGG GGATCGCGCG TTGAGTCGCG CGTGGCCATC GTCTTCGACT ACGAGGCCTG GTGGGCCAGC GAACTCGATT CCCACCCCAG CCAGGACGTG AAATACCTCG ACCTGATGCG CGCCTTCCAC CGCTCGCTGT TCCTGCGCGG GGTTTCGACG GATTTTGTGC ACCCTTCCGC CTCCCTGGAG GGCTACGACC TGGTGCTGGT GTGCACCCTG TACAACGTCA CCGACCCGGC TGCCGCCAGC ATCGCATCGG CTGCCCGCGC CGGCGCAACA GTCCTGGTGA GCTATTTCAG CGGGATTGTG GACGAACAGG ACCACATCCG CCTCGGCGGG TACCCGGGCG CCTTCCGGGA GCTGCTGGGC ATCAGGGTGG AGGAATTCCA CCCCCTGCTG ACCGGTGCCA CGGCCAAGCT CAGCGACGGC CGAGTGGGCA CGGTATGGAG CGAGCACGTG CACTTGGCCG GGGCTGAACC GGTACAGACG TTCACTGAAT ACCCGTTGGA GGGCGTCCCC GCCCTGACCC GGCGCTCCGT CGGAACGGGC TCGGCCTGGT ACCTGGCCAC CTTCCCGGAC CGCGACGGCA TCGAGGCGCT GGTGGACCGG 
TTGCTCGCTG ATTCCGGAGT TTCCGCAGTG GCGGAGGCGG ACCAGGGCGT TGAACTGACG CGACGCCGCA CCGGCGACGG CACCAGCTTC ATCTTCGCCA TCAACCACTC ACGCGCCGAT GCCACGGTCC GCGTGGCAGG GGCAGAGCTG CTGTCCGGCG AGCGGTTCAC CGGCACGGTT CCGGCTGGCG GCGTGGCGGT GATCGCGGAG GACTGA
|
Protein sequence | MSSQENPTPP AVWSSLEGIA YGGDYNPEQW PVSTRQEDLD LMKEAGVTFL SVGIFSWGLL EPSEGDYDFG WLDDALDNLA AAGIKVALAT ATAAPPAWLV RKHPEILPVT ADGTVLGPGS RRHYTPSSAV YRRYATGITR VLAERYKDHP ALALWHVDNE LGCHVSEFYG DEDATAFRRW LERRYGSIDA LNAAWGTAFW SQHYSSFEEV LPPAAAPSTL NPGQQLDFQR FSSWALMDYY RALLEVLRDV TPGVPATTNL MVSSATKSMD YFDWAKDLDV IANDHYLVAA DPERQIELAF SADLTRGVAG GDPWILMEHS TSAVNWQPRN QPKMPGEMLR NSLAHVARGA DAVMFFQWRQ SFAGSEKFHS AMVPHGGRDT RIFHEVVGLG AALQKLAAVR GSRVESRVAI VFDYEAWWAS ELDSHPSQDV KYLDLMRAFH RSLFLRGVST DFVHPSASLE GYDLVLVCTL YNVTDPAAAS IASAARAGAT VLVSYFSGIV DEQDHIRLGG YPGAFRELLG IRVEEFHPLL TGATAKLSDG RVGTVWSEHV HLAGAEPVQT FTEYPLEGVP ALTRRSVGTG SAWYLATFPD RDGIEALVDR LLADSGVSAV AEADQGVELT RRRTGDGTSF IFAINHSRAD ATVRVAGAEL LSGERFTGTV PAGGVAVIAE D
|
| |
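The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the method used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may serve as starts. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCATCCC AGGAAAATCC CACTCCACCC"))  # MSSQENPTPP
```

Passing the full gene sequence to `translate` and `gc_percent` would give values to compare against the Protein sequence and GC content fields.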