Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0905 |
Symbol | |
ID | 4076275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 966827 |
End bp | 967888 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006207 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_612900 |
Protein GI | 99080746 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG GGGCGACCAT TCTGGACGCA GAAGGACTGC GTCTCACCGC CGATGAAAAG GCGCTGTTTC GCGAAGCAGC GCCTTTTGGC TTTATTCTCT TTGCGCGCAA TATCGAGAGC GGCGATCAGC TGCGCGCGCT GTGCGACGAG TTTCGCGAAG CCGCCGGGCA TGACTGTCTG ATTACGATCG ACCAGGAAGG CGGGCGCGTG CAGCGGCTTC GTGCGCCGCT GGCGCGGGAG TTTCGCCCGG CGCTCGATCA TGTGGAGGCG GCCTCGGATC CGGTGAGGGC GATGTATCTG CGTGCGCGCC TCATCGCGGC GGAACTGCGC GACTATGGCA TCGACAGCAA CTGTGCGCCG CTCGCTGATG TGGCAACTGC AGAGACCCAT CCGTTCCTGC GCAATCGCTG TTACGGATCC AACGTCGAGA GCGTCGCCGC CATTGCCCGC GCCTGTGCCG AAGGACATCT GGATGGTGGC GTGGTGCCGG TGATGAAGCA CATCCCCGGC CATGGGCGAT CAACGATGGA CAGCCACCAT GATCTGCCCC ATGTGACCGC CCCCGCGGAC CTTTTGCGCG CGGAGGATTT CGCGACCTTT GCCAAGCTCC GGGATCTGCC GATGGCGATG ACGGCGCATC TGGTCTACGA CGCCTTTGAT CCGCGCCCGG CGACGCTTTC GCCGGTCATG ATGGAGATCA TCCGCAATGA GCTGGGCTAT GATGGTCTGG TGATGACCGA CGACATCTCG ATGAAGGCCC TATGGCAGGA CGCGGCCCAA GATGAGGCGC GCACCGCTGG CGGGCCATAT CCCGACGACG AAGTGACCTG GAGTGCCCGA TTGGCGCGTC AGTCCCTTGA TGCCGGCTGT GATGTGGCGC TTTTCTGCAA TTCGTCACTG GCGGCACGCG CCGAGGTTGT CGCGTCCGCT GGGCAGATGA CTAAGGCCGC TCAACGGCGC GCCGAGGCGG CCCTGGCCTG CCGCAAGCCC CCTGTCGTGC TTGACACTGA GGCCGTGGAA GCGGAACTAT CTGCCCTCAT GGGCGGACAG GTTTATGGCT GA
|
Protein sequence | MRFGATILDA EGLRLTADEK ALFREAAPFG FILFARNIES GDQLRALCDE FREAAGHDCL ITIDQEGGRV QRLRAPLARE FRPALDHVEA ASDPVRAMYL RARLIAAELR DYGIDSNCAP LADVATAETH PFLRNRCYGS NVESVAAIAR ACAEGHLDGG VVPVMKHIPG HGRSTMDSHH DLPHVTAPAD LLRAEDFATF AKLRDLPMAM TAHLVYDAFD PRPATLSPVM MEIIRNELGY DGLVMTDDIS MKALWQDAAQ DEARTAGGPY PDDEVTWSAR LARQSLDAGC DVALFCNSSL AARAEVVASA GQMTKAAQRR AEAALACRKP PVVLDTEAVE AELSALMGGQ VYG
|
| |