Gene Information |
Locus tag | Saro_3522 |
Symbol | |
ID | 5077671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 135445 |
End bp | 137049 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481246 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001165908 |
Protein GI | 146275748 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
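The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the record does not state either convention explicitly, but its numbers are consistent with both. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a stop codon that is
    counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Saro_3522
gene_len, protein_len = cds_lengths(135445, 137049)
print(gene_len, protein_len)  # 1605 534
```

Both results agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.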
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.653851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACCGGC GCGATGCGAT GAAGTCGGCC GCCATTGCCG CGCTCGGCCT TGCAACGCCG CTGCCCGCGC TTGCCGGGCC GGCTACCGCT GCCACCGCTT CCGCGCCCCG CTGGAAGCGC GGGCTCGACA ACCAGCGCAT CGCCGACCTC GGGGATGGCA CGTTCCTCAA TCCGGTCCTC GCCGGCGACC GCCCCGACCC CGCCATCCTC AAGGACGGCA AGGACTACTA CCTCACCTTC TCCGGCTTCG AGAGCTATCC CGGCCTGCTC ATCTGGCACT CGCGCGATCT TGTGAACTGG GCGCCGAGGA AGCCCGCGCT TCACAGGAAC ATCGGCTCGG TCTGGGCGGT CAGCCTCGAC CGGCACAAGG GTCGCTACTA CCTCTACATC CCGGTCAAGG CCTCTCCGCA GAACGACACC TTCGTGATCT GGGCCGACCA TATCGACGGA CCGTGGAGCG AACCCAAGCC CCTCGGCCTG CCCAACCATA TCGACCCGTG CCATGCGGTG GGCGAAGACG GCTCGCGCTG GCTGTTCCTA TCCGGCGGAG ACCGCGTGCG TCTCTCGGAC GACGGCCTCA GCCTCGCCGG CCCCGTGGAA CACGTCTACG ATCCCTGGCG CTATCCCGAG GAATGGGACG TCGAGGGCTT CGCGCCCGAA GGCCCCAAGA TTCACCGCGT CGGCAAGTGG TTCTACATGC TGACCGCCGT GGGCGGAACC GCAGGACCAC CCACCGGCCA CATGGTCATC GCCGCGCGCT CGCGCTCGAT CCACGGGCCG TGGGAACAGC ATCCCGGCAA TCCGCTGGTG CGCACCACCG ACATCCGCGA AGCCTGGTGG TCGCGCGGCC ACGCCAGCCT CGTCGAAGGG CCGGACCGCA CCTGGTGGAG CCTCTATCAC GGCTTCGAGA ACGGCTTCTG GACGCTGGGT CGCCAATGCC TGCTCGATCC GGTGGAATGG ACCGCCGATG GCTGGTTCCG CATGACGGGC GGCGACCTCT CGCGCCCCCT CGCCAAACCG AAGGGCGGCG AGGCGCTGCC CCATGGCATG GCGCTGTCGG ACGACTTCTC CCGCCTGCAA CTGGGCACCA AGTGGTCGTT CTTCCGTCCT TCGCCGGACG AGGCAAGCCG CGCGCGCGTG GTCGGCAACA CCCTGCTCCT CAAGGGCAAG GGCCTCGCCC CATCGACCGG TTCGCCGCTC CTCCTGATCG CGGGCGACAC GTCCTACCGG TTCGAATGCG ACATCGAACT CGCGCCCGGC GCCACGGCGG GCCTGGTGCT GTTCTACGAC GACAAGCTCT ACGCCGGCCT CGGCTTCGAT GCCGAACGCT TCGTCACCCA CCAGTACGGG ATGGAACGCG GCCGCCCCGC CAATCCCCAC GGCACCGCCA TGCGCATCCG CGTGACCAAC CGGCGACATA TCGTCAGCTA CCACACTTCG GGCGACGGCG GGCAGACGTG GCGCAAGTTC GACCGCGGCA TGGAAGTGTC AGGCTACCAC CACAACGTGC GCGGCGGTTT CCTCATGCTC CGCCCCGGCC TCTACGCTGC GGGCCCCGGC GAGACCAGGT TCCGCAATTT CACCTTCACT GCACTGGAAG ACTGA
|
Protein sequence | MDRRDAMKSA AIAALGLATP LPALAGPATA ATASAPRWKR GLDNQRIADL GDGTFLNPVL AGDRPDPAIL KDGKDYYLTF SGFESYPGLL IWHSRDLVNW APRKPALHRN IGSVWAVSLD RHKGRYYLYI PVKASPQNDT FVIWADHIDG PWSEPKPLGL PNHIDPCHAV GEDGSRWLFL SGGDRVRLSD DGLSLAGPVE HVYDPWRYPE EWDVEGFAPE GPKIHRVGKW FYMLTAVGGT AGPPTGHMVI AARSRSIHGP WEQHPGNPLV RTTDIREAWW SRGHASLVEG PDRTWWSLYH GFENGFWTLG RQCLLDPVEW TADGWFRMTG GDLSRPLAKP KGGEALPHGM ALSDDFSRLQ LGTKWSFFRP SPDEASRARV VGNTLLLKGK GLAPSTGSPL LLIAGDTSYR FECDIELAPG ATAGLVLFYD DKLYAGLGFD AERFVTHQYG MERGRPANPH GTAMRIRVTN RRHIVSYHTS GDGGQTWRKF DRGMEVSGYH HNVRGGFLML RPGLYAAGPG ETRFRNFTFT ALED
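The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the following strips that grouping, computes a GC fraction, and translates a coding sequence with the amino acid assignments of translation table 11 (identical to the standard code). The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and the start-codon handling is a simplification: the first codon is assumed to be a valid initiator and is always written as M, which is how the record's GTG start corresponds to the leading M of the protein.

```python
# Codon table in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator and is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
fragment = "GTGGACCGGC GCGATGCGAT GAAGTCGGCC"
print(translate_cds(fragment))          # MDRRDAMKSA
print(f"{gc_fraction(fragment):.2f}")   # 0.70 (this fragment only)
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the GC content and protein length fields to be checked against the record.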