Gene Saro_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0789 
Symbol 
ID3915842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp837765 
End bp839096 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content62% 
IMG OID640443519 
Productglycoside hydrolase family protein 
Protein accessionYP_496068 
Protein GI87198811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0245888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGATC GTCGCAGCCT GATGGCTTCC GCCGTGGCTT TGAGCGCCTC AGTGATGACC 
AGCAAAAGCG CCTTGGCCCG GTCCGCCAAG CCGAGGCCGC TCGATCCGCA GTTTCCCGAA
GGCTTCCTGT GGGGCGCAGC CACCGCCGCG CATCAGATCG AAGGCAACAA TCTCAACGCC
GACCTCTGGG TGATCGAAAA CGTCCCCGGC ACCATTTTTG CAGAACGGTC GGGCGACGCC
GCAAACAGCT TCGAACTGTG GCCGGTCGAC CTTGATCTCG TGAAGGGCAT GGGGCTCAAT
TCCTATCGCT TCAGCCTCGA ATGGGCGCGG ATCGAGCCGG ATGAAGGGCA TTTCTCCAAT
GCCATGCTCG ATCACTACAA GGCGATGATC GAGGGTTGCC GGGCGCGGGG GCTCAAGCCG
GTCGTCACCT TCAACCATTT CACCACCCCG CGCTGGTTCG CAGCCAAGGG CGGATGGCAT
AATCCGGAGT CATCGGCGCT CTTCGCCCGC TTCTGCGAAC GGGCGGCGCG CCATCTTGCG
GCAGGAATCG AACTCGCCAC AACATTGAAC GAGCCCAATC TGGCTGGCGT GATCGGCGAG
ATCCTGCCGC CGCCACTGGT GGCAGGCGAC AAGGCAACGC AAGAAGCAGC AGCGAAGCAG
CTCGGCGTGG CGCTTTATAC GCCCGGCGTC GCGCTCTACA TCAAGGAGCC GAAGACCTAT
CGCGCCAACA TGATGGAAGG CCATCGCCGT GGGGTCGCCG CAATCAAGGC GGTGCGGCCC
GACCTGCCGG TTGGCGTAAG CCTGGCGATG ATCGACGATC AGGCGGTTGG CAAGAACTCG
ATGCGCGACC GGATCCGCGA ACGCTACTAC AACGAGTGGC TGCGCCTTGC GGGCGAGACT
TGCGATTTCA TCGGTGTGCA GAACTACGAA CGCAAGGTCT GGACCGACAA GGGCGAGTTG
CCGTCCCCCG CCGATGCGCG GCGCAATACT GGCGGCGCTG AAGTCTGGCC CGGGTCGCTG
GCGGGCGCGG TGCGCTATGC CCATGCAGTG ACCAAGCTGC CGGTCTATGT CACCGAACAC
GGCGTCAATT CCGACGACGA CGCGCTGCGC CAGTGGTTGA TCCCCGAAGC GCTTACCGAA
CTGAAGCGTG CGATCGACGA TGGTGTGCCG GTGCGCGGCT ATATCCACTG GTCGCTGATC
GACAATTTCG AATGGGGCTT CGGCTACAAG TACCGCTTCG GCTTGCACTC GTTCGACCAA
AGCACCTTCC AGCGAACCGC CAAACCCAGC GCAGCAATTC TGGGCAGAAT TGCACGGCGA
AATAGGCTCT GA
 
Protein sequence
MIDRRSLMAS AVALSASVMT SKSALARSAK PRPLDPQFPE GFLWGAATAA HQIEGNNLNA 
DLWVIENVPG TIFAERSGDA ANSFELWPVD LDLVKGMGLN SYRFSLEWAR IEPDEGHFSN
AMLDHYKAMI EGCRARGLKP VVTFNHFTTP RWFAAKGGWH NPESSALFAR FCERAARHLA
AGIELATTLN EPNLAGVIGE ILPPPLVAGD KATQEAAAKQ LGVALYTPGV ALYIKEPKTY
RANMMEGHRR GVAAIKAVRP DLPVGVSLAM IDDQAVGKNS MRDRIRERYY NEWLRLAGET
CDFIGVQNYE RKVWTDKGEL PSPADARRNT GGAEVWPGSL AGAVRYAHAV TKLPVYVTEH
GVNSDDDALR QWLIPEALTE LKRAIDDGVP VRGYIHWSLI DNFEWGFGYK YRFGLHSFDQ
STFQRTAKPS AAILGRIARR NRL