Gene Saro_1611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1611 
Symbol 
ID3918719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1679091 
End bp1681526 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content67% 
IMG OID640444351 
ProductBeta-glucosidase 
Protein accessionYP_496885 
Protein GI87199628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATGC GAGCACACCT GCGGGGCCTG GCGCTAATCC TGCTGACCGG CAGCAGCCTC 
GCCGCAGCAG CCGACCGGGT TGATGTCCAG GCGGCGCCTT CAGGCATTGC CAACCCTGCG
GCGTGGCCCG CCGCGCACAG CCCATCCGCC ATCACCGACC CCGCGACGGA GCGGGCAATT
ACCCGCATTC TCAAGCGCAT GACGCTGGAG CAGAAGGTGG GGCAAGTCAT CCAGGGCGAC
ATCAGCTCGA TCACGCCTGC CGATCTGGAG CGATATCCGC TGGGCTCGAT CCTCGCTGGC
GGGAACAGCG GCCCCTACGG CAACGAACGC GCCGACGCCG CCACCTGGCT GCGCCTCGTC
AACGAATTCC GCGCAGCTTC GCGCAAGGCG GGGGCGGGGG TGCCGATCCT GTTCGGTGTC
GATGCCGTCC ACGGCCACTC CAACATTCCC GGAGCGACGA TCTTCCCGCA CAACGTCGGC
CTCGGCGCGA CACGCGATGC GGATCTGATC CGCCGGATCG GCCAGGCAAC CGCTGCCGAG
GTGGCGGGGT CGGGTATCGA GTGGACCTTT GCGCCGACCC TGGCGGTGCC TCAAGACTTG
CGCTGGGGGC GCGCGTACGA GGGCTATTCA AGCGATCCGC AGGTCATCGC CCGCTACGCT
CCGGCGATGG TAGAGGGCCT TCAGGGCACT TTAGGTGCCG TTCGGGTCTT GCCGTCGAAC
CGCGTCGCGG CAAGTGCGAA GCACTTTCTG GCCGATGGCG GGACCGAGAA CGGCAAGGAT
CAGGGTGATG CCAAGCTCTC TGAAGCCGAC CTCGTGCGTA TTCACGCACA GGGCTATCCC
CCCGCCATCG ATGCCGGTGC GCTGACGGTA ATGGCAAGCT TTTCAAGCTG GAACGGGATC
AAGAACCACG GCAACCGCTC GCTCCTGACC GATGTCCTCA AGAAGCGCAT GGGTTTCGAG
GGGCTGGTCG TGGGTGACTG GAACGGTCAT GGTCAGATTC CGGGATGCAC CACCACCGAT
TGCCCGTCGG CCCTCAACGC CGGCCTCGAT CTCTACATGG CGCCCGATAG CTGGAAAGGC
TTGTTCGACA ATACCTTGCG GGAGGTTCGT GAAGGGAAGA TCAGCAAGAC CCGGCTGGAC
GACGCCGTAC GCCGCATCCT GCGCGTGAAA TTCAAGCTGG GTCTGATGGG GCCCAGACTG
GTCGAGCGCG GCGATCCCGC GGCGGTTGGC GCCGATGCCC ATCTCGAAAT CGCGCGGGAA
GCAGTCGCCA AGTCATTGGT CTTGCTCAAG AATGAGGGTG GCGTCTTGCC CATCCGTCCC
GGTGCGAGGG TGCTGGTCAC CGGGCCCGGG GCCGACAACA TGGCAATGCA GGCGGGCGGG
TGGACGATCA CCTGGCAGGG TACGGACACC AGCGCCGCCG ATTTTCCCAA GGGCCGCACC
ATCGGTCGGG CGATCTCGGA GACTGTCGCC GAGGCCGGGG GCAAGGCGGA GATTGCTTCC
GATCTGCCTC CGGGGGCCAT GCCCGATGTC GCGGTCGTCG TTTTCGGTGA GCAACCCTAC
GCCGAATTCC AGGGTGATGT GCCGAACCTC GATTTTCACG CGCGGGCGGG TGAACTGGAC
CTGATCAAGC GTCTGAAAGC GCGGGGTATT CCCGTAGTGG CCCTGTTCCT TTCCGGGCGT
CCGATGTTCG TCGGGCCTGA AATGAACCTT GCCGACGCAT TCGTGGCGGC GTGGCAGCCC
GGGTCGCAGG GGCAGGGCGT TGCCGACGTC CTCGTTGCGC GCAAGGACGG CAAGCCAGCG
CGCGATTTCA CTGGCACGTT GCCGTTCGCA TGGCCGCAGG ACGCGCGCTC TCCGCTGGTC
GATCCGCTTT TCCCGCTGGG CTACGGCCTA TCCTATGCAA GGCCGGGCAA GGTCGGTCCG
GTGAACGAGG ACGCCCGGGT CGAGCACGGG CCAGCCATGA GCGAGACCAC CTTCATCCGT
CACGGCAAGG TGATCGAGCC GTGGCGGCTG GGCCTGGACA GTGTCGTGTC GACCCGCGCG
GTCGACGTTG CCGCACAGGA GGATGCGCGC CAGTTCCGCT GGGCCGGGCA GGGCGCGATC
GCGCTCGATG GTCCGCCGGT CGACATGGAG CGCCACCGCA ACGGCGGCTT CGTCATGCGG
CTGGACTGGC GGATCGATGC GCGCGGAACC GGGCCGGTGA CCGTCGCTCT GGGAGGATCG
CGGCTCGATG TGACCTCGAT CGTCGATGCC AGCGTGGCGG GCAAGCCGGT GGCGCTACGG
ATACCGCTTC AGTGCTTTGC CGCGAACGGT GCGACGCTGA AGGAAGTGGG GCAGCCACTG
CGCATCGGTG CCGACGCCGG CTTTACCGCA AGCATTCGCA ACGCCGGTAT CGAAGGCGTG
GGCGAAAGCA TCCCGTGTCC GCGCCCGGCG CGCTGA
 
Protein sequence
MPMRAHLRGL ALILLTGSSL AAAADRVDVQ AAPSGIANPA AWPAAHSPSA ITDPATERAI 
TRILKRMTLE QKVGQVIQGD ISSITPADLE RYPLGSILAG GNSGPYGNER ADAATWLRLV
NEFRAASRKA GAGVPILFGV DAVHGHSNIP GATIFPHNVG LGATRDADLI RRIGQATAAE
VAGSGIEWTF APTLAVPQDL RWGRAYEGYS SDPQVIARYA PAMVEGLQGT LGAVRVLPSN
RVAASAKHFL ADGGTENGKD QGDAKLSEAD LVRIHAQGYP PAIDAGALTV MASFSSWNGI
KNHGNRSLLT DVLKKRMGFE GLVVGDWNGH GQIPGCTTTD CPSALNAGLD LYMAPDSWKG
LFDNTLREVR EGKISKTRLD DAVRRILRVK FKLGLMGPRL VERGDPAAVG ADAHLEIARE
AVAKSLVLLK NEGGVLPIRP GARVLVTGPG ADNMAMQAGG WTITWQGTDT SAADFPKGRT
IGRAISETVA EAGGKAEIAS DLPPGAMPDV AVVVFGEQPY AEFQGDVPNL DFHARAGELD
LIKRLKARGI PVVALFLSGR PMFVGPEMNL ADAFVAAWQP GSQGQGVADV LVARKDGKPA
RDFTGTLPFA WPQDARSPLV DPLFPLGYGL SYARPGKVGP VNEDARVEHG PAMSETTFIR
HGKVIEPWRL GLDSVVSTRA VDVAAQEDAR QFRWAGQGAI ALDGPPVDME RHRNGGFVMR
LDWRIDARGT GPVTVALGGS RLDVTSIVDA SVAGKPVALR IPLQCFAANG ATLKEVGQPL
RIGADAGFTA SIRNAGIEGV GESIPCPRPA R