Gene Saro_1719 details


Gene Information

Locus tag: Saro_1719
Symbol:
ID: 3916294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1807270
End bp: 1809192
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 68%
IMG OID: 640444460
Product: glycoside hydrolase family protein
Protein accession: YP_496993
Protein GI: 87199736
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
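
The gene and protein lengths listed above can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1807270
END_BP = 1809192


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1923 640
```

Both values agree with the Gene Length and Protein Length fields of the record.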


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGGCT TGAAGATTTC TGCACCAATG GTGCTGCTTG CCGGTGTCGG GGCGGTCGCG 
CTGGGGGCAT CCGCGTCGGC AAGGCCCGCC TTTCGCGACC TGAACCACAA TGGCCGGCTC
GATCCATACG AGAACGTCGC GCTGCCGGTC GAACGGCGGC TCGACGATCT GCTCAAGCGG
ATGACGCTGG AGGAAAAGGT CGCGCTCATG CTCCACGGAA CGCTTCAGGC CGAAGGCGGA
CGCGGCATCG GGGTGGGCAA TGCCTACGAC ACGGCCCTTG CGCAGGAACT CCTTGCGCGC
GGGGTCAACA GTTTCATCAC CCGCATCGCA CCCGAACCCC GCGCCTTCGC GACGCAGAAC
AATGCGATCC AGCGGCTTGC CGAGGCGACC CGGCTCGGCA TTCCCGCCAC GATCAGCACC
GATCCGCGCA ATCACTTCCA GGTAGTGGTC GGCGCAAGCA GCGATGCCAG CGGCTTCTCC
AAATGGCCCG AGACGCTCGG CATGGCCGCC ATCGGCGACG AGAAGCTGGT TGAGCGGTTC
GGCCGACTGG TCGCCGCCGA ATATCGCGCC GTCGGCATCC AGATGGCCCT GTCGCCACAG
GCCGACCTCT ATACCGAGCC ACGCTGGCCG CGCGGCAACG CGACGTTCGG CTCGGACCCG
GCCACGGTCT CGCGTCTTGC CGGTGCATAT GTGCGCGGGT TCCAGGGCGG CGCGGACGGA
CTCGTCCGCT CTGGCGTCGC GACCGTGGTC AAGCACTGGG TCGGCTATGG CGCCGAGCCC
GAGGGCTTTG ACGGCCACAA CTATTACGGC CGCATCGCCC GTCTCGACAA CGAAAGCTTC
GCCCAGCACG TTGCCGCTTT CGAAGGCGCG CTCGCCGCGA AGTCGGCAGG CGTAATGCCC
ACCTATGTCA TCGCGGAGGG CGTGAGCATC GATGGCAAGC CGCTGCCGCA GGTCGGCGCG
GGCTTCAGCA AGCCTCTGAT CGAGGGACTT CTGCGCGGCA CCCACAAGTT CGGCGGCATC
GTCATTTCCG ACTGGGGCAT CACCAACACC TGCCCCGAAC AGTGCAGCAA CCCGACGGCC
GAAAAGCCCC AGGGCTTTGC GATCGCGATG CCGTGGGGCG TCGAGGGTCT GTCCGAGGAA
GACCGCTACG CGCTGGGCGC GAATGCCGGG ATCGACCAGT TCGGCGGCGT CGACAATCCA
GGCCCTCTGC TTGCCGCCGT CCGTGCCGGG AAGGTTTCGC CCGCCCGGGT CGATCAGGCC
GCCCGCCGCG TGCTTCGCCT GAAGTTCGAG CTTGGCCTGT TCGACGATCC CTATGTCGAC
GTCGAGAAGG CAGCGCAGAT CGTCGGCAAC AAGGCGACGC AGGCCGAGGC CGATGCAGCC
CAGCGCGCCG CGCAGGTGCT GCTGTTGAAC CGCAACGCGC TTCTGCCGCT CGCGCCGGGC
CGAAAGGTTT GGCTCTCCGG TGTCGACGCC AGTGCCGCGC GTGCTGCCGG GCTGGTCGTG
GTGGACACCG CCGAACAGGC CGAGGTGGCC ATCGTTCGCG TTGCCACCCC GCACGAAGTC
CTTCATCCTC ACCACTTCTT CGGCTCGCGC CAGAACGAGG GCCGCCTCGA CTTCCGTGCT
GGAGACGAGG CGACCAGAAC CATCGCCGCT GCCGCTGCGC GCATCCCGAC CGTTGTGGCC
GTCGACCTCG ATCGTCCTGC CGTGCTGACC GAGGTGAAGG ACAAGGCTAC CGCGCTCTTC
GGCCTGTTCG GCGCCAGCGA TGCCGTGTTG CTGGATCTCG TCACCGGCAA GGCCCGGCCG
GCGGGCAAGC TGCCGTTCGA ACTGCCGTCC TCGGCAAAGG CGGTCGAGGA TCAGCATCCG
GGCCGACCGG ACGACAGCGC CAACCCGCTC TTCCGGCGCG GTGACGGGCT GACCTATCCC
TGA
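
The gene sequence above is printed in space-separated blocks of ten bases, so the blocks have to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATGGGCGGCT TGAAGATTTC TGCACCAATG GTGCTGCTTG CCGGTGTCGG GGCGGTCGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to all lines of the gene sequence gives a value that can be compared with the GC content field of the record.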
 
Protein sequence
MGGLKISAPM VLLAGVGAVA LGASASARPA FRDLNHNGRL DPYENVALPV ERRLDDLLKR 
MTLEEKVALM LHGTLQAEGG RGIGVGNAYD TALAQELLAR GVNSFITRIA PEPRAFATQN
NAIQRLAEAT RLGIPATIST DPRNHFQVVV GASSDASGFS KWPETLGMAA IGDEKLVERF
GRLVAAEYRA VGIQMALSPQ ADLYTEPRWP RGNATFGSDP ATVSRLAGAY VRGFQGGADG
LVRSGVATVV KHWVGYGAEP EGFDGHNYYG RIARLDNESF AQHVAAFEGA LAAKSAGVMP
TYVIAEGVSI DGKPLPQVGA GFSKPLIEGL LRGTHKFGGI VISDWGITNT CPEQCSNPTA
EKPQGFAIAM PWGVEGLSEE DRYALGANAG IDQFGGVDNP GPLLAAVRAG KVSPARVDQA
ARRVLRLKFE LGLFDDPYVD VEKAAQIVGN KATQAEADAA QRAAQVLLLN RNALLPLAPG
RKVWLSGVDA SAARAAGLVV VDTAEQAEVA IVRVATPHEV LHPHHFFGSR QNEGRLDFRA
GDEATRTIAA AAARIPTVVA VDLDRPAVLT EVKDKATALF GLFGASDAVL LDLVTGKARP
AGKLPFELPS SAKAVEDQHP GRPDDSANPL FRRGDGLTYP
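
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the codon-to-amino-acid assignments of table 11 are the same as in the standard code (the tables differ in which start codons are permitted). Alternative start codons are not handled here, and the names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGCGGCT TGAAGATTTC TGCACCAATG"))  # MGGLKISAPM
```

The output matches the first ten residues of the protein sequence above.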