Gene Saro_2446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2446 
Symbol 
ID3916765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2632413 
End bp2634422 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content65% 
IMG OID640445201 
Productalpha-glucosidase 
Protein accessionYP_497716 
Protein GI87200459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATCTGC GCTTTGTAGC CGGAACTGAA GGGTTCGATC TCCGAATTGG GGATCGTGTG 
GTGCTGCGCC ACTCCGAATC CTGTCCTGCG CTGGTCATGG CGCGCGGCAA TCCGGAAGTC
GTCATGTACC GTGGCAATTT CCGTATATCC GACAAGCCTT TCGAAGAGCT GGTCCCGCTC
GCATGGCGTC AATGCGAGGG CGGGGTGGAG CTTCTCGACA GCGGGGCGCC GGTCGCCTCG
ATCGTCCTTG CCGGCAATTC CTTCGAGATT CGCGCTGCCG ACCCCGCGCA CGATCGCCTG
CGCATCCGCT TCCACGCCGA GGCGGGCGAA TGCGTCTGGG GTGGGGGCGA GCAGATGAGC
TACCTCGCGC TCAATGGCCG CAACTTCCCG ATCTGGACGA GCGAGCCTGG CGTGGGGCGC
GACAAATCGA CCGAGCTTAC CCGCCGCATG GATGCCGAGG GCATGGCCGG CGGCGATTAC
TGGAACACCA ACTACCCGCA GCCCACGTTC ATTACCTCGC GCTGGATTGC GGTTCATCTC
GACGCGAGCT TCTACAGCGT TCTCGATTTC ACGGATCCGG CTGGCCACCT GGTGGAAGTC
TGGAGCGGCG CTGCCCGTTT CGAGGTTTTC TCCGCCGATG GCCCGCAGGA TCTCGTCGGC
CAGCTTTCCA CGCGTTTCGG CCGCCAGCCC GAACTGCCCG AATGGGCCAT CGGTGGTGCT
ATCGTCGGCC TGAAGTCTGG CGCCTCTAGC TTCGACCGCC TGGAAAAGTT CATCGACGCC
GGCGCCGCCG TGTCGGGCCT GTGGTGCGAG GATTGGGCCG GCATTCGCGA GACCAGTTTC
GGTCGCCGCC TCTTCTGGGA CTGGCACAGC GGAGAGCGCA GCGGTGCCCG CTATCCGCAA
CTGCACGAAA GGATTCAGGC GCTCGAAGCG CGCGGCATCC GCTTCCTTGC CTATGCCAAC
CCCTATATCG CGGTGGATGG CGATCTCTAC CAGGAAGGCC GTGCCGGCGG GCATTTCTGC
CTTCGCCGGG ATAGCGACGA AGTCCACCTC GTCGACTTCG GGGAATTCGA CTGCGGCGTC
GTCGATTTCA CTCGCGAGGA GACGTGCGCG TGGTTTGCCG AGCGGGTCCT CGGTCGCGAA
ATGCTAGACA TCGGCATTTC CGGATGGATG GCCGATTTCG GCGAATACCT GCCGACCGAC
CTGCGCCTGG CCGACGGTTC CGACCCGATG GAGGCGCACA ACCGCTGGCC GGTGCTCTGG
GCCGAGGTCA ATGCGCGTGC ATTGGCAAGC CGGGGCAAGA CCGGGGACGC GCTGTTCTTC
ATGCGCGCCG GGTTCTCCGG CGTGCAAGCC CATTGCCCTC TGCTTTGGGC GGGCGACCAG
TCGGTCGATT TCACCCGCCA CGACGGCATT GGTACGGTCA TCACCGGCGC GCTGTCGGCG
GGTCTGGTGG GCAACGCCTA CAGCCACTCC GACTGCGGTG GTTACACCTC GCTCCACGGC
AATGTCCGAA CCGAGGAACT GTTGCACCGC TGGTGCGAAC TGGCAGCCTT CGCTCCGGTC
ATGCGCAGCC ACGAGGGCAA CCGGCCTGAC GACAACCTTC AGTACGACAG CACGCCGGAC
CTCCTCGCCT GCTTCGCCCG CTGGAGCCGC GTCCACGCTC ACCTCGCGCC CTATGTCCGC
CATCTCTGCA CGGAGGCGCG AGAGTCGGGG CTTCCGGCGC AGCGCCCGTT GTTTCTCCAC
TACCCGCAGG ATTCCGCGCT CTTTACGGTG CAGGACCAAT ACCTCTACGG TGCCGACCTG
CTGGTGGCGC CAGTGGTCGA GGAGGGGGCT CGCCGTCGCC AGGTGGTCCT TCCGGGGACG
GGCATCTGGC GGCATTGCTG GACGGGCGAG GATTTCGCCC CCGGCACGCA CGATATCTCC
GCCCCCATCG GGCAGCCGCC GGCATTCTAC CGTCCGGACA GCGCCTTCGC GTCGCTGTTC
GCGGGACTGA AGGGGGTGCT TGAAGGATGA
 
Protein sequence
MNLRFVAGTE GFDLRIGDRV VLRHSESCPA LVMARGNPEV VMYRGNFRIS DKPFEELVPL 
AWRQCEGGVE LLDSGAPVAS IVLAGNSFEI RAADPAHDRL RIRFHAEAGE CVWGGGEQMS
YLALNGRNFP IWTSEPGVGR DKSTELTRRM DAEGMAGGDY WNTNYPQPTF ITSRWIAVHL
DASFYSVLDF TDPAGHLVEV WSGAARFEVF SADGPQDLVG QLSTRFGRQP ELPEWAIGGA
IVGLKSGASS FDRLEKFIDA GAAVSGLWCE DWAGIRETSF GRRLFWDWHS GERSGARYPQ
LHERIQALEA RGIRFLAYAN PYIAVDGDLY QEGRAGGHFC LRRDSDEVHL VDFGEFDCGV
VDFTREETCA WFAERVLGRE MLDIGISGWM ADFGEYLPTD LRLADGSDPM EAHNRWPVLW
AEVNARALAS RGKTGDALFF MRAGFSGVQA HCPLLWAGDQ SVDFTRHDGI GTVITGALSA
GLVGNAYSHS DCGGYTSLHG NVRTEELLHR WCELAAFAPV MRSHEGNRPD DNLQYDSTPD
LLACFARWSR VHAHLAPYVR HLCTEARESG LPAQRPLFLH YPQDSALFTV QDQYLYGADL
LVAPVVEEGA RRRQVVLPGT GIWRHCWTGE DFAPGTHDIS APIGQPPAFY RPDSAFASLF
AGLKGVLEG