Gene Saro_1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1878 
Symbol 
ID3917099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1979797 
End bp1981896 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content68% 
IMG OID640444622 
Productglycoside hydrolase, clan GH-D 
Protein accessionYP_497152 
Protein GI87199895 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGG TATTCGCCTT GCACTCCGGG GACGGCAGTC TCGTCTGGGA AGCGGACGAC 
GGCAGCGCGC CCGCGTGGCG GCACTTCGGC CCCCGGCTGA CAGCCGACGG GATCCGGCCG
ATCCGGGACC AGCGCGCGCC TGCGTCATAC TCGCTGGACG ACGACGTACC GTTCGCCGTC
GCACCGGCGG CCGGGCTGGG CTGGTTCGGA CCTTCGGCCA TGTGCTTGCG GCGCGGCTCA
GTCGCACTCG TGCCGGCAAT GGCGGGGGCG TCCGTCGCCG GGGATGCCGG CTCCGTGCGC
ATCATCACGC GCGATCCGGT TGCGGGCGTG GAGCTTGAAC AGGTGTTCGA AGCAGTTGGC
GGTGCTTTCG TCTGCCGCAG CGTCGTGCGC AACATCGGCA GTGAAACGTT CGAGGTCGAC
TGGCTGTCGA GCGCGCTCCT CCCCATCGCG GGTTCCGCGC GCGAGATCGT GTCGTGGCGT
GGACGCCACA ACGCCGAACT GGTCGAATGC CGCGAGCCGA TGCCGCAGCA TTCGTGGGTG
CGCGAAGGTC GGCGCGGGAT TTCCGGGCAC GGCGGCCCTC CGGGCCTGTT CGTGCTCGAC
GAAGGCGCTA CGTATCACGC GGGCACTGTC CGCGCGCTGC AGCTTGCGTG GTCTGGCGAT
GCGCGCATCG AGGTCGAGCG CGACGACGAG GGTTTCTGGA CGTTGAACGC GGGCGCGGTG
CTTCAACCTG GCGAGGCCAG CCTTGCGCCC GGCGAGAGCT GGCAGTCGCC CGACGCCATC
GTGACGGTAT CAACCTCCGG CCGGAACGGC GCGGCGCAGG CGTTTCACGA TGCAGTCCGC
GCGCGCATCC GCTGGCCGGA CGGGGCCATG CGCCCACGGC CCGTGCACCT CAACTCGTGG
GAAGCCTGCT ATTTCGACCA CGACGAGGAG CGCATCGTTG CCCTTGCGGA AGCGGCGGCA
TCCGTGGGCG TGGAGCGGTT CATTCTCGAC GATGGCTGGT TCCGGGGCCG CAACGACGAC
ACGGCGGGAC TGGGCGACTG GACAACCGAT CCCGTCAAGT ATCCGCACGG ACTGCGCACC
CTGGCAGACC GCGTCAATGC GCTGGGCATG GAGTTCGGGC TCTGGGTCGA ACCGGAAATG
ATCAATCCCG ATAGCGACCT CTATCGGGCG TATCCGGACT GGGCGCTTGC GCTTCCGGGG
CGGAAACGAC CGACTGCCCG GAACCAGCTC GTGCTTGACA TGCGGCGACG GGACGTTCGC
GACCACCTCT TCGGCTGCAT CGACGCGCTG CTGCGCGAAT TGCCGATCAC CTACCTCAAG
TGGGACCACA ACCGCGACCT CGCGCCTGCT GGCGGGGCGG CGCAGATGAG GGGGGCGTAT
GAACTGTTCG CGCGGGTCCG CGCAGCGCAT CCCGCCGTGG AGATCGAGGC CTGTGCTGGC
GGCGGCGGGC GCAACGATGC AGGTATGGCC GACTATTGCC ATCGCTACTG GACCAGCGAC
AATATCGACG CGGCCAGTCG CATCGGTATC CAGCGCGGGT TCCTGTCGTT CCTCCCTCCC
GAGGTGATGG GATCGCACAT CGCCGCAAGC CCCGCCCATG CCACGGGGCG CAGGCATTCC
CTGGGCTTCC GGGCTGCCAT GGCGATGGCC GGTCATCTCG GCGTGGAAAT GGACCCGCGC
ACGCTGGGCG ATGCAGAGCG CGCCGAACTG GCCGACTGGA TCGCCTTTCA CAAGCAATGG
CGCGGATTGC TGCACCAGGG CACTGTCTGG CTGGGCGAGG GTGCGGATGC AGCGTTCTGG
CAGGCGCAGG GGAATGCAGC GGAACTGCTG CTCTTCGTGA TCCGCGCGGA CCCGCCGCTG
GACCGTCGCC CGCAGCCTTT GCCACTGCCT TTCGCGGGCC AGGATGGAAC ATGGGACATC
CGGCTTCTTC GTATCGCCGG GGGCGAGGGC GGGCATGCCG CGCATTCCGC TGCGCTGTTC
GAGGCAATGA AGGCCGCCCC TCAGGCGTTC CCGGCTGACT GGCTCTCGGC CAACGGCCTG
CCGCTCCCGC CCTGCAAGGC GGAGACGGTC ACCATCTTCC ATCTGCGCAA ACGCGCCTGA
 
Protein sequence
MGEVFALHSG DGSLVWEADD GSAPAWRHFG PRLTADGIRP IRDQRAPASY SLDDDVPFAV 
APAAGLGWFG PSAMCLRRGS VALVPAMAGA SVAGDAGSVR IITRDPVAGV ELEQVFEAVG
GAFVCRSVVR NIGSETFEVD WLSSALLPIA GSAREIVSWR GRHNAELVEC REPMPQHSWV
REGRRGISGH GGPPGLFVLD EGATYHAGTV RALQLAWSGD ARIEVERDDE GFWTLNAGAV
LQPGEASLAP GESWQSPDAI VTVSTSGRNG AAQAFHDAVR ARIRWPDGAM RPRPVHLNSW
EACYFDHDEE RIVALAEAAA SVGVERFILD DGWFRGRNDD TAGLGDWTTD PVKYPHGLRT
LADRVNALGM EFGLWVEPEM INPDSDLYRA YPDWALALPG RKRPTARNQL VLDMRRRDVR
DHLFGCIDAL LRELPITYLK WDHNRDLAPA GGAAQMRGAY ELFARVRAAH PAVEIEACAG
GGGRNDAGMA DYCHRYWTSD NIDAASRIGI QRGFLSFLPP EVMGSHIAAS PAHATGRRHS
LGFRAAMAMA GHLGVEMDPR TLGDAERAEL ADWIAFHKQW RGLLHQGTVW LGEGADAAFW
QAQGNAAELL LFVIRADPPL DRRPQPLPLP FAGQDGTWDI RLLRIAGGEG GHAAHSAALF
EAMKAAPQAF PADWLSANGL PLPPCKAETV TIFHLRKRA