Gene Saro_1884 details

Gene Information

Locus tag: Saro_1884
Symbol:
ID: 3917105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1989933
End bp: 1991255
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 67%
IMG OID: 640444628
Product: glycoside hydrolase family protein
Protein accession: YP_497158
Protein GI: 87199901
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00690154
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACCCC TGTACCATTA CGCGCCCGCC GCCAACTGGC TGAGCGACCC CAACGGCCTC 
GTGTGGCAGG ATGGGGAGTG GCACCTGTTC TACCAGTACA ATCCCTTCGG CGAGGACTGG
GGGCACATGT CCTGGGGCCA CGCGGTCAGT CGCGATCTTG TGACTTGGCA GGAACTGCCG
GTGGCTCTTG CCGAAGAAGA CGGGACGATG ATCTTCTCGG GATCTGCAGT CATCGATCAC
CAGGGCAGCG CGGGCTTCGG AAAGGGCGCG ATGGTCGCGG TCTATACCGG CGCGCGAACC
GACCGGGCGC ATCAGTTCCA GAGCATTGCC GCCAGCACGG ACCGGGGGCG CACTTTCACC
AAGTTCACGG GCAACCCGGT GCTCGACCTG CAAATGGCGG ACTTCCGCGA TCCGAACGTG
TTCTGGCACG GGCCGTCGGG CCGGTGGATC ATGTCGGTCG TGCTTTCGGA GGAGAACCGC
GCGCAATTGT ACGCTTCGGT GGACTTGAGG CACTGGGATC TGCTGTCCGA TATTGGGCGC
GACGGTGCGC CCGGGCACTT GTGGGAATGC CCCTGGATGG TCGAGTTGCC GGTGGAGGGC
ACCGACGAGA CACGTTGGCT GTTCAAGGTC GATGTCCTGT CCGGCGCGGC AGGCCAGGCA
TCGCCGTGGC GCGTGGGGCA TTTCGACGGA AGGCGGTTCG TTCCGGAGAC CGGTTGGGCG
GTGGGTGACC ACGGGCCTGA CTTCTATGCG GCAATCGGTT GGAACGCCGC CCCGGATGCG
GAAGGACGAC CCGTCTGGAT CGGATGGGCG GGCAATCACG CCTATCAGAA GTACCTCGAG
CCGCAGGGCT GGCGCGGCGC GATGACGCTG CCGCGCCGTG TCTCGCTTCG CCGTGACGAA
AGTGGCTACG CGCTGGTGCA GGAAGTGGAA CCGGCCTGCC GCGCTCTGTT CGGCAAGGCC
GAGGCCACGT CGCTGTGCGA TGGGTGCATC CCGCTGCCGT CAGCGGCGCT GCTGAGCTTT
CCGGCCGGCA AGGACTGGTC CTTCAGGATC GAGGACGATC ATGGGCGCAG CATCGAGTGC
GCGCTCTCGG GGGGGCGGCT GACCGTGCGC CGACACGACC CGGTGACGCC GCAGCTCGAA
CATCGCGCGG CGATGGAAGC AGGGAAGGGC CCTGTCGAAC TGTGGCTCGA TGCAGGCAGC
CTTGAGGTCT TCGCGAACCG GGGCGGGGCA GTTCTGACGT TGCAGCATCG CCTCGCCGGC
GAGGCCTGGG CCTTGCGCGG CGAGGGCGCG TGCAGCGTGG CCTATCCCGC CGCGATGGCC
TGA
 
Protein sequence
MRPLYHYAPA ANWLSDPNGL VWQDGEWHLF YQYNPFGEDW GHMSWGHAVS RDLVTWQELP 
VALAEEDGTM IFSGSAVIDH QGSAGFGKGA MVAVYTGART DRAHQFQSIA ASTDRGRTFT
KFTGNPVLDL QMADFRDPNV FWHGPSGRWI MSVVLSEENR AQLYASVDLR HWDLLSDIGR
DGAPGHLWEC PWMVELPVEG TDETRWLFKV DVLSGAAGQA SPWRVGHFDG RRFVPETGWA
VGDHGPDFYA AIGWNAAPDA EGRPVWIGWA GNHAYQKYLE PQGWRGAMTL PRRVSLRRDE
SGYALVQEVE PACRALFGKA EATSLCDGCI PLPSAALLSF PAGKDWSFRI EDDHGRSIEC
ALSGGRLTVR RHDPVTPQLE HRAAMEAGKG PVELWLDAGS LEVFANRGGA VLTLQHRLAG
EAWALRGEGA CSVAYPAAMA
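
Consistency check (illustrative sketch)

The record's coordinates, lengths, and sequences can be cross-checked against each other. The sketch below is one possible way to do that, not part of the source record. It assumes the start and end coordinates are 1-based and inclusive (which agrees with the stated 1323 bp), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code. The function names clean, gene_length, and translate are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for elongation codons (it differs in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG) would be shown as its
    elongation amino acid here; this gene starts with ATG, so that case
    does not arise.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Values taken from the record above.
length = gene_length(1989933, 1991255)
print(length)             # 1323
print(length // 3 - 1)    # 440 (codons minus the stop codon)
print(translate("ATGCGACCCC TGTACCATTA CGCGCCCGCC"))  # MRPLYHYAPA
```

Passing the full gene sequence to translate and comparing the result with clean applied to the protein sequence would extend the same check to the whole record.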