Gene Saro_1892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1892 
Symbol 
ID3917113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2002968 
End bp2005016 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content64% 
IMG OID640444636 
Productalpha-glucosidase 
Protein accessionYP_497166 
Protein GI87199909 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAACC TTCTGAACGT TGCAGCGCCG CTGGCGCTGA CCCTTTCGTG CGCCCTGGCG 
ACGCCTGCCA CGGCGGAGAC GGTGACGGCA ACATCGCCGG ACGGCAGCCT GGTGCTGTCG
GTGACCACCG ACAACGACGG CCACCCGCTC TACAGCCTGA CCCGCAAGGG CAAGCTCCTG
CTAGGATCGT CGATGCTGGG TTTCATCACC AGCGATGGCC CAACTATGCA ACGCGGGCAG
ACCATCATCG GCAGCGAGAA GGGATCGGGC AAGGAGACCT GGGAACAGCC CTGGGGTGAG
CGGCGCTATG TCACCGACAA TCATAACGAG CTTCTGGTGA AGTTCGAACA GGTGCCGGAC
TGGGGCGGGC GCCGCATGAA CGTGCGCTTC CGCCTGTTCG ACGATGGCTT CGGTTTCCGC
TACGAGATCC CCGAACAGCC CGCGATGAAG GTGATGAAGA TCGCGGACGA GCTTACCGAG
TTCAACGTGG CGCAGAACGG CACGGCTTGG TGGATTCCGG GCGGTGAATG GAACCGCTAT
GAGCAGGTCT ACCAGAAGAC GGCGATCGAC GGCGTCTCGA CCGCGCACAC TCCGATCACG
ATGAAGCTGG CGGACGGGAC GCACCTGTCG TTCCACGAGG CGGCGCTGGT GGACTATTCG
GCGATGTGGC TGAAGCGGCA GACGGGCACC TCGTTCCGCG CCACGCTTTC ACCTTCGCCG
AACGGGCCCA AGGTGACGCG CGCGGTTCCG TTCAACACCC CATGGCGCAC CGTGCGGATT
GCCGACAATG CGAAGGGCAT CGTCGAGAAC GACCTCGAAC TGAACCTCAA CGAGCCGAAC
AGGATCGGTG ACGTTTCGTA CTTCAAGCCG ATGAAGTACA TCGGCATCTG GTGGGGCATG
ATCCGGGGCG ACTGGTCCTG GGCGGAAGGC CCGAAGCACG GCGCGACGAC CGCGCGGACC
AAGCAGTACA TCGACTTCGC AGCCAGGCAC GGTTTTGGCG GGGTATTGGT GGAAGGCTGG
GACAAGGGCT GGAACGGGAC CTGGTTCGGC AGCGGCAAGG AGTTCTCCTA TACCGAGGCG
ACGCCCGACT TCGATCTTGA GGCAGTGACG AAATATGGCG CGAAGAAGGG CGTCATGCTG
ATCGGCCATC ACGAGACGGG CGGCAACATC GCGAACTACG AGGCGCAGCT CGAGGACGCA
ATGAAGCTCT ACGACAAGCT GGGCGTGCGC GCGGTGAAGA CCGGGTACGT CGCCGATGCG
GGTGGCATCC TTGCACCAGG CGATGCGCCG GGCACCTACC GGATGGAGTA CCACGACGGG
CAGCGGCAGG TGCAGCATCA CCTCAAGGTG GTCGAGATCG CCGCGAAGTA TCGCATCGCG
ATCAACGCGC ACGAGCCGGT GAAGGACACC GGCCTTCGCC GCACATATCC CAACTGGATC
GACCGCGAAG GCGCGCGCGG CATGGAATAC AACGCGTGGG GGCAGTTCGC CAACGGACCG
GACCACGAGC CGACGCTGGT CTATACGCGG ATGCTGTCGG GGCCGATGGA CTACACCCCG
GGCATCCTCA GCCTGGAAGG CGCCAACAAG GTGCCGCTGG CATCGACGCT CGCCAAGCAG
CTCGGGCTGT ACCTTGCGAT CTATTCGCCG ATCCAGATGG CAGCGGATTT CATCGAGAGC
CTCGAATCGC ACCCGAAGGA ACTGGCGTTC ATCAAACAGG TCCCGGCAGA TTGGTCCGAA
AGCCACCTGA TCGCGGGCGA AGTTGGCGAC TATGCGATCT TCGCGCGCAA GGACCGCAAC
AGCGAGGACT GGTACGTCGG CGGCGTCAAT GATGCGACGG CGCGCGACGT TTCGCTTTCC
CTCGACTTCC TCGATCCCGG CAAGACCTAC ACCGCGACGG TGTGGAAGGA CGGCGAAGGC
GCCACCTACG AAACCGAGGC ACGCCACCGG ATCGCCTATG CCACGCTCAA GGTGAAGAAG
GGCGACGTGC TGCCCGCCTG GCTGGCGCCG GGAGGCGGAC TTGCGGTGCG CCTCCACCCG
GGGAAGTAG
 
Protein sequence
MRNLLNVAAP LALTLSCALA TPATAETVTA TSPDGSLVLS VTTDNDGHPL YSLTRKGKLL 
LGSSMLGFIT SDGPTMQRGQ TIIGSEKGSG KETWEQPWGE RRYVTDNHNE LLVKFEQVPD
WGGRRMNVRF RLFDDGFGFR YEIPEQPAMK VMKIADELTE FNVAQNGTAW WIPGGEWNRY
EQVYQKTAID GVSTAHTPIT MKLADGTHLS FHEAALVDYS AMWLKRQTGT SFRATLSPSP
NGPKVTRAVP FNTPWRTVRI ADNAKGIVEN DLELNLNEPN RIGDVSYFKP MKYIGIWWGM
IRGDWSWAEG PKHGATTART KQYIDFAARH GFGGVLVEGW DKGWNGTWFG SGKEFSYTEA
TPDFDLEAVT KYGAKKGVML IGHHETGGNI ANYEAQLEDA MKLYDKLGVR AVKTGYVADA
GGILAPGDAP GTYRMEYHDG QRQVQHHLKV VEIAAKYRIA INAHEPVKDT GLRRTYPNWI
DREGARGMEY NAWGQFANGP DHEPTLVYTR MLSGPMDYTP GILSLEGANK VPLASTLAKQ
LGLYLAIYSP IQMAADFIES LESHPKELAF IKQVPADWSE SHLIAGEVGD YAIFARKDRN
SEDWYVGGVN DATARDVSLS LDFLDPGKTY TATVWKDGEG ATYETEARHR IAYATLKVKK
GDVLPAWLAP GGGLAVRLHP GK