Gene Saro_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3669 
Symbol 
ID5077817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp300381 
End bp303047 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content68% 
IMG OID640481392 
Productglycoside hydrolase family protein 
Protein accessionYP_001166054 
Protein GI146275894 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTGC GCAGCAGGTC CGCCGGTGTT TCGCTGATGG TGCTCGCGTT CGTCGCCTGT 
CCGGCGTTTG CGCAGGAAGT GCGCACAGTC TCGGCGCTCG AGTCGGGTTG GCGCTTTCTG
GACGGCGATG TGGCCGGAGC GGTGGCACCT GATTTCGCCG ACGCCGCGTG GGAGCAGGTC
GAGGTGCCGC ATACCTGGAA CCGCGTCGGC TTCTACCGCC ACGACATGGG CGGGGCCAAT
ACCGAAGCAA ACGTGCGCAA GCGGCAGGGG GTGGGCTGGT ATCGTCTGCG TTTCAGCCCT
GCCCCGGTCC GCCCGGGCGA TCGGAGCTGG CTGGAGTTCG ATGCAGCGAG CCGGACGGCG
CAAGTCTGGC TGAACGGCAC GTATCTGGGT GAGCATCGCG GCGGCTTCTC GCGCTTCCGC
CTCGATGCGA CGGCGGTTTT GCGTGCCGGG GCGGAGAACG TGCTGGTCGT GCGCGTCGAC
AACACCCAGC CTGCCGCCGG TGCGCCGACC GCCGACGTGC TGCCGCTGAC CGGCGACTTC
TTTGTCCACG GCGGGCTCTA TCGGCCGGTG CGGTTGGTGA CGACGGCGCC GCTGCATATC
GACATGACCG ACCACGGTGG CGCGGGCATT CGCGCGGCGA CGGTGTCTGC CTCGACCGGG
AGCGCGCGCG TGAAGGTGGC GGTGCGCGTG GCCAACGACG GTCGGGGTAG CACGCGGGCC
ATTGCGCGCG TGGCGCTCTT CGACCGCGAC GGGCGCGAGG TTGCCGCAGC CAGGGTTCCG
GCGACCGTCG CTTCGGGCAA GGTCGCCGAA GTCGAGGCGG AGCTTTCTGT CGCCGCACCG
CACCTGTGGC AGGGAACGAA AGATCCCTAT CTCCATGACC TGGTGACCGA GGTCCTCGAC
AAGGATGGCA AGGTCGTCGA CCGGATCAGC CAGCCCTTCG GCATCCGGAC GATGACGTTC
GATCCCGAGC GCGGCTTTGT CCTCAACGGG CAGCCCTATC GCCTCAAGGG GGTGGGCTAC
CATCAGGATC GAGATGGCAA GGGATGGGCG ATTTCGCGCG CCGACGTGGC CGAGGACGTC
GCGACCATCC GCGAGATGGG CGCCAATACC ATCCGCCTGA CCCACTACCA GCACGGGCAG
GACATCCATG ACCTGGCCGA CCGCGCGGGC ATCGTCGTCT GGGACGAGAT CCCGCTGGTC
TCGGCCTGGA CGCTGGGGGG AAAGCTCGAT CCCGATCCGG CGCTGGTGGC CAATGCCCGG
CAGCAGCTGA CCGAGCTGGT CAGGCAGAAC CAGAACCACG CCGCGACCGC GATATGGAGC
ATCGCCAACG AGGTGGACTT CGGCAATTCG ATGCCGATCT TCCTGACCGC CGATGCGCAG
GGGCGCGTGG CCGATCCGAT GGCGCTGCTC AAGGAACTGG ACGGGCTGGC AAAGGCGCTC
GACCCGTCGC GCCCGACCGC GCTTGCCACG TGTTGCGAGG GCAATCGGCA GGCACCGGGG
ACACAGATCC CGATCACCGC GACGGCGGCC GATCTCGGCG GAGCGAACCG CTACTTCGGG
TGGTATTACG GCAAGCCTTC CGAACTTGGC GCCCATCTCG ACAAGTTGCG CGCGGGGCGG
CCGGACCAGC CCCTGTCGCT GACGGAATAC GGCGCGGGTG GCGGGCTGAC GATCCACACC
GACAACGTGC TGGGCGGTCC CGCCGATTCA CGCGGGTTTG CGCAGCCCGA GGAGTACGAG
AGCTGGGTGC ACGAGCAGAC CCTGCCACAA CTCGACGCGC GGCCATGGCT TTACGCTACA
TGGCTATGGA ACTCGTTCGA CTTTGCCACG CGCATCCGCA CCGAGGGCGA TGCGCAGGAC
ATCAATACCA AGGGCCTTGT CGCCTATGAC CACAAGACCC GCAAGGACGC GTGGTACTTC
TACAAGGCGA ACTGGAACGT GGAGCCGATG GTCCACATCA CGGCCAAGCG CTACAAGGAG
CGCGCCTATC CGGTGACCGA AGTGAAGGTC TACAGCAACA CGGCCAGCAC CGAACTGCTG
CTGAACGGCC GTTCGCTGGG CACGAAGAGC GATTGCCCAG CCAAGGTCTG CGTGTGGAAT
GCCGTGGCGC TGGACGATGG CGCCAATGCG CTGGTCGCGC GCGGCACGCA CGCGGGCGGG
CAGGTCGAGG ACCGGGCCGA GTGGGCGCTT GCCCCCGAAA CGCGCCGGCA CATGGTGATC
GACGCGGGCA CACTGGTGGC GGGCAAGGGC GCGGGGCGCA TCCTTGGGTC GGACAACTGG
TTCGAGGGAG GCAGCACCGC GACGCTCGAC CAGCCGGCGG ATTTCGGCAA GCCGGGCAAG
CCGGCCGAGA TCGTGGGGAC GCAGGAGCGG GACGCGCTGG CGACCTATCG CACAGGCACT
TTCTCCTATC GCGTGCCGGT GGCCGACGGA CGCTACAAGG TGACGCTGTG GTTCGCGACG
GCTTCGGCCC AGAAGCCCGG GACGTTCGAG GTCAGGAATG GCAGGAAGAC ACTGCTGCGC
AAGTTCCAGC CCGCCATCCC GGCGTCGGGC GCGGTAGCCG AGGCCAGGGC TTTCACGGTC
CGGGCCAAGG GCACGCTCGC GCTGGATTTC GTGCCCGGGA GCGGCGACGC CCGCGTTTCG
ATGATCGAGA TCGAGCGGCT GCGCTGA
 
Protein sequence
MTLRSRSAGV SLMVLAFVAC PAFAQEVRTV SALESGWRFL DGDVAGAVAP DFADAAWEQV 
EVPHTWNRVG FYRHDMGGAN TEANVRKRQG VGWYRLRFSP APVRPGDRSW LEFDAASRTA
QVWLNGTYLG EHRGGFSRFR LDATAVLRAG AENVLVVRVD NTQPAAGAPT ADVLPLTGDF
FVHGGLYRPV RLVTTAPLHI DMTDHGGAGI RAATVSASTG SARVKVAVRV ANDGRGSTRA
IARVALFDRD GREVAAARVP ATVASGKVAE VEAELSVAAP HLWQGTKDPY LHDLVTEVLD
KDGKVVDRIS QPFGIRTMTF DPERGFVLNG QPYRLKGVGY HQDRDGKGWA ISRADVAEDV
ATIREMGANT IRLTHYQHGQ DIHDLADRAG IVVWDEIPLV SAWTLGGKLD PDPALVANAR
QQLTELVRQN QNHAATAIWS IANEVDFGNS MPIFLTADAQ GRVADPMALL KELDGLAKAL
DPSRPTALAT CCEGNRQAPG TQIPITATAA DLGGANRYFG WYYGKPSELG AHLDKLRAGR
PDQPLSLTEY GAGGGLTIHT DNVLGGPADS RGFAQPEEYE SWVHEQTLPQ LDARPWLYAT
WLWNSFDFAT RIRTEGDAQD INTKGLVAYD HKTRKDAWYF YKANWNVEPM VHITAKRYKE
RAYPVTEVKV YSNTASTELL LNGRSLGTKS DCPAKVCVWN AVALDDGANA LVARGTHAGG
QVEDRAEWAL APETRRHMVI DAGTLVAGKG AGRILGSDNW FEGGSTATLD QPADFGKPGK
PAEIVGTQER DALATYRTGT FSYRVPVADG RYKVTLWFAT ASAQKPGTFE VRNGRKTLLR
KFQPAIPASG AVAEARAFTV RAKGTLALDF VPGSGDARVS MIEIERLR