Gene Saro_3677 details


Gene Information

Locus tag: Saro_3677
Symbol:
ID: 5077825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 310264
End bp: 311664
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 69%
IMG OID: 640481400
Product: glucuronate isomerase
Protein accession: YP_001166062
Protein GI: 146275902
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1904] Glucuronate isomerase
TIGRFAM ID:
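
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


print(gene_length(310264, 311664))  # 1401
print(protein_length(1401))         # 466
```

Under these assumptions both results agree with the Gene Length (1401 bp) and Protein Length (466 aa) reported in the record.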


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.339783
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCGCG AACTTGAGCT TCATCCGGAC CGGTTGCTTC CGGTGGATCC ATCGGTGCGT 
GGGTTGGCGC GGGAGTTGTA TGCGAGCGTG CGGGGTCTGC CGGTGGTGAG CCCGCACGGG
CATACCGATC CGCGCTGGTT CGCGGGGAAC GGGACCTTCG GCAACGCGAC CGAGCTGCTG
CTGGTGCCGG ATCATTACGT GTTCCGCATG CTCTACTCGC AAGGGGTGGC GCTGGAGGAC
CTCGGCGTGC GCAACCGGCA GGTCGACCCG CGCGCGGCGT GGCGGCTTTT TGCCGAGCGC
TACTGGCTGT TCCGGGGAAC GCCGTCGCGC ATGTGGCTCG ACTGGGTCTT CGCCGAGGCC
TTCGGCATGG GCGTGCAGTT GTGCGCCGGG ACCGCCGACC TCTACTTCGA CACGATCACC
GAGATGCTGG CCAGCGACGC CTTCCGCCCG CGCGCGCTGT TCGAACGGTT CAACATCGAG
GTCCTGGCCA CGACCGAGAG CCCGCTCGAC AGCCTCGAGC ATCACGCCGC GATCCGGGCT
TCGGGCTGGA AGGGCCGCGT GATCACCGCC TATCGCCCCG ACCCGGTGGT CGATCCCGAC
TTCGAGGGCT TCGGCGCCAA CCTCGACCTG CTCTCGCACC TCACCGGCGA GGATTGCCGC
ACCTGGACCG GCTATCTCGC CGCGCACCGC CAGCGCCGCG CCTTCTTCGC GTCGATGGGC
GCGACCAGTA CCGACCATGG CCACCCGACC GCGGCGACCG CCAACCTCCC TGCCAGCGAG
GCCGAGGCCC TGTTCGGCAA GATCGTCGCG GGCGCCTTCA CCCCCGCCGA GGCCGAGCTG
TTCCGCGCCC AGATGCTGAC CGAGATGGCG GCGATGAGCC TCGACGACGG GCTCGTCATG
CAGATCCATC CCGGATCGTT CCGCAACCAC AACGCGCAGG TGTTCGAACG CTTCGGGCGC
GACAAGGGCG CCGACATCCC CACCCGCACC GACTTCGTCC ATGCCCTGAA GCCGCTGCTC
GACCGCTTCG GCAACGAGCG CGACCTGTCG ATCATCCTCT TCACGCTCGA CGAGAGCGCC
TATGCCCGCG AACTCGCGCC GCTTGCCGGG CATTATCCCT GCCTCAGGCT CGGCCCGGCC
TGGTGGTTCC ACGACAGCCC CGAGGGCATG CGCCGCTTCC GCCGCATGAC CACCGAGACG
GCGGGCTTCT ACAACACCGT CGGCTTCAAC GACGATACCC GCGCGTTCCT CTCGATCCCG
GCGCGGCACG ACGTGGCGCG GCGGATCGAC TGCGGGTTCC TCGCCGAACT GGTCGCCGAA
CACCGCATCG AGGACTGGGA AGCGGCGGAG CTGGCCCGGG ACCTGTCGTA CGATCTGGCG
AAGAAGGCGT ACCGCCTGTG A
 
Protein sequence
MPRELELHPD RLLPVDPSVR GLARELYASV RGLPVVSPHG HTDPRWFAGN GTFGNATELL 
LVPDHYVFRM LYSQGVALED LGVRNRQVDP RAAWRLFAER YWLFRGTPSR MWLDWVFAEA
FGMGVQLCAG TADLYFDTIT EMLASDAFRP RALFERFNIE VLATTESPLD SLEHHAAIRA
SGWKGRVITA YRPDPVVDPD FEGFGANLDL LSHLTGEDCR TWTGYLAAHR QRRAFFASMG
ATSTDHGHPT AATANLPASE AEALFGKIVA GAFTPAEAEL FRAQMLTEMA AMSLDDGLVM
QIHPGSFRNH NAQVFERFGR DKGADIPTRT DFVHALKPLL DRFGNERDLS IILFTLDESA
YARELAPLAG HYPCLRLGPA WWFHDSPEGM RRFRRMTTET AGFYNTVGFN DDTRAFLSIP
ARHDVARRID CGFLAELVAE HRIEDWEAAE LARDLSYDLA KKAYRL
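
The sequences in this record are printed in blocks of 10 characters, 60 per line. A minimal sketch of how such a listing might be rejoined and checked against the reported length and GC content is shown below. It uses only the Python standard library, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is included here for brevity; the full listing would be pasted in the same way.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked listing into one uppercase string, dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (six blocks of 10 bases).
first_line = "ATGCCCCGCG AACTTGAGCT TCATCCGGAC CGGTTGCTTC CGGTGGATCC ATCGGTGCGT"
seq = clean_sequence(first_line)

print(len(seq))                  # 60
print(f"{gc_fraction(seq):.0%}")  # GC share of this line only, not the whole gene
```

Applied to the complete gene sequence, `len` would be expected to return the 1401 bp given in the Gene Information table, and `gc_fraction` a value close to the reported 69%.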