Gene Saro_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1147 
Symbol 
ID3916444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1195601 
End bp1196521 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content69% 
IMG OID640443883 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_496426 
Protein GI87199169 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCAG AATTGCTGGC GATCACCGGT GCCACCGGGT TCGTCGGGCA GGCGGTTCTC 
GATTTCGCCG CCCGCGCAGG GATAGAAGTC CGTGCGCTTG CCCGCCGTCC CCAGGAAGCG
CGCGCCGGCG TGGAATGGGT GCAGGGCGAC CTTGACGACA AGCGCGCGCT CCAGCGCCTT
GTCGGCCGCG CCAGCGTGGT CCTGCACATC GCGGGCGTCG TCAATGCTCC CGATCCGCAA
GGCTTCGAGG CCGGCAACGT CCTCGGCACG CTCAACGTCG TCAATGCCGC GCTGGCTGCC
GGTGTGCCGC GCCTCGTCCA CGTTTCCTCT CTCTCCGCCC GCGAGCCGGA CCTGTCGATC
TATGGCAGGT CGAAGTTGCG GGGGGAAAAG ATCGTCAAGG CCAGCAGCCT CGACTGGACC
GTGGTGCGTC CGCCGGCCGT CTACGGCCCG CGCGATACCG AGATGTTCGA GCTGTTCAAG
CTCGCCCGCA GGGGCATCGT GCCGCTGCCG CCGCAGGGCC ACCTCTCGAT CATCCACGTC
AATGACCTGG CGCGTCTGCT CCTCTCGCTC ATCCCCGGCG GCGAGGAGGT GACGCACCTG
ACCTTCGAAC CCGACGACGG CACCACGGGC GGATGGACGA ATACCGAGCT GGCAAAGGCC
ATCGGTGTTG CGGTCGGCAA GCGGGTCACA GCGATGAACC TGCCGGCGGG CCTGCTGCGC
CTCGGCGCGA AGCTCGATGC GCGGTTCCGG GGCAAGGGCG CGAAGCTCAC GATGGACCGT
GTCGGCTACA TGTGCCACCC GGACTGGCGC GCGGGCGAAG GCAACCAGCC GCCGCCTGCG
ATCTGGACCC CCCAGGTCGA AACCCGCATG GGTCTCCACG CCACGGCCGC CTGGTACCGC
GAGGCGGGCT GGCTCAAGTA A
 
Protein sequence
MAAELLAITG ATGFVGQAVL DFAARAGIEV RALARRPQEA RAGVEWVQGD LDDKRALQRL 
VGRASVVLHI AGVVNAPDPQ GFEAGNVLGT LNVVNAALAA GVPRLVHVSS LSAREPDLSI
YGRSKLRGEK IVKASSLDWT VVRPPAVYGP RDTEMFELFK LARRGIVPLP PQGHLSIIHV
NDLARLLLSL IPGGEEVTHL TFEPDDGTTG GWTNTELAKA IGVAVGKRVT AMNLPAGLLR
LGAKLDARFR GKGAKLTMDR VGYMCHPDWR AGEGNQPPPA IWTPQVETRM GLHATAAWYR
EAGWLK