Gene Saro_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1874 
Symbol 
ID3917095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1975363 
End bp1976370 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID640444618 
ProductGroES-related 
Protein accessionYP_497148 
Protein GI87199891 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.335289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCCG CCGTGATGCA GGGCCTGCAC AAGCCGCTGG CCATCGATAC GATCCCCGAT 
CCGACGCCGG GCGAGGGTGA TGTCGTGGTC AAGGTCGGGC GCTGCGGCAT CTGCGGGTCC
GACCTGCACA TGACCGAGGA CCCGGCATAC GGGCAGGGCG CGGGTTCGGT GCTGGGTCAC
GAGTTTGCGG GCGAAGTCGT TGCGCTGGGC AAGGGCGTGG AGGGCCTCAG GACCGGGGAT
CTCGTCTCGG TCATCCCGTT ACAAAGTTGC GGTCAATGTC ATTCGTGTCG CACGGGCGAA
GTGCAGTGGT GCGAGAGGTT CGGTCTTCAG GGTGGCGGCT ATGCCGAATT TGCCCTGACG
CGGCCGAACC AGTGCGTGCG CCTGCCGGCA AGCGCCAGCA TGGCTGATGG CGCCATCGTC
GAACCGCTAG CCGTGGCTCT GCACGGCCTG GCGCTTAGCC GGATGAAGAT CGGCGACAAG
GTGCTGGTGC TGGGCGCGGG ACCGATCGGT CTCGCCGTCG CGTTCTGGGC CCGGCGCTTC
GGGGCTGGGC GCGTGGTGGT GCAGGACCTG GCGGAGTGGC AGCGCGACCG CGCTTTGCAG
ATGGGCGCGC ACGATTTCGT CGTCGATGCG GCCGATCCGG TGGGGAGCGC CGGGCGCGCG
CTGGGCGGCA AGGCCGATAT AGTATTCGAA TGCGTGGGCG TGCCGGGACT GATCGCGCAG
GCTGTGGAGC AGGTGCGCAA TGACGGGACC ATCACGCTGC TTGGCCTGTG TACGCGGCCC
GATACGTTCA ACAGCTTCGC GATGCTGTCC AAGCAGGTGA AGCTGGTGAC GTCCGCGTTC
TTCACCAGGC AGGAATACGA AGCGGCGCTC GACGCCCTCG TCCGTGGTGC GGTGGAGCCG
CGCCTTCTGG TAACCGACAC CATTTCGCTC GATGCAACGC CGGACGTGTT CGAGAGCCTG
CGCAAGCGCA CGCATCAGTG CAAGGTGCTG ATAAATCCGG GCGAATGA
 
Protein sequence
MRAAVMQGLH KPLAIDTIPD PTPGEGDVVV KVGRCGICGS DLHMTEDPAY GQGAGSVLGH 
EFAGEVVALG KGVEGLRTGD LVSVIPLQSC GQCHSCRTGE VQWCERFGLQ GGGYAEFALT
RPNQCVRLPA SASMADGAIV EPLAVALHGL ALSRMKIGDK VLVLGAGPIG LAVAFWARRF
GAGRVVVQDL AEWQRDRALQ MGAHDFVVDA ADPVGSAGRA LGGKADIVFE CVGVPGLIAQ
AVEQVRNDGT ITLLGLCTRP DTFNSFAMLS KQVKLVTSAF FTRQEYEAAL DALVRGAVEP
RLLVTDTISL DATPDVFESL RKRTHQCKVL INPGE