Gene Saro_3558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3558 
Symbol 
ID5077707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp174917 
End bp175891 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content65% 
IMG OID640481282 
Productdehydrogenase, E1 component 
Protein accessionYP_001165944 
Protein GI146275784 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00222908 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCTGA GCCGTGAGGC GCTATTGCGC GCCTATCGCC AGATGAAGGT GATCCGCGAA 
TTCGAGGAAC GCCTACACGT CGATATCCAG ACCGGCGAGA TCGCCGGCTT CACCCACCTC
TACTGCGGGC AGGAAGCCGT CGCGGTCGGG GTGTGCGAAC ATCTGTCGGT CGAGGACAAG
ATCGTCTCCA CCCATCGCGG CCACGGCCAC TGCCTTGCCA AGGGTTGCGA CGTGAACGGG
ATGATGAAGG AGATCTGGGG CAGCCGCGAA GGCCTGTGCA AGGGCAAGGG CGGCTCGATG
CACATCGCCG ACGTCGACAA GGGCATGCTC GGCGCCAACG GCATCGTCGG TGCGGGCGCT
CCCATCGCGG TGGGCGCGGG GATCGCCGCC AAGATCGACG GCAAGGGCAA GGTCGCGATC
ACCTTCTCGG GCGACGGCGC ATGCAATCAG GGCACCACGT TCGAGGCCAT GAACATGGCC
GTGGTGACCA AGGCCGCGAC GATCTTCGTG TTCGAGAACA ACCACTATTC CGAACACACC
GGCTTCGAAT ACGCGGTCGG CACGACCAAG GATATCGCCA GCCGCGCCGA GGCCTTCGGC
ATGAAGGTGT GGCGCGGTGA CGGCACCGAC TTCTTCTCGG TGTTCGAGAC GATGCGCGAA
GTGCTCGACT ACGTGCGCGT CCCCGGCAAC GGCCCGGCCG CTGTCGAATT CGACACCGAA
CGCTTCTTCG GCCACTTCGA AGGCGACCCG CAGCGCTATC GCGGCCCCGG CGAGATCGAC
CGCATCCGCG AGACCCGCGA CTGCCTCAAG AAGTTCCGCG AAAGCGTGAC CGCCGCCAAG
CTGCTCACCC ACGAAGACCT CGACGCGCTC GATGCCGAAG TGATGGAAGC GATCGAGGAA
TCGGTCCGGC AGGCCAAGGC CGCAGACCGG CCCACGGCAG AAGACGTCCT CACCGACGTC
TATATCAGCT ACTGA
 
Protein sequence
MQLSREALLR AYRQMKVIRE FEERLHVDIQ TGEIAGFTHL YCGQEAVAVG VCEHLSVEDK 
IVSTHRGHGH CLAKGCDVNG MMKEIWGSRE GLCKGKGGSM HIADVDKGML GANGIVGAGA
PIAVGAGIAA KIDGKGKVAI TFSGDGACNQ GTTFEAMNMA VVTKAATIFV FENNHYSEHT
GFEYAVGTTK DIASRAEAFG MKVWRGDGTD FFSVFETMRE VLDYVRVPGN GPAAVEFDTE
RFFGHFEGDP QRYRGPGEID RIRETRDCLK KFRESVTAAK LLTHEDLDAL DAEVMEAIEE
SVRQAKAADR PTAEDVLTDV YISY