Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3558 |
Symbol | |
ID | 5077707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 174917 |
End bp | 175891 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481282 |
Product | dehydrogenase, E1 component |
Protein accession | YP_001165944 |
Protein GI | 146275784 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00222908 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTGA GCCGTGAGGC GCTATTGCGC GCCTATCGCC AGATGAAGGT GATCCGCGAA TTCGAGGAAC GCCTACACGT CGATATCCAG ACCGGCGAGA TCGCCGGCTT CACCCACCTC TACTGCGGGC AGGAAGCCGT CGCGGTCGGG GTGTGCGAAC ATCTGTCGGT CGAGGACAAG ATCGTCTCCA CCCATCGCGG CCACGGCCAC TGCCTTGCCA AGGGTTGCGA CGTGAACGGG ATGATGAAGG AGATCTGGGG CAGCCGCGAA GGCCTGTGCA AGGGCAAGGG CGGCTCGATG CACATCGCCG ACGTCGACAA GGGCATGCTC GGCGCCAACG GCATCGTCGG TGCGGGCGCT CCCATCGCGG TGGGCGCGGG GATCGCCGCC AAGATCGACG GCAAGGGCAA GGTCGCGATC ACCTTCTCGG GCGACGGCGC ATGCAATCAG GGCACCACGT TCGAGGCCAT GAACATGGCC GTGGTGACCA AGGCCGCGAC GATCTTCGTG TTCGAGAACA ACCACTATTC CGAACACACC GGCTTCGAAT ACGCGGTCGG CACGACCAAG GATATCGCCA GCCGCGCCGA GGCCTTCGGC ATGAAGGTGT GGCGCGGTGA CGGCACCGAC TTCTTCTCGG TGTTCGAGAC GATGCGCGAA GTGCTCGACT ACGTGCGCGT CCCCGGCAAC GGCCCGGCCG CTGTCGAATT CGACACCGAA CGCTTCTTCG GCCACTTCGA AGGCGACCCG CAGCGCTATC GCGGCCCCGG CGAGATCGAC CGCATCCGCG AGACCCGCGA CTGCCTCAAG AAGTTCCGCG AAAGCGTGAC CGCCGCCAAG CTGCTCACCC ACGAAGACCT CGACGCGCTC GATGCCGAAG TGATGGAAGC GATCGAGGAA TCGGTCCGGC AGGCCAAGGC CGCAGACCGG CCCACGGCAG AAGACGTCCT CACCGACGTC TATATCAGCT ACTGA
|
Protein sequence | MQLSREALLR AYRQMKVIRE FEERLHVDIQ TGEIAGFTHL YCGQEAVAVG VCEHLSVEDK IVSTHRGHGH CLAKGCDVNG MMKEIWGSRE GLCKGKGGSM HIADVDKGML GANGIVGAGA PIAVGAGIAA KIDGKGKVAI TFSGDGACNQ GTTFEAMNMA VVTKAATIFV FENNHYSEHT GFEYAVGTTK DIASRAEAFG MKVWRGDGTD FFSVFETMRE VLDYVRVPGN GPAAVEFDTE RFFGHFEGDP QRYRGPGEID RIRETRDCLK KFRESVTAAK LLTHEDLDAL DAEVMEAIEE SVRQAKAADR PTAEDVLTDV YISY
|
| |