Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1976 |
Symbol | |
ID | 3917294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2093149 |
End bp | 2094423 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444726 |
Product | branched-chain alpha-keto acid dehydrogenase E1 component |
Protein accession | YP_497250 |
Protein GI | 87199993 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0574324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCG GCAATCTGCC GTCGCTATCG CTCCACGTGC CGGAACCGAA GTTCCGGCCG GGTGACAAGG TCGATTATTC CGACCTTGCC ATTTCGCGCG CGGGGGAACA GCCGCGACCC GACGAGCAGT GCGAGGCTTC CGAAACCCAC CCGTTGTGCC TCGATCTGGT GCGCGTGCTT GGCGATGACG ACCGTGCGAT CGGCCCTTGG GACCCCCGGT TGGACGCCGA CACGCTGCGC CGCATGCTGC GCACGATGGC GCTGACCCGT GCTTTCGACG ACCGCATGTA TCGCGGCCAG CGACAGGGCA AGACCAGCTT CTACATGAAG TGCACGGGCG AAGAGGCGAC ATCGGTCGCC CCGGCCATGG CCTTGGCGGA TGACGACATG GTCTTCCCCA GCTACCGCCA GCAGGGCATC CTGATCGCGC GTGGCTATCC GTTGGTCGAG ATGATCAACC AGATCTATTC CAATCGTGCC GACAAGCTGA AGGGACGCCA GTTGCCGATC ATGTATTCGG CGCGCGAGCA GTCGTTCTTC ACGATCTCGG GCAACCTCGC CACGCAGTAC CCGCAGGCCG TGGGTTGGGC CATGGCAAGC GCGATCAAGG GCGACAGCCG CATCGCCGCG ACCTGGATCG GCGAAGGGTC CACGGCTGAG GGCGACTTCC ATTCGGCCAT GACTTTCGCA GCAGTCTACA ATGCGCCCGT CATCTTCAAT GTGGTGAACA ACCAGTGGGC CATTTCCAGT TTTTCGGGTT TTGCCGGCGC GGAGAGGACG ACTTTTGCCG CCCGCGCGAT CGGCTATGGC ATCGCCGGCT TGCGGGTGGA CGGTAACGAT CCGCTTGCTG TCTTCGCGGC AACCCAGTGG GCCGCGAACC GCGCCCGCGC CAATGCCGGC CCTACGCTGA TCGAGCACTT CACCTACCGT GCCGAGGGGC ACTCGACTTC CGATGATCCC ACCCAGTACC GTTCCGCGCA GGAGCGGGAG GAGTGGCCGC TGGGCGACCC GGTCAACCGG CTGAAGAAGC ACCTCGTGGC CCTGGGCGAG TGGTCGGACG AGCAGCACGA GGCGATGGAC CGTGAACTCG TCGACCTGGT CAAGGCGGCC ACGAAGGAGG CCGAAAAGAA CGGCATCCTG GGGCACGGGC TGCATCACCC GTTCCATACA ATGTTCGAGG ACGTCTTCGA GGAACTGCCC TGGCATCTCC GCGAACAGAG CGAGCAGGCA ATCCGCGAGC GTCGGATCAA GTGGCCGGAA TGGAAAGAGT CATGA
|
Protein sequence | MARGNLPSLS LHVPEPKFRP GDKVDYSDLA ISRAGEQPRP DEQCEASETH PLCLDLVRVL GDDDRAIGPW DPRLDADTLR RMLRTMALTR AFDDRMYRGQ RQGKTSFYMK CTGEEATSVA PAMALADDDM VFPSYRQQGI LIARGYPLVE MINQIYSNRA DKLKGRQLPI MYSAREQSFF TISGNLATQY PQAVGWAMAS AIKGDSRIAA TWIGEGSTAE GDFHSAMTFA AVYNAPVIFN VVNNQWAISS FSGFAGAERT TFAARAIGYG IAGLRVDGND PLAVFAATQW AANRARANAG PTLIEHFTYR AEGHSTSDDP TQYRSAQERE EWPLGDPVNR LKKHLVALGE WSDEQHEAMD RELVDLVKAA TKEAEKNGIL GHGLHHPFHT MFEDVFEELP WHLREQSEQA IRERRIKWPE WKES
|
| |