Gene Saro_1908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1908 
Symbol 
ID3917131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2020312 
End bp2021457 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content62% 
IMG OID640444654 
Productpyruvate dehydrogenase (lipoamide) 
Protein accessionYP_497182 
Protein GI87199925 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.591814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCAAGT CCGCCAGGCC CAGAGCCGCC AAACCCGCCG CCAAACCTGC TGCAAAATCC 
GCAGGCAAAT CGCCCGCTTC GGCTGCGGCC GATCCGGTGA TCGCTTCCTC GGTTGCCGAA
GAGGCAGCCT TCGCACTACG CAGCCTCCAG CAGGCCCATG CGAACAACAA GCGTTACGAC
GCGAGCGATG CGGAACTGCT GAAGTTCTAT GAGCAGATGG TCCTGATCCG CCGGTTCGAG
GAAAAGGCGG GCCAGCTTTA CGGCCTCGGC CTCATCGGCG GGTTCTGCCA TCTTTACATC
GGCCAGGAAG CCGTGGCAGT CGGTCTTCAG TCGGCGCTGA AGGAAGGTCA CGATAGCGTG
ATCACCGGCT ATCGCGATCA CGGGCACATG CTTGCCTACG GCATCGATCC GAAGGTGATC
ATGGCAGAGT TGACCGGTCG CGGCGCGGGC ATCTCGCGCG GCAAGGGCGG TTCGATGCAC
ATGTTCAGCA CGGACCACAA GTTCTACGGC GGTCACGGCA TCGTCGGAGC GCAGGTTCCG
CTCGGAGCGG GCCTTGCCTT TGCACACAAG TATCGCGGTG ACGACGGCGT GTGCATGGCT
TACTTCGGCG ACGGCGCGGC AAACCAGGGC CAGGTCTACG AGACCTTCAA CATGGCCGCC
CTGTGGAAGC TGCCGATCAT CTTCGTGGTC GAGAACAACG GCTACGCCAT GGGAACCGCG
GTCAAGCGGG GGTCGGCAGA GACCGAGTTC TATCGCCGTG GCACCGCGTT CCGCATTCCA
GGCATGGACG TCAACGGCAT GGACGTTCTC GAAGTGCGCC AAGCCGCCGA GGTCGCGCTC
GAGTATGTTC GTGCGGGCAA CGGCCCCGTG CTCATGGAAC TCAACACCTA CCGTTACCGC
GGGCATTCGA TGTCCGACCC CGCAAAGTAT CGCAGTCGCG AGGAAGTGCA GGAAATGCGG
GACAAGCACG ATCCTATCGA AGGCGCCAAG GCAGAACTGC TGAAGCGGGG CGTGACCGAG
GACAAGATCA AGGAAATCGA CAAGCGCATT CGCCAGATCG TCGCGGAATC GGCCGACTTT
GCCGAAACCT CGCCCGAGCC GGACATGGCC GAGCTCTACA CTGACGTGCT GGTGGAGAAG
TACTGA
 
Protein sequence
MAKSARPRAA KPAAKPAAKS AGKSPASAAA DPVIASSVAE EAAFALRSLQ QAHANNKRYD 
ASDAELLKFY EQMVLIRRFE EKAGQLYGLG LIGGFCHLYI GQEAVAVGLQ SALKEGHDSV
ITGYRDHGHM LAYGIDPKVI MAELTGRGAG ISRGKGGSMH MFSTDHKFYG GHGIVGAQVP
LGAGLAFAHK YRGDDGVCMA YFGDGAANQG QVYETFNMAA LWKLPIIFVV ENNGYAMGTA
VKRGSAETEF YRRGTAFRIP GMDVNGMDVL EVRQAAEVAL EYVRAGNGPV LMELNTYRYR
GHSMSDPAKY RSREEVQEMR DKHDPIEGAK AELLKRGVTE DKIKEIDKRI RQIVAESADF
AETSPEPDMA ELYTDVLVEK Y