Gene Saro_1975 details

Gene Information

Locus tag: Saro_1975
Symbol:
ID: 3917293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2092097
End bp: 2093152
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 61%
IMG OID: 640444725
Product: branched-chain alpha-keto acid dehydrogenase E1 component
Protein accession: YP_497249
Protein GI: 87199992
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
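
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2092097
END_BP = 2093152
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 1056
print(protein_length(GENE_LENGTH_BP))  # 351
```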


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.230446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTACG AAGAATCTCC GGTCAGCCTC GCCGAGGCCC CGACCCGTCG TCTGAACATG 
ATCGAAGCGA TCAATGACGC TCTCGACATC ATGATGGAGC GTGATCCCAA CGTCGTGGTC
ATGGGCGAGG ACGTCGGTTA TTTCGGCGGC GTGTTCCGTG CGACGGCGGG ACTCCAGAAG
AAATACGGCA AGACCCGCGT GTTTGACACG CCGATCAGCG AGTGCGGTAT CATCGGCGTG
GCTGTCGGCA TGGGCGCTTA TGGCCTGCGC CCGGTGCCCG AGATCCAGTT CGCCGACTAC
ATCTATCCAG GCCTCGATCA GCTCGTTTCG GAGGCTGCGA GGTTGCGTTA CCGTTCGGCC
GGCGAATTCA TTGCGCCGAT GACGGTGCGT TCGCCATTTG GCGGCGGCAT CTTCGGCGGG
CAGACCCACA GCCAAAGCCC CGAGGCGCTG TTTACCCACG TCGCCGGGCT GAAGACGGTG
GTTCCTAGCA CTCCGCACGA TGCGAAGGGT CTCTTGATCG CAGCGATCGA GGATAACGAT
CCGGTGATCT TCTTCGAGCC CAAGCGCATC TATAACGGGC CTTTCAACGG CTACTACGAC
AAGCCTGTCG AGCCCTGGAG CAAGCATGCG GACAGCGCCG TTCCGGAGGG CTATTATTCG
ATACCGCTAG GAAAGGCCCG CGTCGTGCGC CCGGGGCAGG CGTTCACTGT ATTGGCCTAT
GGCACCATGG TCCACGTCGC TGCAGCGGTC TGCGCGGAGA AGGGCGTCGA TGCCGAAATC
ATCGACCTCA GGACACTTGT CCCGCTGGAT ATCGAGACGG TGGAAAAGTC GGTGGAAAAG
ACCGGCAAAT GCCTGATCGT CCATGAAGCC ACGCGCACTT CGGGCTTTGG CGCGGAGTTG
TCCGCCCTGG TTCAGGAGCG TTGCTTCTAC CACCTCGAAG CACCGATAGA GCGCGTGACC
GGCTTCGACA CACCCTATCC ACACAGCCTC GAATGGGCCT ACTTCCCTGG CCCGGTCCGC
ATCGGCGAGG CCGTCGACCG ACTGATGAAG GCCTGA
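
The gene sequence is laid out in space-separated blocks of 10 bases, so whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC percentage like the one reported in the Gene Information section could be recomputed from such text. The helper names are hypothetical, and only the first sequence line is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above (six blocks of 10 bases).
first_line = "ATGACCTACG AAGAATCTCC GGTCAGCCTC GCCGAGGCCC CGACCCGTCG TCTGAACATG"
print(len(clean_sequence(first_line)))    # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 1056 bp sequence to `gc_percent` gives a figure that can be compared with the rounded GC content listed in the record.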
 
Protein sequence
MTYEESPVSL AEAPTRRLNM IEAINDALDI MMERDPNVVV MGEDVGYFGG VFRATAGLQK 
KYGKTRVFDT PISECGIIGV AVGMGAYGLR PVPEIQFADY IYPGLDQLVS EAARLRYRSA
GEFIAPMTVR SPFGGGIFGG QTHSQSPEAL FTHVAGLKTV VPSTPHDAKG LLIAAIEDND
PVIFFEPKRI YNGPFNGYYD KPVEPWSKHA DSAVPEGYYS IPLGKARVVR PGQAFTVLAY
GTMVHVAAAV CAEKGVDAEI IDLRTLVPLD IETVEKSVEK TGKCLIVHEA TRTSGFGAEL
SALVQERCFY HLEAPIERVT GFDTPYPHSL EWAYFPGPVR IGEAVDRLMK A
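
The protein sequence corresponds to a translation of the gene sequence under translation table 11. That table assigns the same amino acid to each codon as the standard genetic code and differs in which codons may serve as start codons. The sketch below builds the codon table by hand rather than relying on a library, and the `translate` helper is an illustrative name. It stops at the first stop codon and marks unrecognized codons as `X`.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last two codons of the gene sequence above.
print(translate("ATGACCTACG AAGAATCTCC G"))  # MTYEESP
print(translate("GCCTGA"))                   # A
```

The two printed fragments match the first seven residues and the final residue of the protein sequence shown above.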