Gene Saro_1974 details

Gene Information

Locus tag: Saro_1974
Symbol:
ID: 3917292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2090753
End bp: 2092093
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 65%
IMG OID: 640444724
Product: branched-chain alpha-keto acid dehydrogenase subunit E2
Protein accession: YP_497248
Protein GI: 87199991
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID:
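
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not translated. The variable names are hypothetical and not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 2090753
end_bp = 2092093

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).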


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.015684
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACCT ACACATTCCG CCTGCCCGAT ATTGGCGAGG GTATCGCCGA GGCAGAAATC 
GTCGCCTGGC ATGTCAAGGT CGGCGACACT GTCGAGGAAG ACGGTCGCCT GGCTGACATG
ATGACCGACA AGGCCACGGT CGAGATGGAA AGCCCGGTCG CGGGCAAGGT CGTCTCGGTT
GCGGGGGAAG TTGGCGATGT CGTGGCGATC GGCTCGGCGC TGGTTGTGAT CGAGACCGAG
GGGGAGGACG AGGCACCGGC GCCTGCTGCG GCGCCCGCGC CCAAGGCGGC GATCGTCGAA
GAGCGCATCG AGGTCGAAAC GCCCGAGCCA CCGCAACCGC CATCACCGCC CCAGCCGCTG
TTCGTTTCGC GCGAAGTCGA GGCACCGCCC GCAGTGCCGG CTACAGGTTC TGGCGTGGCG
CCTGGCCCGC GTGCCTCGAC CGCGCCTGAC ACGATCGGTG GGGCGGGGGC AAAGGTCCTC
GCCAGTCCGG CCGTGCGGCA GCGTGCCCGC GATCTTGGCA TAGACCTGTC GGAAGTCCGT
CCGTCTGAGG AAGGCCGCAT TCGCCACGCC GACCTCGATC AGTTCCTCTC CTACAATGCC
TCTGGCGGTT ACCGTGCAGC CGGTGCCGAG CGCGGCGACG AAGTGATCAG GGTCATCGGT
ATGCGGCGAC GCATCGCCGA GAACATGGCC GCGTCGAAAC GACACATCCC GCACTTCTCC
TACGTCGAGG AATGCGATGT GACCGCGCTT GAAATCATGC GGGAACAACT CAACGCGGGC
CGGGGCGACA AGCCCAAGCT GACGATGTTG CCCCTGCTTA TCACCGCGAT CTGCCGTGCT
CTGCCGCAGT ACCCGATGAT CAACGCCCGC TATGACGACG AGGCCGGCGT GGTTACCCGC
TATGGTGCGG TGCATCTCGG CATGGCGGCG CAAACGCCTG CGGGCCTTAT GGTGCCTGTC
ATCCGCAACG CCCAGACCCT GAATCTCTGG CAACTCGCCC GCGAGATTGT CCGCCTGGCA
GAGGCCGCGC GCAGCGGCAG CGCAAAATCG GACGAGCTTT CCGGTTCGAC GTTGACGGTG
ACGTCCCTTG GCCCACTTGG CGGCGTGGCG ACCACGCCGG TCATCAACCG CCCGGAAGTT
GCCATCATCG GGCCCAATCG CATCGTCGAG CGGCCGATGT TCGTGTCCGA TGGCATGGGG
GGCGAGCGGA TCGAAAAGCG CAAGCTGATG AACATCTCGA TCAGTTGCGA CCATCGCGTG
GTCGATGGCC ACGATGCGGC AAGTTTCATC CAGGCGGTGA AGAAGCTGAT CGAAACGCCG
GTGCTGCTGC TGGCGGACTG A
 
Protein sequence
MGTYTFRLPD IGEGIAEAEI VAWHVKVGDT VEEDGRLADM MTDKATVEME SPVAGKVVSV 
AGEVGDVVAI GSALVVIETE GEDEAPAPAA APAPKAAIVE ERIEVETPEP PQPPSPPQPL
FVSREVEAPP AVPATGSGVA PGPRASTAPD TIGGAGAKVL ASPAVRQRAR DLGIDLSEVR
PSEEGRIRHA DLDQFLSYNA SGGYRAAGAE RGDEVIRVIG MRRRIAENMA ASKRHIPHFS
YVEECDVTAL EIMREQLNAG RGDKPKLTML PLLITAICRA LPQYPMINAR YDDEAGVVTR
YGAVHLGMAA QTPAGLMVPV IRNAQTLNLW QLAREIVRLA EAARSGSAKS DELSGSTLTV
TSLGPLGGVA TTPVINRPEV AIIGPNRIVE RPMFVSDGMG GERIEKRKLM NISISCDHRV
VDGHDAASFI QAVKKLIETP VLLLAD
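
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI amino-acid string, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
# Amino-acid assignments shared by translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final blocks of the gene sequence.
first_codons = "ATGGGAACCT ACACATTCCG CCTGCCCGAT"
last_codons = "GTGCTGCTGC TGGCGGACTG A"

print(translate(first_codons))
print(translate(last_codons))
```

The two outputs can be compared with the start and end of the listed protein sequence.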