Gene Saro_3741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3741 
Symbol 
ID5077889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp379356 
End bp380303 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content65% 
IMG OID640481464 
Productdehydrogenase, E1 component 
Protein accessionYP_001166126 
Protein GI146275966 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.418325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGGC CGGACCCTGC GCTGCTTGAA AGCATGTTTC ACAAGTTGGC CGTCTCGCGT 
GCGGTCGAGA CGTTGATGTT GCGCCACACG CGGGAAGAGC GCTTCTCCGG TTGGTGGCAT
CCGGGTGAGG GGCAGGAAGC CGCGCCGATC GGCGCCACGG CTGCGCTCGA AGCTGACGAT
TACGTCTGGT ATCAGGGCCG CGGCTGCGCC TGGGCAATCG GAAAGGGCAT GGACCCGCTA
CCAATCCTTG GCGACCTCCT TGGCAAGACG AATGGCGCAA CGGGCGGCAA GGGTGGTGGA
GTCCCGCACT GGGCAGACTA CAGCCTTGGC ATCATGGGCG AGGGCGCGAC GCTTGGTTCC
GTCTATCCGC TTGCGGCCGG TTCTGCCCTT GCCTCGAAGA TCCGCAAGGA CGGCCGTGTC
AGTCTCGCCA ACTTCGGTGA CGGCACTGCC TCGCGCGGGA CGTTCCATGA AACCATGATG
CACGCGGCCG CTTGGAAGTT GCCACTGATC TACTTCTGCG AGAACAACGG CCTCCTTGTC
GGCACGCGGA CCGAGCAGGT CTCGGCGACC GCCGACATCG CGAACCTTGC CAAGGGCTAT
GGCATTCCCG GGGTGATCGT CGACGGGCAG GACGCGGTCG CCGTCTGGGA AGCAACGCGC
GAAGCGGCGG CCCGCGCCCG GGCCGGGAAG GGGCCGACCC TCATCGAGGC AAAGGTTACC
CGCAAGCACG GCCACTACGC CGGCGATCCT CAGCACTATC GCGACCCGGA CTATCTCAGG
GATTATCGCG ATCCGCTGGA CCTTCTCGCC GCAAGGCTGG CCGGAAACGT TGCTGCGCGC
ATCGTCGAGC AGGCCGATGC GGAAGTGGCT GCCGCTTATG AAGCGGCCAG AGCTGCGCCC
GAACCCGATG TCTCGGTGAT CGAGAGGGAC CTTTACCATG TCGTCTGA
 
Protein sequence
MSGPDPALLE SMFHKLAVSR AVETLMLRHT REERFSGWWH PGEGQEAAPI GATAALEADD 
YVWYQGRGCA WAIGKGMDPL PILGDLLGKT NGATGGKGGG VPHWADYSLG IMGEGATLGS
VYPLAAGSAL ASKIRKDGRV SLANFGDGTA SRGTFHETMM HAAAWKLPLI YFCENNGLLV
GTRTEQVSAT ADIANLAKGY GIPGVIVDGQ DAVAVWEATR EAAARARAGK GPTLIEAKVT
RKHGHYAGDP QHYRDPDYLR DYRDPLDLLA ARLAGNVAAR IVEQADAEVA AAYEAARAAP
EPDVSVIERD LYHVV