Gene Saro_1976 details


Gene Information

Locus tag: Saro_1976
Symbol:
ID: 3917294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2093149
End bp: 2094423
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 64%
IMG OID: 640444726
Product: branched-chain alpha-keto acid dehydrogenase E1 component
Protein accession: YP_497250
Protein GI: 87199993
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID:
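
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2093149, 2094423)
print(length, protein_length_aa(length))  # prints: 1275 424
```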


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0574324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGCG GCAATCTGCC GTCGCTATCG CTCCACGTGC CGGAACCGAA GTTCCGGCCG 
GGTGACAAGG TCGATTATTC CGACCTTGCC ATTTCGCGCG CGGGGGAACA GCCGCGACCC
GACGAGCAGT GCGAGGCTTC CGAAACCCAC CCGTTGTGCC TCGATCTGGT GCGCGTGCTT
GGCGATGACG ACCGTGCGAT CGGCCCTTGG GACCCCCGGT TGGACGCCGA CACGCTGCGC
CGCATGCTGC GCACGATGGC GCTGACCCGT GCTTTCGACG ACCGCATGTA TCGCGGCCAG
CGACAGGGCA AGACCAGCTT CTACATGAAG TGCACGGGCG AAGAGGCGAC ATCGGTCGCC
CCGGCCATGG CCTTGGCGGA TGACGACATG GTCTTCCCCA GCTACCGCCA GCAGGGCATC
CTGATCGCGC GTGGCTATCC GTTGGTCGAG ATGATCAACC AGATCTATTC CAATCGTGCC
GACAAGCTGA AGGGACGCCA GTTGCCGATC ATGTATTCGG CGCGCGAGCA GTCGTTCTTC
ACGATCTCGG GCAACCTCGC CACGCAGTAC CCGCAGGCCG TGGGTTGGGC CATGGCAAGC
GCGATCAAGG GCGACAGCCG CATCGCCGCG ACCTGGATCG GCGAAGGGTC CACGGCTGAG
GGCGACTTCC ATTCGGCCAT GACTTTCGCA GCAGTCTACA ATGCGCCCGT CATCTTCAAT
GTGGTGAACA ACCAGTGGGC CATTTCCAGT TTTTCGGGTT TTGCCGGCGC GGAGAGGACG
ACTTTTGCCG CCCGCGCGAT CGGCTATGGC ATCGCCGGCT TGCGGGTGGA CGGTAACGAT
CCGCTTGCTG TCTTCGCGGC AACCCAGTGG GCCGCGAACC GCGCCCGCGC CAATGCCGGC
CCTACGCTGA TCGAGCACTT CACCTACCGT GCCGAGGGGC ACTCGACTTC CGATGATCCC
ACCCAGTACC GTTCCGCGCA GGAGCGGGAG GAGTGGCCGC TGGGCGACCC GGTCAACCGG
CTGAAGAAGC ACCTCGTGGC CCTGGGCGAG TGGTCGGACG AGCAGCACGA GGCGATGGAC
CGTGAACTCG TCGACCTGGT CAAGGCGGCC ACGAAGGAGG CCGAAAAGAA CGGCATCCTG
GGGCACGGGC TGCATCACCC GTTCCATACA ATGTTCGAGG ACGTCTTCGA GGAACTGCCC
TGGCATCTCC GCGAACAGAG CGAGCAGGCA ATCCGCGAGC GTCGGATCAA GTGGCCGGAA
TGGAAAGAGT CATGA
 
Protein sequence
MARGNLPSLS LHVPEPKFRP GDKVDYSDLA ISRAGEQPRP DEQCEASETH PLCLDLVRVL 
GDDDRAIGPW DPRLDADTLR RMLRTMALTR AFDDRMYRGQ RQGKTSFYMK CTGEEATSVA
PAMALADDDM VFPSYRQQGI LIARGYPLVE MINQIYSNRA DKLKGRQLPI MYSAREQSFF
TISGNLATQY PQAVGWAMAS AIKGDSRIAA TWIGEGSTAE GDFHSAMTFA AVYNAPVIFN
VVNNQWAISS FSGFAGAERT TFAARAIGYG IAGLRVDGND PLAVFAATQW AANRARANAG
PTLIEHFTYR AEGHSTSDDP TQYRSAQERE EWPLGDPVNR LKKHLVALGE WSDEQHEAMD
RELVDLVKAA TKEAEKNGIL GHGLHHPFHT MFEDVFEELP WHLREQSEQA IRERRIKWPE
WKES
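
The protein sequence can be checked against the gene sequence by translating the gene codon by codon. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the sequence is pasted with its space-separated blocks of ten, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, pasted as displayed.
first_line = "ATGGCGCGCG GCAATCTGCC GTCGCTATCG CTCCACGTGC CGGAACCGAA GTTCCGGCCG"
print(translate(first_line))  # prints: MARGNLPSLSLHVPEPKFRP
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content fields.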