Gene Saro_1679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1679 
Symbol 
ID3916254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1763443 
End bp1764999 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content67% 
IMG OID640444420 
Productbenzoylformate decarboxylase 
Protein accessionYP_496953 
Protein GI87199696 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACCG TCCGTCACGT TACCATCGAC CTTCTGCGCG AACTCGGCAT GACTACCGTT 
TTCGGCAATC CGGGCTCCAC CGAACTTCCG ATGTTCCGCG ATTTTCCGGA CGATTTCCGC
TATGTCATGG GCCTTCAGGA AAGCGTCGTG CTGGGCATGG CCGATGGCTT TGCGCAAGGG
ACGGGCCGGG CAGCCATCGT CAACCTGCAC AGTTCGGCGG GCGTCGGGCA CGCGCTCGGC
AACCTCTTTA CCGCGTTCAA GAACCAGACT CCGCTAGTGG TGACGGCGGG GCAACAGGCG
CGCTCGATCC TGCCTTACGA GCCCTTCCTC TTCGCCGAAC GCGCCAGTGA GTTTCCGCGT
CCGTTCGTGA AATGGTCCTG CGAACCGGCC CGGGCGGAAG ACGTTCCCGC CGCGATCCAC
CGCGCCTGGC TGGTGGCGAT GGAGCCGCCT TGCGGGCCGA CCTTCGTTTC GATCCCCATC
GACGACTGGG ATCGGTCGTG TGAGCCGTTC GCGGCGCGAA AGGTTCATGC AGACCGGGCT
GGCGATCCTG CTGCTCTCGC CGCGTGCGCG ACGTCGATGG CGCAGGCGCG ACGTCCTGCC
ATCGTCGTTG GTGCGGGGGT GGCGCGCGAT GGCGCCTGGG ACCGCGTGAT CGCCCTTGCG
GAGCGTCATC AGGCGGCGGT CTGGGTGGCG CCGATGTCCG CCCGGTGCAG TTTTCCGGAA
GACCACCCGC TTTTCGCGGG GTTCCTTACG GCCGGGCGCG AGGCCATCGT GGCCACGCTT
TCGTCCCACG ACTTCGTGCT TGCGCTGGGC GGGCCGATGA ACCTCTACCA CGTCGAGGGA
CATGGGCCGC ATATGCCGGA GGGCTGCGAT GTCTGGCTGA TCGGTGACAA CCATATCCAT
GCCGCATGGG CGCCGGCGGG AACTGCCATC GTGGCGCAGT GCGACAGGGC GCTCGATGCC
TTGCTGGCCG GGCCGGAGCC GGTGCAGCGC GATGCTCCAC CCGTGCGCCA GCGCCTGCCG
CGGCTTGATG GCGCGGCTCT TACCGATGCC TATGTCCTCC AGCGCCTTGC CGCCCTGCGC
AGTCCCGAGA GCATCGTCGT GGAGGAAGCG CCTTCGAGCC GGGGGCCGAT GCACGAACAC
CTGCCGATCC TGCGCAAGGA CACCTTCTAT ACCACGGCCA GCGGCGGGCT CGGCCACGGG
CTTCCGGCCG CTGTGGGCAT GGCGATGGCG CGGCCGGACG ACAAGGTCAT TGCGCTGCTT
GGCGATGGTT CCGCCATGTA CGCGATCCAG GGGCTCCACG CGGCGGCGCA GCATGGCCTG
CCGGTCAGCT TCCTCATCCT CAAGAACAAC CGCTATGAGG CGCTGCACCA TTTCGGGCGT
CATTTCGGCA TGCAGCAGCT CGTCGGCACG CAGTTCCCGG AACTGGACTT CTGCAAGCTT
GCCGAAGGCC ACGGCATCGT CGCACGCCGC GCCGAGGACG CGGCATCGCT CGACGAGGCG
CTGCGCTGGT CGCTCGCCGC CGACGTGCCG ACGCTGGTCG AAGCGGTGGT GCTGTGA
 
Protein sequence
MSTVRHVTID LLRELGMTTV FGNPGSTELP MFRDFPDDFR YVMGLQESVV LGMADGFAQG 
TGRAAIVNLH SSAGVGHALG NLFTAFKNQT PLVVTAGQQA RSILPYEPFL FAERASEFPR
PFVKWSCEPA RAEDVPAAIH RAWLVAMEPP CGPTFVSIPI DDWDRSCEPF AARKVHADRA
GDPAALAACA TSMAQARRPA IVVGAGVARD GAWDRVIALA ERHQAAVWVA PMSARCSFPE
DHPLFAGFLT AGREAIVATL SSHDFVLALG GPMNLYHVEG HGPHMPEGCD VWLIGDNHIH
AAWAPAGTAI VAQCDRALDA LLAGPEPVQR DAPPVRQRLP RLDGAALTDA YVLQRLAALR
SPESIVVEEA PSSRGPMHEH LPILRKDTFY TTASGGLGHG LPAAVGMAMA RPDDKVIALL
GDGSAMYAIQ GLHAAAQHGL PVSFLILKNN RYEALHHFGR HFGMQQLVGT QFPELDFCKL
AEGHGIVARR AEDAASLDEA LRWSLAADVP TLVEAVVL