Gene Saro_3877 details


Gene Information

Locus tag: Saro_3877
Symbol:
ID: 5077488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 47125
End bp: 48585
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 63%
IMG OID: 640480986
Product: UbiD family decarboxylase
Protein accession: YP_001165648
Protein GI: 146275487
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
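
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 47125
end_bp = 48585
gene_length_bp = 1461
protein_length_aa = 486


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Both checks depend only on the span, so they hold regardless of the strand, which the record leaves blank.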


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.358634
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATGA ACGATCTCCC TAACCGCGCC CGCTCGATCT CGTCGCTGCG CGACTTCCTC 
GAACTGCTCG AGGATGCCGG CCAGGCGATC ACCTGGAGCG ATGCGGTGAT GCCCGAACCC
GGCGTGCGCA ACATAGCCGT CGCCGCATCG CGCGATGCCA ACGGCGCGCC GGCGATCGTA
TTCGACAATA TCACCGGTTA CCCCGGCAAG CGCTTGGCGG TGGGCGTCCA TGGTTCGTGG
GACAACATCG CCCTGCTGCT GGGCCGACCT AAAGGCACGA CCATCCGCGA GCTTTTCTTC
GAGATCGCCG GCCGCTGGGG CGATCAGGAA GCGCAAATCA GCTTTGTCCC AGAAGCCCAG
GCCCCGGTGC ACGAATGCCG GATCGAACAG GACATCAACC TTTACGATGT CCTGCCGGTC
TATCGGATCA ACGAATACGA TGGCGGGTTC TACATCGGCA AGGCCTCGGT CGCCTCGCGC
GATCCGCTCG ATCCAGACAA TTTCGGCAAG CAGAATGTCG GCATCTATCG CCTGCAGATC
CAGGGGCCGG ACACCTTCAC CCTGATGACG ATCCCCTCCC ACGACATGGG ACGTCAGATC
ATGGCGGCCG AACGGGAAGG CGTTCCGCTA AAGATTGCGG TCATGCTGGG TAATCATCCC
GGCCTTGCGG TGTTTGCTGC CACCCCGATC GGCTACGAGG AATCGGAATA TTCCTATGCC
TCGGCGATGA TGGGCGCGCC AATCCGGCTG ACCAAATCGG GCAACGGGAT CGACATCCTG
GCCGACAGCG AAATCGTGAT AGAGGCCGAA CTGCAACCGG GTGGACGCGA GCTGGAAGGG
CCGTTCGGCG AATTCCCCGG TTCCTACAGC GGCGTGCGCA AGGCGCCGAT CTTCAAGGTC
ACGGCGGTGT CGCACCGGCG CGATCCGATC TTCGAGAACA TTTACATCGG GCGCGGCTGG
ACCGAGCACG ATACGCTGAT CGGCCTGCAC ACCTCCGCCC CGATCTATGC CCAGCTGCGC
CAGAGCTTCC CCGAAGTCAC CGCGGTCAAC GCGCTTTACC AGCACGGACT GACCGGGATC
ATCTCGGTCA AAAACCGCAT GGCCGGCTTT GCCAAGACGG TCGCGCTGCG CGCGCTGAGC
ACGCCGCACG GCGTGATGTA CCTCAAGAAC CTGATTATGG TCGATGCCGA TGTCGATCCG
TTCGATCTCA ACCAAGTGAT GTGGGCGCTT TCGACCCGCA CCCGTGCGGA CGATATCATC
GTGCTGCCCA ACATGCCTGC CGTGCCGATC GATCCTTCGG CAGTGGTCCC GGGCAAGGGG
CACCGCCTGA TCATCGACGC GACCAGCTAT CTCCCGCCCG ATCCGGTGGG TGAAGCGCAC
CTTGTCACCC CGCCGACCGG GGACGAGATC GACGCCCTGA GCAAGCGGAT CCGCGAAATG
CAGCTGGGAG CCCTGTCATG A
 
Protein sequence
MTMNDLPNRA RSISSLRDFL ELLEDAGQAI TWSDAVMPEP GVRNIAVAAS RDANGAPAIV 
FDNITGYPGK RLAVGVHGSW DNIALLLGRP KGTTIRELFF EIAGRWGDQE AQISFVPEAQ
APVHECRIEQ DINLYDVLPV YRINEYDGGF YIGKASVASR DPLDPDNFGK QNVGIYRLQI
QGPDTFTLMT IPSHDMGRQI MAAEREGVPL KIAVMLGNHP GLAVFAATPI GYEESEYSYA
SAMMGAPIRL TKSGNGIDIL ADSEIVIEAE LQPGGRELEG PFGEFPGSYS GVRKAPIFKV
TAVSHRRDPI FENIYIGRGW TEHDTLIGLH TSAPIYAQLR QSFPEVTAVN ALYQHGLTGI
ISVKNRMAGF AKTVALRALS TPHGVMYLKN LIMVDADVDP FDLNQVMWAL STRTRADDII
VLPNMPAVPI DPSAVVPGKG HRLIIDATSY LPPDPVGEAH LVTPPTGDEI DALSKRIREM
QLGALS
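
The gene and protein blocks above can be compared programmatically. The following is a minimal sketch, not the database's own pipeline: it strips the spacing used in the listing, computes the GC fraction, and translates codons until the first stop. The sense-codon assignments of translation table 11 are the same as in the standard genetic code, so a standard codon table is used here; the alternative start codons that table 11 permits are not handled, which does not matter for this gene because it begins with ATG. All function names are hypothetical.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(block.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing symbols other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listing above.
first_codons = "ATGACCATGA ACGATCTCCC TAACCGCGCC"
last_codons = "CAGCTGGGAG CCCTGTCATG A"
print(translate(first_codons))  # MTMNDLPNRA
print(translate(last_codons))   # QLGALS
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field reported in the record.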