Gene Saro_2045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2045 
Symbol 
ID3917692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2183030 
End bp2184097 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content67% 
IMG OID640444797 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_497318 
Protein GI87200061 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGCG CACTCGATGG GCTGACCGTA CTCGAATTCG CGGGCATCGG CCCGGGGCCG 
TTCGCGTGCA TGATGCTGGC CGACCATGGC GCCCGCGTCA TCCGCATAGA CCGGCCCTCC
AAGGGCGACC GCGTCGGCGA CAGCGGCAAC CGTGACATTC TCAACCGTAA TCGCGAGCGG
CTGGAACTGG ACCTCAAGGA CCCGGCTTCG ATCGCGCGCA TCCGCGAACT GGTGAAGCAG
GCCGACGCCA TCGTCGAGGG GTATCGCCCC GGGGTGATGG AACGGCTGGG CCTTGGCCCC
GACGTTCTGC TCGCCGACAA TCCCGGGCTG GTCTACGGGC GCATGACCGG CTGGGGACAG
GAGGGGCCGA TGGCGCCGCT CGCCGGACAC GACATCAACT ACATTGCACT GGCGGGCGCG
CTCCACAGCT TCGGGCAAGC GGGCGGAAAG CCACAGTTCC CGGTCAATCT TGTCGGCGAT
TTCGGCGGCG GCGGCATGTT GATGGCGTTC GGCGTGATGG CGGCGGTCTT CTCGGCGCAA
CGCACGGGCA AGGGACAGGT CGTCGATTGC GCGATGGTCG ATGGCGCGGC GATTCTTTCC
GCAATGACCT ACACGTTCCT CGGCAATGGC CGCTGGAAGG ACGAGCGCGG CGTGAACCTG
CTCGACGGCG GGGCCCATTT CTACGACACC TACGAGACGA GCGACGGCAA GTGGATATCG
ATCGGCTCGA TTGAACCCCA GTTCTATGCC CTGCTTCTGG AAAAGACCGG GCTGACAGAC
GATCCCGAAT TCGCGCCGCA GATGGACCCG CGCGTCTGGC CCAGGCTCAA GGACCGGCTT
GCGGCGCTTT TCCTGACCCG CACCCGCGAT GAATGGTGCG CCATCATGGA CGGCACCGAC
ATCTGTTTCG CCCCGGTACT CAGTCTGCGC GAGGCGCCCC GCCATCCGCA CAACGTCGCA
CGGGGGACCT TCGTCGAGGA CGGCGGCATG GTCATGCCCG CGCCCGCGCC CCGCTTTCTC
GGAACGCCGG CGCCGCAGCC CTCGCTGGCC GCGCGCGAGG GCGGCTGA
 
Protein sequence
MPGALDGLTV LEFAGIGPGP FACMMLADHG ARVIRIDRPS KGDRVGDSGN RDILNRNRER 
LELDLKDPAS IARIRELVKQ ADAIVEGYRP GVMERLGLGP DVLLADNPGL VYGRMTGWGQ
EGPMAPLAGH DINYIALAGA LHSFGQAGGK PQFPVNLVGD FGGGGMLMAF GVMAAVFSAQ
RTGKGQVVDC AMVDGAAILS AMTYTFLGNG RWKDERGVNL LDGGAHFYDT YETSDGKWIS
IGSIEPQFYA LLLEKTGLTD DPEFAPQMDP RVWPRLKDRL AALFLTRTRD EWCAIMDGTD
ICFAPVLSLR EAPRHPHNVA RGTFVEDGGM VMPAPAPRFL GTPAPQPSLA AREGG