Gene Saro_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2501 
Symbol 
ID3916822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2702379 
End bp2703551 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content65% 
IMG OID640445258 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_497771 
Protein GI87200514 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA AGCCCGTAAT GCTCGAGGGC ATCCGCGTGG TGGACCTGAC CACGGTCGTT 
TTCGGCCCCT ATGCGACGCA GATCCTGGCG GATCTCGGCG CGGATGTGAT CAAGGTGGAA
TCGCCCGGGA TCGGCGATGC CTTCCGCTGG TCGGCAAAGC CTGCCGTCAC GCCAGGCATG
GCGCCAGCGT GGATGGCGCT CAACCGTGGC AAGAAGTCGG CGGCGCTCGA TCTCAAGGCT
GAAGCGGACC GTTCGGTCAT GCTTGACCTG CTGCGCGAGG CGGACGTTTT CGTCGTTAAC
GTCCGGGGCA AGGCGCTCGA GCGGATCGGG CTCGATTACG ACAGCCTCAA GGCCATCAAT
CCTTCGCTGA TCTACGTTCA CTGTGTCGGC TTCGGGCAGG ATGGGCCCTA TGCCGATCTC
CAGGCCTATG ATGACGTGAT CCAGGCGGCG ACCGGCACGA CCACGCTCCT GCCGCGCGTC
GACGGCAATC CGCACCCGCG CTATCTGCCC TCGCTCATCG CTGACAAGGT GGCGGGCCTG
CATGCGACCT ACGCGGCCTT GGCGGCGATC GTCCACAAGC AGCGAACGGG CGAGGGGCAA
CTGGTGGAAG TGCCGATGTT CGAGGCCTTC TCCAGCTTCA TGCTGCTCGA ACACCTCGGC
GGCCTGACTT TCGACCCGCC GAACGCGCCC GAAGGCTATT TCCGCCAGAT CGATCCGGAT
CGCCAGCCGT TCCCGACCGC TGACGGCTAC GTAAGCATCG TCGCCTATAC CGACGATGCC
TGGCAACGCA TCTTCACCCT GCTGGACCAG CCCGACTTCC TGAAGCAGGA CCACCTTGCC
ACGCCGCAGC AGCGATATGT TGCACAGGCC GAACTCTATC AGGCGATAGC GCGGTTCACG
CCGTTGCTTA CCACGTCGGA GATCGTCAGC CGATGCCATG CAGTGCAGAT ACCGGCCCAG
GCGGTGCGCG ACCTTGCCGA TGTGATGAAG GACCCGCACC TGCAGGCGGT CAACTTCTTC
AGGCGGCGTG TCCACCCGGT CGAGGGCGCC TACTTCGAGC AGGCCGCGCC AGTGAAATTC
GGCGCCGCCG AAGACGGGGA ACGCCTGTCC CCACCACTGG GCGGCGAACA TACCGAGGAA
CTGCGCGCAC GCGGCTGGAA CGCGTTCGGA TGA
 
Protein sequence
MSDKPVMLEG IRVVDLTTVV FGPYATQILA DLGADVIKVE SPGIGDAFRW SAKPAVTPGM 
APAWMALNRG KKSAALDLKA EADRSVMLDL LREADVFVVN VRGKALERIG LDYDSLKAIN
PSLIYVHCVG FGQDGPYADL QAYDDVIQAA TGTTTLLPRV DGNPHPRYLP SLIADKVAGL
HATYAALAAI VHKQRTGEGQ LVEVPMFEAF SSFMLLEHLG GLTFDPPNAP EGYFRQIDPD
RQPFPTADGY VSIVAYTDDA WQRIFTLLDQ PDFLKQDHLA TPQQRYVAQA ELYQAIARFT
PLLTTSEIVS RCHAVQIPAQ AVRDLADVMK DPHLQAVNFF RRRVHPVEGA YFEQAAPVKF
GAAEDGERLS PPLGGEHTEE LRARGWNAFG