Gene Saro_0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0799 
Symbol 
ID3915853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp849016 
End bp850071 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content63% 
IMG OID640443530 
Productamidohydrolase 2 
Protein accessionYP_496078 
Protein GI87198821 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAG ACCTTAAGAC CGGCGGCGAG CAGGGCTACC TGCGCATCGC CACCGAGGAA 
GCCTTCGCCA CGCGCGAGAT CATCGACGTC TACCTGCGCA TGATCCGCGA TGGCACTGCC
GACAAGGGCA TGGTCTCGCT CTGGGGCTTC TACGCCCAGT CCCCCTCAGA GCGCGCCACC
CAGATCCTCG AACGCCTGCT CGATCTTGGC GAGCGCCGCA TCGCCGACAT GGACGCGACC
GGCATCGACA AGGCTATCCT CGCGCTGACC TCGCCCGGCG TCCAGCCGCT GCACGACCTT
GACGAGGCCA GGACGCTCGC CACCCGCGCC AACGACACGC TTGCCGACGC GTGCCAAAAG
TACCCAGACC GCTTCATCGG CATGGGCACC GTCGCCCCGC AGGACCCGGA ATGGTCCGCG
CGCGAGATCC ATCGTGGTGC CAGGGAACTG GGCTTCAAGG GCATCCAGAT CAACAGCCAC
ACGCAAGGGC GCTACCTCGA CGAGGAGTTC TTCGACCCGA TCTTCCGCGC CCTCGTTGAA
GTCGACCAGC CGCTCTACAT CCACCCTGCC ACTTCGCCCG ATTCCATGAT CGACCCGATG
CTCGAAGCGG GCCTCGACGG CGCCATCTTC GGCTTCGGCG TGGAGACGGG CATGCACCTG
CTGCGCCTCA TCACCATCGG CATCTTCGAC AAGTATCCCA GCCTTCAGAT CATGGTCGGC
CACATGGGCG AGGCGCTGCC CTACTGGCTC TACCGCCTGG ACTACATGCA CCAGGCCGGT
GTCCGCTCGC AGCGCTACGA ACGCATGAAG CCCCTGAAGA AGACCATCGA GGGCTACCTC
AAGTCCAACG TCCTCGTCAC CAATTCGGGC GTCGCGTGGG AACCTGCGAT CAAGTTCTGC
CAGCAGGTCA TGGGCGAGGA CCGCGTTATG TACGCGATGG ACTACCCCTA CCAGTACGTT
GCCGACGAGG TGCGCGCGAT GGACGCCATG GACATGAGTG CGCAAACGAA GAAGAAGTTC
TTCCAGACCA ACGCGGAGAA GTGGTTCAAG CTTTGA
 
Protein sequence
MTQDLKTGGE QGYLRIATEE AFATREIIDV YLRMIRDGTA DKGMVSLWGF YAQSPSERAT 
QILERLLDLG ERRIADMDAT GIDKAILALT SPGVQPLHDL DEARTLATRA NDTLADACQK
YPDRFIGMGT VAPQDPEWSA REIHRGAREL GFKGIQINSH TQGRYLDEEF FDPIFRALVE
VDQPLYIHPA TSPDSMIDPM LEAGLDGAIF GFGVETGMHL LRLITIGIFD KYPSLQIMVG
HMGEALPYWL YRLDYMHQAG VRSQRYERMK PLKKTIEGYL KSNVLVTNSG VAWEPAIKFC
QQVMGEDRVM YAMDYPYQYV ADEVRAMDAM DMSAQTKKKF FQTNAEKWFK L