Gene Saro_2814 details

Gene Information

Locus tag: Saro_2814
Symbol:
ID: 3916974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3037746
End bp: 3038771
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 64%
IMG OID: 640445593
Product: amidohydrolase 2
Protein accession: YP_498084
Protein GI: 87200827
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
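
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the final codon is a stop codon."""
    return gene_len_bp // 3 - 1


length = gene_length(3037746, 3038771)
print(length, "bp")                   # 1026 bp
print(protein_length(length), "aa")   # 341 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.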


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.162302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATGA TCATCGACTG CCACGGTCAC TACACCGTGC TGCCGAAGGC GCACGACGAG 
TGGCGCGAGA AGCAGAAGGC CGCATTCAAG GCCGGGACCG AGTGCCCGCC CTATCCCGAG
ATCTCAGACG ACGAGATCCG CGAGACGATC GAGAGCAACC AGTTGCGCCT GCTCAAGGAG
CGCGGCGCGG ACATGACGAT CTTTTCCCCC CGCGCGAGCG CGATGGCGCC GCATGTGGGC
GACCAGTCGG TCGCGGTGAA GTGGGCGCAG GTCTGCAACG ACCTGATCGC GCGCGTGGTC
CGGCTCTACC CCGAGACCTT CGCGGGCGTG TGCATGCTGC CGCAGTCGCC GGAAGCGGAC
ATGACCAGCT CCATCGCGGA GCTGGAGCGC TGCGTGAACG AACTGGGCTT CATCGGCTGC
AACCTCAATC CCGATCCGGG CGGCGGGCAC TTCAAGCATC CTCCCCTGAC GGACGAATAC
TGGTTCCCGT TCTACGAGAA GATGGTCGAG CTGGACGTTC CGGCGATGAT CCACGTCTCG
GGTTCGTGCA ACCCGGCGAT GCACGCGACA GGCGGCTACT ACATCGCGGC CGACACCATC
GCGTTCATGC AGCTTCTGGA GGGCGACCTG TTCAGCAGGT TCCCGACCCT GCGCTTCATC
ATCCCGCATG GCGGCGGCGC GGTGCCCTAT CACTGGGGAC GCTATCGCGG GCTGGCCGAC
ATGCTGAAGA AGCCCGGCCT CGACACGCAC CTGATGAACA ACGTGTTCTT CGACACCTGC
GTCTATCACC AGCCCGGGAT CAACCTGCTG GCCGACGTGA TCGAGAACAA GAACATCCTG
TTCGGATCGG AAATGGTCGG CGCGGTGCGC GGGATCGATC CGACGACCGG GTTCTATTTC
GACGACACCA AGCGCTATGT CGACGCGCTC GACATCAGCG ATGCTGAACG CCACGCGATC
TTCGAGGGCA ACGCGCGCCG CGTGTTCCCG CGCCTCGACG CCAAGCTGAA GGAGAGGGGC
CTGTGA
 
Protein sequence
MTMIIDCHGH YTVLPKAHDE WREKQKAAFK AGTECPPYPE ISDDEIRETI ESNQLRLLKE 
RGADMTIFSP RASAMAPHVG DQSVAVKWAQ VCNDLIARVV RLYPETFAGV CMLPQSPEAD
MTSSIAELER CVNELGFIGC NLNPDPGGGH FKHPPLTDEY WFPFYEKMVE LDVPAMIHVS
GSCNPAMHAT GGYYIAADTI AFMQLLEGDL FSRFPTLRFI IPHGGGAVPY HWGRYRGLAD
MLKKPGLDTH LMNNVFFDTC VYHQPGINLL ADVIENKNIL FGSEMVGAVR GIDPTTGFYF
DDTKRYVDAL DISDAERHAI FEGNARRVFP RLDAKLKERG L
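
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the helper names are hypothetical, the codon table uses the standard amino acid assignments (which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG), and the input is assumed to contain only A, C, G, and T plus whitespace.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGACCATGA TCATCGACTG CCACGGTCAC"
print(translate(first_blocks))  # MTMIIDCHGH
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.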