Gene Saro_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1054 
Symbol 
ID3916349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1095516 
End bp1097246 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content65% 
IMG OID640443788 
Productamidohydrolase-like 
Protein accessionYP_496333 
Protein GI87199076 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.655132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCGA TGCTAAAGGC AGCTTCCGCG CTTGCCGGTC TTGGCTTGAC GCTTGGTGCA 
GCGGTGCAGG GACACGCTGC GGAAAGCCGG GATTTCGTCA TCACGAACGC GGATGTGCTG
ACGCCGTCCG GTTCCGCCGA GGCTCTGGCA GTCCACGACG GTGTGATCGT GGCTGTCGGT
ACGACCGCGG ATGCCGAGGC AAAACTGCCC GGCGCGCGGC GGATCGACCT GAAGGGTGCG
GCGGTGATGC CGGGGCTGGT CGACAGCCAC GTCCACGTGA CCTTCGCCGG GCTCGAACAG
TTTGCCTGTC GCATCAGGCC CGGTGCCATG GCCAGGGAGA TCGCCGAAAC GGTGAAGGGC
TGCGTTGCCA AGGCCAAGCC GGGCGAATGG ATCAACGGCG GCAACTGGGT GGCGGCGGGT
TTCCGCAAGG GCGAGCAGAA CAAGGCCTTC CTCGATCGCC TCGCCCCGGC AAACCCGGTC
GTTCTGGTGG ACGAATCGCA CCACAGCCTC TGGGTCAATT CCGCGGCCCT GAGGGCTGCC
GGGATCACCC GCCAGACCCC TGATCCGGCA GGGGGCGTGA TCGACCGTGA CGGCAAGGGC
GAACCGACGG GCCTCCTGCG CGAAACGGCG GCGGGGCTCG TTTATTCAGT GGTCCCCGCG
CCCAGCGAGG AGATGCGGCG CGCCGCGCTC AAGCTCTCGA CCGGGCAGAT GCTATCCTAT
GGCATCACCG CATTTGCCGA TGCCGGGGTG ACGATGGCGG ATGTCGGAAC GCTGTCGGCG
CTTTCGGCCG AAGGCGTGCT CAAGCAGCGG GTGCGCGGCT GCATGCGCTG GACGCCGCTG
CTGGGAGACA CGCCCGAAGC CAACGGCATG GCGCTGATCA ATGCGCGCGC CGCCTATTCC
ACGCCGCGCT TCCGGCTGGA TTGCGTCAAG GTCGTGCTCG ATGGCGTCCC CACCGAGAGC
CGCACCGCCT ATATGCTCGA TCCCTATCTG GCGCATGGCC ATGATGACGT GCCGACGCGG
GGGCTGCCGA TGATCACGCC CGACCGGCTG AACCCGGCCA TCGCCGCGTT CGACAGGATG
GGCCTTACGG TGAAGTTCCA TGCGGTTGGC GATGCCGCCG TGCGCGAGGC CATCGATTCG
GTTGCCAATG CCCGCAAGGT CAATGGCTGG GGCGGACCAT CGCACGACGT CGGCCATAAC
AGTTTCGTCT CCCCCGAGGA TATCACCCGC GTGCGCGATC TGCAGATGAC ATGGGAATTC
TCGCCCTACA TCTGGTATCC CACGCCGATC GCTTCCAAAG ATATCCGTGG CGTGATCGGC
GACGAGCGGA TGAAGCGGTG GATTCCCATT CGCGATGCGC TCGAAACCGG CGCGCTGGTC
GTTGCCGGAT CTGACTGGTC GGTCGTTCCG TCGGTCAATC CGTGGATCGC CATCGAAACG
ATGGTCACCC GCCAGATCCC GGGAGGCAGC GCGGAGACAT TGGGCGAAGG GCAGAAGATC
ACTCTTGCCC AGGCTCTGCG CATCTTCACC GAGAACGGCG CCAGCTTCCT TGGTCAGCGC
GACCAGTTCG GCAGCATCGA GACCGGCATG AAGGCCGATT TCATCGTCGT GGAGCGCAGT
CCCTACAAGG TCCCTGTGAA CGAAATCCAC AAGACCAAGG TGTTGCAGAC GTTCATTGAC
GGCGAGCAAG TCTACCTGTC TTCCGAAGCA ACAGGTCAGG GCGCTCCATG A
 
Protein sequence
MKAMLKAASA LAGLGLTLGA AVQGHAAESR DFVITNADVL TPSGSAEALA VHDGVIVAVG 
TTADAEAKLP GARRIDLKGA AVMPGLVDSH VHVTFAGLEQ FACRIRPGAM AREIAETVKG
CVAKAKPGEW INGGNWVAAG FRKGEQNKAF LDRLAPANPV VLVDESHHSL WVNSAALRAA
GITRQTPDPA GGVIDRDGKG EPTGLLRETA AGLVYSVVPA PSEEMRRAAL KLSTGQMLSY
GITAFADAGV TMADVGTLSA LSAEGVLKQR VRGCMRWTPL LGDTPEANGM ALINARAAYS
TPRFRLDCVK VVLDGVPTES RTAYMLDPYL AHGHDDVPTR GLPMITPDRL NPAIAAFDRM
GLTVKFHAVG DAAVREAIDS VANARKVNGW GGPSHDVGHN SFVSPEDITR VRDLQMTWEF
SPYIWYPTPI ASKDIRGVIG DERMKRWIPI RDALETGALV VAGSDWSVVP SVNPWIAIET
MVTRQIPGGS AETLGEGQKI TLAQALRIFT ENGASFLGQR DQFGSIETGM KADFIVVERS
PYKVPVNEIH KTKVLQTFID GEQVYLSSEA TGQGAP