Gene Saro_0605 details

Gene Information

Locus tag: Saro_0605
Symbol:
ID: 3915617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 649374
End bp: 651143
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 70%
IMG OID: 640443335
Product: allophanate hydrolase
Protein accession: YP_495886
Protein GI: 87198629
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases
TIGRFAM ID: [TIGR02713] allophanate hydrolase
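
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both of those readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(649374, 651143)
print(length)                     # 1770
print(protein_length_aa(length))  # 589
```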


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCGG CGCCGCGCCC CACCGCCCGC GCCATTGCCG CAGCGGTCAA CTCCGGGCAG 
ACCACCGCAC TGGCCGTGGC AGAGGCCACC CTGCTCCGGC TCGCCGCCTA CGACGCGGTC
CAGCCGCAAA TCTGGATCAG CCGCGCCAGT CCCGAAGCCC TGCTGGCAGC CGCTCGCGCC
ATCGATGTGC GCCTCGCCGC CGGAGAGGAC CTCCCGCTCG CGGGCGTCCC CTTCGCGGTA
AAGGACAACA TCGACGTCGC CGGCTTCCAC ACCACTGCCG CCTGCCCGGC CTTCGCCTAT
CGGCCCGCAA CGTCGGCGAC GGTTGTGGAG CGCCTGCTCG CGGCGGGCGC GCTCTGCGTC
GGCAAGACCA ACCTCGATCA GTTCGCGACC GGCCTCAACG GCACGCGCAG CCCCTATGGC
GCCCCGCGCA ATGCCCACAA CCTCGCCTAT GTCAGTGGCG GCTCCAGCTC CGGCTCGGCC
AGCGCGGTGG CGGCCGGGCT CGTCGCCTTC GCCCTCGGCA CGGACACCGC AGGGTCCGGG
CGCGTACCCG CCGCGTTCCA GCACCTCGTC GGCTTCAAGC CGAGCAAGGG CCGGTGGAGC
AATCGCGGGC TCGTCCCGGC GTGCCGCACG CTCGACTGCA TAACCGTCTT CGCCCACGAC
ACCGCCGATG CGCGCATTGT CGACGGCATC GTCGCGGGGT TCGATCCGGC CGACGCCTTC
TCCAAACCAC TCGCGGACCG ACCACGGAAG ATGCGCGCCA TCGGCGTGCC CCGCCGCGAC
CAGCGCGCCT TCTTCGGCGA TGTCGAGGCC GAACATCTCT ACGACCGCGC CCTGGACCGG
CTTTCGACGC TCGGCCGGAT CGTCGAGATC GACTATGCCC CGCTGCAGGA AGCAGCGCAG
CTTCTCTACG GGGGCCCCTG GGTTGCCGAA CGGACCGCCG CACTGGCCGG CCTCCTTGCC
GACAATCCCG ACGCCCTGGA CCCGACCGTC CGGGAAGTCG TGGCGCCCGG GCAGGACATC
GGCGCGGTAG ACCTGTTCAA CGGCATCTAC CGGCTCGCCG AACTGAAACG ACACGCCGAC
ACGCTCTGGG AAAGCATCGA CCTGCTGGCC TTTCCCACCA CGGGCACGAC CTATCGGGTG
GCGGAATTGC TGGCCGCACC AATCGCACTC AACAGTGCGC TTGGGTACTA TACCAACTTC
GTGAACCTGC TCGACATGGC CGCGCTCGCC GTGCCGGCCG GCTCGCGGGC CAACGCGACC
GGCTTCGGCG TGACCCTGAT CGGGCCGGCC GACACCGACC TGGCGCTTCT CGACGCGGCG
GAAGCCTATC TGTCCGTGGC AGATCTCCCA CCACCACCTC CGCTCGACCT GGAGGGCAAG
ATGCAGACCG TGAAACTCGC CGTCGTTGGC GCCCATCTCA AGGACATGCC GCTCCACTGG
CAACTCACCT CGCGCGACGC GAAATTCGTG GGCGCGTTCG AAACCGCCCC CAACTACCGC
CTCTACGCCA TGGCCGACAG CGTGCCGCCC AAGCCTGCGC TGATCCATAG CGAGGACGGC
GGCGCCATCG CTATCGAGGT CTACGAACTG GGGGTCGCCG AATTCGGCAG CTTCGTGGCC
GAAGTGCCGC CGCCGCTGGC GATCGGCACG GTCACGCTTG CCGATGGCAG CAGCGTCAAG
GGCTTCGTCG CGGAACCCCG CGCCCTCGTC GGCGCGCGGG ACATCACCCA CCTTGGCGGC
TGGCGCGCCT TCGTTGCGGC GGGAGCATGA
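
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a rough sketch in Python; the function names are hypothetical and only the first line of the sequence is used as sample input, so the printed fraction describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small sample.
first_line = "ATGACGGCGG CGCCGCGCCC CACCGCCCGC GCCATTGCCG CAGCGGTCAA CTCCGGGCAG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of this sample line only
```

Passing the full 1770 bp sequence to the same functions would give a value to compare against the GC content field.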
 
Protein sequence
MTAAPRPTAR AIAAAVNSGQ TTALAVAEAT LLRLAAYDAV QPQIWISRAS PEALLAAARA 
IDVRLAAGED LPLAGVPFAV KDNIDVAGFH TTAACPAFAY RPATSATVVE RLLAAGALCV
GKTNLDQFAT GLNGTRSPYG APRNAHNLAY VSGGSSSGSA SAVAAGLVAF ALGTDTAGSG
RVPAAFQHLV GFKPSKGRWS NRGLVPACRT LDCITVFAHD TADARIVDGI VAGFDPADAF
SKPLADRPRK MRAIGVPRRD QRAFFGDVEA EHLYDRALDR LSTLGRIVEI DYAPLQEAAQ
LLYGGPWVAE RTAALAGLLA DNPDALDPTV REVVAPGQDI GAVDLFNGIY RLAELKRHAD
TLWESIDLLA FPTTGTTYRV AELLAAPIAL NSALGYYTNF VNLLDMAALA VPAGSRANAT
GFGVTLIGPA DTDLALLDAA EAYLSVADLP PPPPLDLEGK MQTVKLAVVG AHLKDMPLHW
QLTSRDAKFV GAFETAPNYR LYAMADSVPP KPALIHSEDG GAIAIEVYEL GVAEFGSFVA
EVPPPLAIGT VTLADGSSVK GFVAEPRALV GARDITHLGG WRAFVAAGA
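
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons. That difference does not matter here because the gene begins with ATG. The table construction and the `translate` helper are illustrative, not taken from the source database.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (standard code assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "ATGACGGCGG CGCCGCGCCC CACCGCCCGC GCCATTGCCG CAGCGGTCAA CTCCGGGCAG"
print(translate(first_line))  # MTAAPRPTARAIAAAVNSGQ
```

The output matches the first 20 residues of the protein sequence listed above.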