Gene Saro_0606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0606 
Symbol 
ID3915618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp651140 
End bp654733 
Gene Length3594 bp 
Protein Length1197 aa 
Translation table11 
GC content68% 
IMG OID640443336 
Productallophanate hydrolase subunit 2 
Protein accessionYP_495887 
Protein GI87198630 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0511] Biotin carboxyl carrier protein
[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0957568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTCG ACACCGTCCT CATCGCCAAT CGCGGCGCCA TCGCCACGCG GATCATCCGT 
ACCCTGCGGC GCATGGGACT GCGTTCGGTC GCGGTCTATT CCGAGGCCGA CAAGGATTCG
CTGCACGTCG TGCTTGCCGA CGAGGCAATC TGCATAGGCG CGGCGCGGGC GGCGGAAAGC
TACCTCAACA TCCCTGCAAT TCTCGATGCC GCCCGCCGCA CCGGCGCCGG GGCAATCCAC
CCCGGCTACG GCTTCCTTGC CGAGAACGTG GAGTTCGCCG AAGCCTGCGA GAAGGAAGGC
ATCGTCTTTA TCGGTCCCAC GCCCAACAAC ATCAGGACCT TCGGCCTGAA GCACAGCGCC
CGCGCGCTTG CCGCGGCCCA CGGTGTCCCG CTGGCTCCCG GCACCGATCT GCTGACCGAC
GAGACCGAGG CGGTCCAGGC CGCGAACGGC ATCGGATACC CGGTAATGCT CAAGGCCACC
GCTGGAGGCG GCGGCATCGG CATGCGCGTC TGCGAGGACG AAGCGGACGT CCGCGAAGGC
TTTTCCGCGG TGGCGCGGCA GGGCCTCGGC AATTTCGGCG ATGCCGGCGT CTTTCTCGAG
CGCTACATTC GCCAGGCGCG GCACATCGAG GTTCAAATCT TCGGCGACGG TCGCGGCCGT
ATCGTGGCGC TCGGCGAACG GGACTGCTCG CTCCAGCGGC GCAACCAGAA GGTCGTGGAG
GAAGCGCCCG CTCCCCTCCT TCCACCGGCG GTGCGCAGCG AACTGATCGC CGCCGCCATC
CGCCTCGGTC AGGCGGCCGG CTATCGCTCG GCCGGAACGG TCGAATTTCT CTATGATGCC
GAGCGCGAGG AATTCTTCTT CCTCGAAATG AACACCCGCC TCCAGGTCGA ACACGGCGTT
ACCGAAGAGG TCATGGGCGT CGATCTGGTC GAATGGATGG TCCGGGGCGG CGGCGGCGAT
TTCGGGTTCC TCGACGACGA CCCGCCGCGT CCATCCGGGC ACTCGATCCA GGTCCGGCTC
TATGCCGAAG ATCCCGCACT CGACTATCGT CCGACCTCGG GCACCCTTAC CGCAGTGACT
TTCCCGGAAG GCGTCCGGAC AGAGACATGG TGCATGGCCG GAACCACGGT CAGCACGTGG
TACGACCCGA TGCTGGCCAA GCTGATCGTC CACGCCGAAT CACGCGAGGC AGCAGTGGCG
GCGATGCAGG ACGCGCTCGA CCGCAGTCGC ATTGACGGGT TCGAAACCAA CCTGCGCTGG
CTGCGTGATG TCGTGCGCTC GCCCGCCTTC ACCAGCGGCG AGGTCTCCAC CCGCGCCCTG
TCTCACGTCG CTCACGTGCC ACGCAGCATA ACCGTCGTCA GCGGCGGCAC CGCGACGATG
GCGCAGGACT GGCCCGGACG GCAGCGGCTC TGGGCAGTGG GCGTCCCGCC ATCCGGACCG
ATGGACGACC TGTCGTTCCG GCTCGGCAAC CGCCTGCTCG GCAATCCCGA GGGTACTGCC
GGTCTCGAAG TGGCGATCAC CGGCCCGACC CTCACGTTCA ATACAGCCGC CCGCGTGTGC
GTGACCGGCG CGGATTTCGG GGCGCGGCTC GACGGGCAGC CTGTCCCGCG CGGCATGGCC
ATCGACATTG CCGCTGGCCA GACGCTCGCG CTCGGTCGCG CCTCGGGCGG AGGGATGCGC
GGCTACATCC TGTTTGCCGG CGGCCTCGAC ATCGCGCCGT ACCTCGGCAG CCGCAGCACG
TTCGAACTCG GCCAGTTCGG CGGTCACGCC GCGCGCCGCC TGCTGGCGGG CGACACGCTC
CATCTCGGCG ACGAACCGGC GCAGCCCGCC CTTCCCGCCG CAAACCTGCC CGAACTGTCG
AACGAATGGG CGCTGCGCGT CATGTACGGG CCGCACGGCG CACCCGACTT CTTCACCCGC
GAAGACATCG ACACGCTGGT CGCCGCTGAA TGGCAGGTGC ACTACAACAG CAACCGCACT
GGCATCCGGC TTGTCGGCCC CAAGCCGCAG TGGGCGCGCG AGGACGGCGG CGAGGCGGGC
CTGCACCCCT CGAACATCCA CGACAATCCC TATGCGATCG GCGCGGTCGA CTTCACCGGC
GACATGCCGA TCATCCTCGG ACCGGACGGT CCCTCGCTCG GCGGCTTCGT CTGCCCCTTC
GTGGTGATCG CGGCGGACCG CTGGAAGATC GGCCAGCTCA CGCCGGGCGA CAAGCTCCGC
TTCGTTCCGG TAAATTGCGC GGATGCCGCC GCGGCGAACG ACCAGCAACG GCGCTTCCTC
GAAACCGGCA AGCCCGCGCA GGGCTGTCCG GGGAGGCCGA TGGAGACGCT CTCGCCCATC
CTCGCCGTCA TCGATGAAAG CCCGCGCAGC CCCCGGACCG TCTATCGCCA GCAGGGCGAC
CGCAACATCC TGGTAGAATA CGGGCCGATC GTACTCGACA TCGAATTGCG CATCCGCGTG
CAGGCGCTGA TGACCGAGCT GGAACGGCTC GCCCTGCCCG GCGTGATCGA CATCGTCCCG
GGCATCCGCT CGCTCCAGTT CCATTTCGAC GGCGATGCGA TGACCCAGGA GGCCGCGCTC
TCGGTCCTGA TCGCGGCAGA GGAGCGGCTG GGCGACCTCG AGGACTTCAC CATCCCCTCG
CGCATCGTCC ATCTGCCGCT GAGCTGGCGC GATCCGGCCA CGATCGAAAC GATCGAGAAG
TACATGGGCG CCGTGCGCGA CGACGCGCCA TGGTGTCCCG ACAACATCGA GTTCATACGC
CGGATCAACG GCCTGCCCGA CGTGGCGGCG GTGGAGAACC TGATCTTCGA GGCGAACTAC
CTCGTCCTCG GCCTGGGCGA CGTCTATCTC GGTGCACCGG TGGCGACGCC CGTCGATCCC
CGCCATCGGC TGGTCACGAC CAAGTACAAC CCGGCGCGCA CCTGGACGCC GCCGAACGTG
GTCGGCATCG GCGGTGCCTA CATGTGCATC TACGGGATGG AAGGCCCCGG CGGCTACCAG
CTCTTCGGCC GCACCATCCA GGTGTGGAAC ACCCACCGCC AGACCGATGC CTTCATCGAT
GGCAAGCCGT GGCTGCTGCG CTTCTTCGAC CAGATCCGCT TCTACCCGGT CAGCGCCGAA
GAGCTTGAAG AATGGCGGCG GGACTTTCCC GCGGGCCGGC GCTCGATCCG GATCGAGCCT
TCCGAGTTCC GCCTTGCCGA CTACCGTCGC TACCTGGCCG ACAATGCCGA AGGGATCGCC
GAGTTCGAGG CAAGGCGTCA GGCCGCTTTC GACGAGGAAC GCGCCGAATG GCAAAGGCGC
GGAGAATTCG ACCGCACCGA CCTGGTCGAA CCCGAAGCGG CCGAAGCGGG CACCGTCGAA
GTGCCCGATG GCGCGGACCT CGTCGAAGCA CCCTTCGGCG GGAGCGTCTG GAAAATGCTC
GTGTCGGTGG GCGACGAGGT CGAGGCCGGC GAGACCATCG CGATCATCGA GGCGATGAAG
ATGGAATGCC GGGTCGAAAG TCCGGGGGCG GGCACTGTCG CCGCGCTCTA TGCGCAGGAG
CGCCAGTCGG TCCAGCCCGG CACGCCGATG CTCGCCCTGA CGAGGCACGC ATGA
 
Protein sequence
MNFDTVLIAN RGAIATRIIR TLRRMGLRSV AVYSEADKDS LHVVLADEAI CIGAARAAES 
YLNIPAILDA ARRTGAGAIH PGYGFLAENV EFAEACEKEG IVFIGPTPNN IRTFGLKHSA
RALAAAHGVP LAPGTDLLTD ETEAVQAANG IGYPVMLKAT AGGGGIGMRV CEDEADVREG
FSAVARQGLG NFGDAGVFLE RYIRQARHIE VQIFGDGRGR IVALGERDCS LQRRNQKVVE
EAPAPLLPPA VRSELIAAAI RLGQAAGYRS AGTVEFLYDA EREEFFFLEM NTRLQVEHGV
TEEVMGVDLV EWMVRGGGGD FGFLDDDPPR PSGHSIQVRL YAEDPALDYR PTSGTLTAVT
FPEGVRTETW CMAGTTVSTW YDPMLAKLIV HAESREAAVA AMQDALDRSR IDGFETNLRW
LRDVVRSPAF TSGEVSTRAL SHVAHVPRSI TVVSGGTATM AQDWPGRQRL WAVGVPPSGP
MDDLSFRLGN RLLGNPEGTA GLEVAITGPT LTFNTAARVC VTGADFGARL DGQPVPRGMA
IDIAAGQTLA LGRASGGGMR GYILFAGGLD IAPYLGSRST FELGQFGGHA ARRLLAGDTL
HLGDEPAQPA LPAANLPELS NEWALRVMYG PHGAPDFFTR EDIDTLVAAE WQVHYNSNRT
GIRLVGPKPQ WAREDGGEAG LHPSNIHDNP YAIGAVDFTG DMPIILGPDG PSLGGFVCPF
VVIAADRWKI GQLTPGDKLR FVPVNCADAA AANDQQRRFL ETGKPAQGCP GRPMETLSPI
LAVIDESPRS PRTVYRQQGD RNILVEYGPI VLDIELRIRV QALMTELERL ALPGVIDIVP
GIRSLQFHFD GDAMTQEAAL SVLIAAEERL GDLEDFTIPS RIVHLPLSWR DPATIETIEK
YMGAVRDDAP WCPDNIEFIR RINGLPDVAA VENLIFEANY LVLGLGDVYL GAPVATPVDP
RHRLVTTKYN PARTWTPPNV VGIGGAYMCI YGMEGPGGYQ LFGRTIQVWN THRQTDAFID
GKPWLLRFFD QIRFYPVSAE ELEEWRRDFP AGRRSIRIEP SEFRLADYRR YLADNAEGIA
EFEARRQAAF DEERAEWQRR GEFDRTDLVE PEAAEAGTVE VPDGADLVEA PFGGSVWKML
VSVGDEVEAG ETIAIIEAMK MECRVESPGA GTVAALYAQE RQSVQPGTPM LALTRHA