Gene Gobs_1918 details


Gene Information

Locus tag: Gobs_1918
Symbol:
ID: 8753589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 1988194
End bp: 1989399
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003408992
Protein GI: 284990438
COG category:
COG ID:
TIGRFAM ID:
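
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1988194, 1989399)
print(length, protein_length(length))  # prints: 1206 401
```

Under these assumptions the computed values agree with the listed gene length (1206 bp) and protein length (401 aa).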


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0204233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTTCT ACCGGCAGGT GGGCGAGGTC CCACCCAAGC GGCACACCCA GTTCCGTCGT 
CCCGACGGCG GCCTGTACTC CGAGGAGCTG GTCGGTGAGG AGGGCTTCTC CTCGGACTCC
GCTCTGCTCT ACCACCGTGG CGTGCCGTCG GCGATCGTCG ACGCCCGGCC GTGGGAGCTG
CCCGACCAGA GCCTGACGCC GAACGCGCCG CTGGTGCCCC GGCACCTGAA GCTGCACGAC
CTGTTCCCCG GCGAGGAGCA CAAGGCCGTC GACGCGGTGA CCGGCCGCCG GCTGGTGCTC
GGCAACGGCG ACGTGCGCAT CTCCTACGCG GTCTCGTCGT TGCCGAGCCC GTACTACCGC
AACGCCACCG GCGACGAGTG CGTCTACGTC GAGCGCGGCA CCGCCACGGT GGAGACGACG
TTCGGCGCGC TGACCGTCGG CCGGGGCGAC TACGTGGTCA TCCCGCGGAC CACCACGCAC
CGCTGGATCC CGACCGGGTC CGAGCCGCTG CGCACCTACG CGATCGAGGC CAACAGCCAC
ATCGCCCCGC CCAAGCGCTA CCTGTCGAGG TACGGGCAGT TCCTCGAGCA CGCGCCGTAC
TGCGAGCGGG ATCTCCGTGC CCCCGCCGAG CCGCTGCTGG TCGAGGGCAC CGACGTCGAG
GTCTACGTCA AGCACCGCGG CAACGGCCCC GGCGGGCTGG CCGGCACGGT GCACGTGCTC
CCGGAGCACC CGTTCGACGT GGTCGGCTGG GACGGGCACC TCTACCCCTA CGCGTTCAAC
ATCGCCGACT ACGAGCCGAT CACCGGCCGG GTGCACCAGC CCCCGCCGGT CCACCAGGTC
TTCGAGGGTC ACAACTTCGT GATCTGCAAC TTCGTGCCGC GGAAGGTCGA CTACCACCCA
CTGGCCGTCC CGGTGCCCTA TTACCACTCC AACGTCGATT CCGACGAGAT CATGTTCTAC
GTCGACGGCG ACTACGAGGC CCGCAAGGGG TCGGGCATCG GCAAGGGCTC GATCTCGGTG
CACCCCGGCG GGCACTCCCA CGGCCCGCAG CCCGGCGCGG TGGAGCGCTC CCTGGGCGTG
GAGTACTTCG ACGAGCTCGC CGTCATGGTC GACACCTTCC GCCCGCTGGA CCTGGGCGAG
GCCGGCGTCG CCGTCGACGA CGGGAAGTAC GCCTGGACCT GGTCCGGACG AGGCCCGTCG
GCGTGA
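
The GC content reported for this gene could be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in 10-base blocks separated by spaces and line breaks; `clean_sequence` and `gc_content` are hypothetical helper names. Only the first two blocks are used here as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, for demonstration.
first_blocks = clean_sequence("ATGGCGTTCT ACCGGCAGGT")
print(len(first_blocks), gc_content(first_blocks))  # prints: 20 60.0
```

Passing the full 1206 bp sequence through `clean_sequence` and then `gc_content` would give the whole-gene value to compare against the listed figure.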
 
Protein sequence
MAFYRQVGEV PPKRHTQFRR PDGGLYSEEL VGEEGFSSDS ALLYHRGVPS AIVDARPWEL 
PDQSLTPNAP LVPRHLKLHD LFPGEEHKAV DAVTGRRLVL GNGDVRISYA VSSLPSPYYR
NATGDECVYV ERGTATVETT FGALTVGRGD YVVIPRTTTH RWIPTGSEPL RTYAIEANSH
IAPPKRYLSR YGQFLEHAPY CERDLRAPAE PLLVEGTDVE VYVKHRGNGP GGLAGTVHVL
PEHPFDVVGW DGHLYPYAFN IADYEPITGR VHQPPPVHQV FEGHNFVICN FVPRKVDYHP
LAVPVPYYHS NVDSDEIMFY VDGDYEARKG SGIGKGSISV HPGGHSHGPQ PGAVERSLGV
EYFDELAVMV DTFRPLDLGE AGVAVDDGKY AWTWSGRGPS A
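
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon table built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGTTCT ACCGGCAGGT GGGCGAGGTC"))  # prints: MAFYRQVGEV
```

The output matches the first ten residues of the listed protein sequence; translating the full gene sequence the same way would yield a string to compare against all 401 residues.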