Gene Gobs_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2022 
Symbol 
ID8753693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2095562 
End bp2098900 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content75% 
IMG OID 
ProductTrwC relaxase 
Protein accessionYP_003409082 
Protein GI284990528 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.715735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGGCG GAATGAAGGT CTACGCCGGT CCGCCGGCCG CCGCTCGGCA GTACCTGGAG 
GCCGACCGCG GCCGGGCCGA TGACTACTAC CTGGCCGAGG GCACCGGTCT GGCCCGCCGC
TTCACCGCCC GCGACGGGCG GGTGCAGGAA TGTGCGGCGC TGATCGGGGA GACCTACGAG
GCGTGGGTGG CCGGCCGGGA TCCCTACACC GGCACACCGC GGGGGCGGCT GCGCACCGAT
GACCGTGCGG TGCGGTTCGT CGAGGTCGTG GTGAACGGCC CGAAGTCCTG GTCACTGGCA
GCCGCGGTGC ACCCGGACAT CGCCGCGGCC TACGACGCGG CGCAGGACCG CGCCGCCGCC
GAGATCGTCG GCTGGCTCGC CGGGCACGCG ACCACCCGGG TGGGTCCGCG CGGCGGGCAG
GTGCAGGTGC CCATCGAGGT GCTGGAGGCG GTGACGGTGC GGCACCACAC CTCCCGGGCT
GGGGATCCGC ACCGGCATCT GCACCTGCAG ATCGGCGCCC GGGTGTTCGC CGCCGGGAAG
TGGCGCGGAC TGCACACCGG CGGCGTGCGG GACTTCCTGA CTGCCATCAA CGGAATCGGG
CACGCCGCGG TCGCCTGTGA CCCACAATTC CGGGCCGTGC TGGCCGCCCA CGGCTACACG
CTCGACGCGA CTGGAGAAAT ACTGGAGCTT GCCGAGTACA TCGGACCGTT CAGCGCCCGG
GCGGCGCAGA TCGGGCGAAA TCTGGATCGC TACGAACGCC AGTGGACCAA GGCGCACCCC
GGCGAGCAGC CAGGCCCGGC TCTGCGGCGG GCGTGGGACA CCCGGGCGTG GGCTGACGGC
CGCCCGGACA AGGTCACGCC GCGCTCGGGC ACCGACCTGA TCTCGCGTTG GCGAGCCGAA
CTAGCCGCCC TCGGCTTCCG CGACCCGGAC CGGCCGGTCG ATCTCCCACC GACCCCGGTC
GGCGCGCTGG ACCGCGACCG CGCCGTCGCC CAGATCCTCA CCCGGTTGGC CGCCGGTCGC
TCGGCCTGGA ACCCCGCCGA CATCCGTGGC GAGGCCGAGC GGCTCATCGC CGGTCAGGGC
ATCGTGGTCG ACACTGCGGT ACGTGTCGAG CTGGCCGAGG AACTCACCGC CCGTGCCCTA
GGAGAATGTG TCCCGCTTCT GCAGCGTGAG GGGGTGCCCG AGCACGTCCG GGCCTGGACC
TCGCCGGCGG TGCTCGAGGT CGAGGCCGAC CTGGTCGCCC GGCTCGCCGC CCGCAGTACC
CACCACGGTC GAGATGTCGC CGTCGGACCG GCCCTAGTGG CCGGCCTGGA GCGGCTGGAT
GGTGGGCAGG CCGCCACCGC CGAGGCGCTC GCCGGGAGCC GGCCGCTGGT CGTGGTGGAG
GGCGCGGCCG GCGCCGGCAA GACCACCACC CTGGCCGCCG CCCGCCGCCT TCTTGCTCGG
CAGGGGCGCG GGCTGCTGGT GGTCACCCCG ACGCTCAAGG CGGCCCGGGT CGCCTCCGCC
GAGGTCGGCA CCGCGGCCGG GTCGGCGGCC TGGCTGGCCT TCCAGCACGG CTGGCGCTGG
AACGACGACG GCGCCTGGAC CCGGCTGGTC GCCGGGCAGG CCGACCCGGC CACCGGCGCC
GTCTACGCCG GGCCGGCCGA GGAAGCGCGG CTGCTCCCCG GGGATCTCCT CGTGGTTGAT
GAAGCGGGCA TGCTCGACCA GGACACCGCC CGCGCCCTGC TCACCATCGC TGACGAGTAC
GAGGTCCGCG TCGCGCTGCT CGGTGACCGC CACCAGCTCG CGGCGGTCGG CCGCGGCGGC
GTCCTCGACC TCACCGTCCG CGCCGCCGAC CCGGCCGCCC GCCCGACCCT CGACACGGTG
CACAGGTTCA TACACAGGGA CGATGCCGGA CGCACGATCC CCGACACCGA CTACGCCGAG
CTGACCCTGG CCATGCGCCG CGGCCAGGAC GCGGGGGCGG TGTTCGACGC GCTGCACGCC
CGCGGCCAGG TCCGCCTCCA CCCGGACACC GCCGCCCGGC TCGAGACGCT GGCCGCACTC
GCCGCCTCCT GTGTGCATGA GGAGGCGGTC GCCGTGGTGG TGGATACCCG CGAGCAGGCC
GCCGAGCTCA ACGCCGCCAT CCGGGACCGG CTGGTCGCCG AGGGACGGGT CGACGACGGC
CAGGCGGTCG CCACCCGGGC CGGGCAGCGG ATCAGTGCCG GCGACCGCAT CGCCACGCGC
CGCAACGACC GGGCATTGGG GGTGGCCAAC CGCGACCTCT GGACCGTCAC CGCGGTCAGC
CCGCACGGGA ACCTCGCCGT CACCCCAGCG CGCGCTGATG CTTCCAACGT CACCCCGACC
GGCGTCACCC CTGCCGGGTC AGGGGAGCGA GTACTGCCTG CGGACTACGT CGTCTCCCAT
GTCGAGCTGG CCTACGCCTC GACCGCGCAC GGCGTGCAGG GCGATACCGT CGCCGCCGCC
CACGTGGTCG TCGGTGAGCA CACCGGTGCG GCGGCCGCCC ACGTCGGGAT GACCCGCGGC
CGCACGGCCA ACGTCGCGCA CCTCGTCGCC GATGATCTCG ACGAGGCCCG GAAGCAGTGG
ATCACGGTGT TCGACCGCGA CCGCGCCGAC CTCGGCCCCG CCCACTCTGC CGAACTGGCC
GCCACCGAGG CCGCCCGCTA CGTCCCACCC CGCTCGCTGG ACGCCGTCCT TGCCGAGTTG
CACGCGGCGT GGACGGCCGA GCAGCGCTGC CGGGACCGGC TTTCCTTGCT CGAGCCGATG
CGCAAGGAGC TGCGAGAACT GATCACGCTC CAGCCGCACT CGGTCGAGCA GCTGGCCGGT
CTCACCGGCA CCCACCGGCA GGCAGCTTGG GCCGCGGACC AAGCCACGCA TCGAGTCGAG
GCCAGCGACG CGTCCGTCGC TGCCGAGGCC GACCGACTAC GTGACGTCCT GCCGCCGCCT
GGACGGCGAG CGTGCCGCCG TCCAGGCGGC GGCGCGAGTG GTGCTCGACG GGCCCGGCCG
GCTCGGGCTC CGGCGGGGCG CCGTCGTCCG GGCCGGCGAG CAGCTCGCCG ACTGGGCCGA
CCGCTGGCGC TCCCACCTCC CGCAGCTGCC CACCGACCCG GGCGCGCTGG CCCGGGTCGC
CGGCCGGATC CACGACCAAC CCGCGTTGTG GTCCGCGCTG GACGCCTCCG CTCACCGCAC
CGCCGAACGC GCCCATCCCG AGCACGCTGC GCTGCGCGCG GCCGCCGACG CTGCCCGGCA
CGCCCGCGAG CAGGCCTGGC AGGTGCTGGC CGAGGCTCGC CGCCAGCACG GCGAGCGGCT
CACCCGCTTC GGAGCCCTGG CCTGGACCCC CGATCCTGA
 
Protein sequence
MHGGMKVYAG PPAAARQYLE ADRGRADDYY LAEGTGLARR FTARDGRVQE CAALIGETYE 
AWVAGRDPYT GTPRGRLRTD DRAVRFVEVV VNGPKSWSLA AAVHPDIAAA YDAAQDRAAA
EIVGWLAGHA TTRVGPRGGQ VQVPIEVLEA VTVRHHTSRA GDPHRHLHLQ IGARVFAAGK
WRGLHTGGVR DFLTAINGIG HAAVACDPQF RAVLAAHGYT LDATGEILEL AEYIGPFSAR
AAQIGRNLDR YERQWTKAHP GEQPGPALRR AWDTRAWADG RPDKVTPRSG TDLISRWRAE
LAALGFRDPD RPVDLPPTPV GALDRDRAVA QILTRLAAGR SAWNPADIRG EAERLIAGQG
IVVDTAVRVE LAEELTARAL GECVPLLQRE GVPEHVRAWT SPAVLEVEAD LVARLAARST
HHGRDVAVGP ALVAGLERLD GGQAATAEAL AGSRPLVVVE GAAGAGKTTT LAAARRLLAR
QGRGLLVVTP TLKAARVASA EVGTAAGSAA WLAFQHGWRW NDDGAWTRLV AGQADPATGA
VYAGPAEEAR LLPGDLLVVD EAGMLDQDTA RALLTIADEY EVRVALLGDR HQLAAVGRGG
VLDLTVRAAD PAARPTLDTV HRFIHRDDAG RTIPDTDYAE LTLAMRRGQD AGAVFDALHA
RGQVRLHPDT AARLETLAAL AASCVHEEAV AVVVDTREQA AELNAAIRDR LVAEGRVDDG
QAVATRAGQR ISAGDRIATR RNDRALGVAN RDLWTVTAVS PHGNLAVTPA RADASNVTPT
GVTPAGSGER VLPADYVVSH VELAYASTAH GVQGDTVAAA HVVVGEHTGA AAAHVGMTRG
RTANVAHLVA DDLDEARKQW ITVFDRDRAD LGPAHSAELA ATEAARYVPP RSLDAVLAEL
HAAWTAEQRC RDRLSLLEPM RKELRELITL QPHSVEQLAG LTGTHRQAAW AADQATHRVE
ASDASVAAEA DRLRDVLPPP GRRACRRPGG GASGARRARP ARAPAGRRRP GRRAARRLGR
PLALPPPAAA HRPGRAGPGR RPDPRPTRVV VRAGRLRSPH RRTRPSRARC AARGRRRCPA
RPRAGLAGAG RGSPPARRAA HPLRSPGLDP RS