Gene Gobs_3738 details


Gene Information

Locus tag: Gobs_3738
Symbol:
ID: 8755423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 3904713
End bp: 3907820
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: type III restriction protein res subunit
Protein accession: YP_003410688
Protein GI: 284992134
COG category:
COG ID:
TIGRFAM ID:
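
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon (neither convention is stated in the record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3904713, 3907820)
print(length)                     # 3108
print(protein_length_aa(length))  # 1035
```

Under those assumptions the coordinates reproduce the listed 3108 bp and 1035 aa.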


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGGGGACA CGCCCACGGG GCTGTACGAG CACCTGGTCA CCGATCGGCT CAACAAGGAC 
CTGCCCGCGG CCGCTCCGGA CCTTATCCAG CTCGGCACGC TCGACCCGGC CGATGCGCAC
GAGCCGCTGA CCCGCCACAT CGCCGTACTC GTCAGCCGGG CCTTGCGGGC GGCCGGGGGG
AGCGACGCAA CGGCCATCAA TCGGCAGGTC GAGCTAGCCA ACCGCCTCGT TGCCGCCATC
GGCGACCTGG CGCCGGATGT CGCCGACATG GACGACGCCG TCACCGCCCC GGCCCGAACC
CTTCTGGCGA TCGCTCCGCC AAGTGGCATC CCCGGTCCCA CGTCATTCCC CGAGCGCCCC
GCAGTCCCGC TGTCGACGAG CGCCTTGCTC GTCAACGGCC GCGGACAACC GCGGATCGGC
TACGAGGTGA CCCGCGAGAT GGCCTCGGCT GACCGGGTGG ACCTGCTGTG TGCCTTCATC
AAGTGGCAGG GCCTACGGGT CATCGAGAAG CAGCTCACCG AGCTGCGCGA GCGCGGCGGC
CGGCTGCGTG TCATCACCAC CACGTACATG GGCGCCACTG ACCAGCGCGC ACTCGACCGC
CTGGTCGAGC TGGGCGCAGA GGTCAAGGTC TCCTACGAGA CCCGGACGAC GCGGCTGCAC
GCCAAGGCTT GGCTCTTCCA CCGCGTCACC GGCCTGTCGA CGGCATACGT CGGCTCGTCG
AACCTCTCCC GCACCGCCCT CACCGACGGT CTCGAGTGGA ACGTGCGGTT GTCCAGCGTC
GAGCAGGCGC ACCTGCTCAC CACGTTCGCC GAGACCTTCA ACAGCTACTG GCTCGACCCG
TCGTTCGAGA CCTACGAACC GCAGCGTGAC GGCGACCGGC TCCGCGAGGC GCTGGCCGCG
GAGCGCGGCG GCCCGTCCGA CCTGCCCATC GAGATCACGA CGCTCGACGT GCGCCCCTTC
GGCTACCAGC AGGAGATCCT CGAGGCACTC GACGCCGAGC GGACCGTCCA CGGCCGGTGG
CGCAACCTCG TGGTGATGGC GACCGGCACT GGCAAGACGG TCGTTTCCGC GCTGGACTTT
CGGCGGCTGC GCGAGAGCGG CATGGTCGAC CGGCTCCTCT TCGTCGCCCA CCGCGAGGAG
CTGCTGACGC AGAGCCACTC TACGCTCAGG CACGTGCTGC GCCAGGGTGA CTTTGGCGAG
CTCTTCGTCG GCGGGCAGCG TCCTCGCGAG TGGCGGCACG TGTTCGGCTC CGTGCAGTCG
CTCAACCAGC TCGACCTCGG CGAGCTCGAC CCCAGTCACT TTGACGTCGT CATCGTCGAC
GAGTTCCACC ACGCCGAGGC GAGCACCTAC CGGAGACTGC TCGAGCACCT CAAGCCGACC
GTCCTCCTCG GCCTGACGGC GACGCCGGAA CGAACGGACG GCGCCGACGT GCGCACCTGG
TTCGGCGGAC GCACCGCTGT CGAGCTGCGA CTGTGGGAGG CACTCGAGCA GAACCTCCTC
GCGCCGTTTC AGTACTTCGG CGTCCACGAC GACGTGGCGC TCGACCGGCT GCGGTGGAAG
CGCGGCCGCG GCTACGACGT CACCGAGCTG AGCAACGTCT ACACCGGCGA CGACCACCGC
GTCCGGCTCG TCCTGCAGGC GGTCAAGGAC AAGGTCGAGG ACCCCGGCCG GATGCGGGCG
CTGGGCTTCT GCGTCAGCAT TCAGCACGCG CAGTACATGG CCGACCGGTT CACCAGGGCC
GGCATCCCGA GCCGGGCTGT GACGTCGACG TCGTCTCGTG AGGAGCGGGC GGCGGCACTG
GCCGCCCTGC GCGACCGGGC GGTGAACGTC CTCTTCACCG TCGATCTGTT CAACGAGGGC
CTCGACATCC CGACCGTCGA CACCGTGCTG TTCCTACGGC CGACCGAGAG CGCGACGGTG
TTCCTCCAGC AGTTGGGCCG CGGCCTGCGG CTGGCCGAGG ACAAGGCCTG CCTGACCGTC
CTCGACTTCA TCGGCGCCCA GCACAAGAAG TTCCGCTTCG ACCTGCGCTT TCGTGCGCTG
ACCGGCAGCT CTCGACGCGG GCTACAGCGC GACGTCGAGC AGGGCTTCCC CACCCTGCCG
GCGGGCTGCC ACATCGAGTT GGACCGGGTG GCGCAGCGGA TCGTCCTCGA CAACATCAAG
CAGTCGCTGA GCGTGCCGTG GCAAGAACTG GTCAGCGAGG CGAAGCGCGA GCAGTCACCG
TCGCTGTCCG AGTTCCTCGA GGAGACGGGC GTCGAGCTCG AGGACCTGTA CCGCGGCAAC
GGTCGCAGTT GGTTGGACCT CAAGCGGGCC GCCGGCTGGG TCGACGACGC GCCAGGACCG
GACGACTCCG CGCTGGGTGC CGCGCTGGGC CGCATGCTGC ACGTCGACGA CCTGGAGCGG
TTGCGGTTCT TCGGGTCAAT TACGCGGTTG GACACCGTGG CGCAGACCAG CGTCCGCATA
CGGCGGCTGG CCGCGATGCT GCACTTCTCG CTGTTCGGCT CACAGAGGCC GTTCAGCAGC
GGCGAGGCGT CGCTCGCCCG ACTGTTGGCC CACCGGGGAC GCGCGGAAGA GCTCGTCGAG
TTGAGCGACG TCCTGCACGG GCGAATTCAC CGGGTGGCCC GACCGTTGGC TGAAGCCGGC
GACCGTCCAC TGCACGTGCA CGCCCGCTAT AGCCAGAACG AGGCTCTGGC GGCCTTCGGC
AAGACGAATT TGAGCGGCTC ATTCGGTGCG GGCGTGCACT GGGTGGAGGA GGACCAAGCC
GACGTCTTCT TTGTGACCTT AAAGAAGACT GAGACGCATT ACTCGCCCAC GACGATGTAC
GCGGACCATG CCATCTCTCC AACCATGTTC CAGTGGGAGT CTCAGAACAC GACCTCTGAG
CGCACCGCGG TAGGCAAGCG ATACATCCAC CACCGAGACA TGAGCACGTC CGTGCACCTG
TTTGTCCGCG AGACCAAGAC GGCGGATGGA ACACTCGGGA CGCCGCCGTA CCTCTACGCC
GGGCCAATGT CTTACGTCTC CCACACCGGC GAACGCCCCA TGCGGATCCT CTGGCGACTC
GAACACGCAC TCCCGGCCGA TGTCTTCCAC GCCGCTAGAG TCGCGTGA
 
Protein sequence
MGDTPTGLYE HLVTDRLNKD LPAAAPDLIQ LGTLDPADAH EPLTRHIAVL VSRALRAAGG 
SDATAINRQV ELANRLVAAI GDLAPDVADM DDAVTAPART LLAIAPPSGI PGPTSFPERP
AVPLSTSALL VNGRGQPRIG YEVTREMASA DRVDLLCAFI KWQGLRVIEK QLTELRERGG
RLRVITTTYM GATDQRALDR LVELGAEVKV SYETRTTRLH AKAWLFHRVT GLSTAYVGSS
NLSRTALTDG LEWNVRLSSV EQAHLLTTFA ETFNSYWLDP SFETYEPQRD GDRLREALAA
ERGGPSDLPI EITTLDVRPF GYQQEILEAL DAERTVHGRW RNLVVMATGT GKTVVSALDF
RRLRESGMVD RLLFVAHREE LLTQSHSTLR HVLRQGDFGE LFVGGQRPRE WRHVFGSVQS
LNQLDLGELD PSHFDVVIVD EFHHAEASTY RRLLEHLKPT VLLGLTATPE RTDGADVRTW
FGGRTAVELR LWEALEQNLL APFQYFGVHD DVALDRLRWK RGRGYDVTEL SNVYTGDDHR
VRLVLQAVKD KVEDPGRMRA LGFCVSIQHA QYMADRFTRA GIPSRAVTST SSREERAAAL
AALRDRAVNV LFTVDLFNEG LDIPTVDTVL FLRPTESATV FLQQLGRGLR LAEDKACLTV
LDFIGAQHKK FRFDLRFRAL TGSSRRGLQR DVEQGFPTLP AGCHIELDRV AQRIVLDNIK
QSLSVPWQEL VSEAKREQSP SLSEFLEETG VELEDLYRGN GRSWLDLKRA AGWVDDAPGP
DDSALGAALG RMLHVDDLER LRFFGSITRL DTVAQTSVRI RRLAAMLHFS LFGSQRPFSS
GEASLARLLA HRGRAEELVE LSDVLHGRIH RVARPLAEAG DRPLHVHARY SQNEALAAFG
KTNLSGSFGA GVHWVEEDQA DVFFVTLKKT ETHYSPTTMY ADHAISPTMF QWESQNTTSE
RTAVGKRYIH HRDMSTSVHL FVRETKTADG TLGTPPYLYA GPMSYVSHTG ERPMRILWRL
EHALPADVFH AARVA
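
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the layout could be stripped and a GC fraction computed. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two 10-base blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGGGGGACA CGCCCACGGG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.75
```

Applied to the full gene sequence, the same functions give a value to compare against the listed GC content and gene length.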