Gene Gobs_4036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4036 
Symbol 
ID8755724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4233563 
End bp4235191 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content79% 
IMG OID 
ProductDAK2 domain fusion protein YloV 
Protein accessionYP_003410972 
Protein GI284992418 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGCCGG CGCTCGACGA CGCCGCGGTC GGCCAGTGGT GCCGGGCCGC GGTCGCCGGG 
CTGTCCGCAG CGCGGGGCCG CCTCGACGAC CTCAACGTCT TCCCCGTGCC CGACGGCGAC
ACCGGCACCA ACCTGCTGGC CACCGCCGAG GCCGCGCTGG CCACGCTCGA CGAGGCCGGC
CCCGACCGGG CCGAGCCGGC CTGGGCGCTC GTGGCCCGCG GAGCCGTGCT GGGCGCCCGC
GGCAACTCCG GCACCATCCT CGCCCAGCTG TGGCGCGGGC TGGCCGACCA GCTGGCGGGC
CAGCCCCCGG CCGACGGGCC CACCCTCGCC GCCGCACTGC AGAAGGCCGC TGACAGCGCC
TACGGCGCCG TCGCCGACCC GGAGGAGGGG ACGTTCCTCA CGGTGGCGCG GGCCGGCGGT
GAGGCGGCGG TCGCCGCGGT CGCCGGCGGG CACACCGCCC TGGGCGAGGT CGTGCGGGCC
GCGGCCGACG GCGCCCGTGC CGCCCTCGAG GCGACGCCGG GACAGCTGGC CGCGCTGCGC
GACGCCGGCG TGGTCGACGC CGGGGGAGCG GGGCTGTGCC TGGTCCTCGA CGCCCTGGTC
ACCACCGTGA CCGGTGTCGA GCCCGACCGC CCGCCGCTGG TCCGCCGGGC CGAGCGCGGC
CTCCACGCCG GGCACCACCA CGGACACGAC TCCGGTGACC TGCCCCACCA GCCGCCCGCC
GGCCCGGGCA GCGAGGTGCA GTACCTGCTC GCCGACAGCG ACGAGGCCGC CGTGGCCCAG
CTGCAGGACC GGCTGGCCGC CCTGGGCGAC AGCCTGGTGG TCGTCGGCGT CGACACACCC
GGCGGGCGCG AGTGGAACGT GCACGTGCAC GTCAGCGACG TCGGCGCGGC CATCGAGGCC
GGCATCGAGG CCGGCCGGCC GTACCGCATC TCGGTGACCC CGCTGGCCCC GGTCCGGGCG
CCGGCGCCGG ACCCCGGGGC GCGTGCGGTC GTCGCGATCG TCCCCGACGG CGGGCTCGCC
GAGCTCTTCA CCGACGAGGG CGCCACCGTC GTCCCCTGCG GCCCGGGCGG CGTGGCCGAG
GACGACGTGC TCGCCGCGGT CCTGGGGTCC GGCGCGGCGG GCGTCGTCGT GCTGCCCAAC
GACCCGGCGT TCACCGCCCT GGCCTCCCGC GCCGCCGAGC GCGCCCGCGA GGAGGGGCGC
GACGTCGCCG TCGTCCCCAC CCGCTCGCCG GTGCAGGGCC TCGCGGCGCT CGCCGTCGCC
GACCCCTCCC GGCGCTTCGG TGACGACATC GTCACCATGG CCGAGGCGGC CGCGGCCACC
CGCTGGGCCG AGGTCACCGT CGCCGAGCAC GAGGCGCTGA CCAGCGCCGG CCGGTGCGCG
CCCGGCGACG TGCTGGGCTC GGCGGAGGGC GACGTCCTGC TCATCGGCGG GGAGCCGGCC
GCGGTCGCCT GCGAGCTGCT CGACCGCATG CTGTCCGCCG GCGGGGAGCT GGTCACCGTC
GTCGCCGGCT CCGACACCGA CCTCGCCGAC GTGGTCTGCA CGCACCTGGC GGCCGTGCAC
CCGACCGTCG AGGTGACCCG CTACGACGGC GCACCCGAGG GGGTCCGGCT GCAGGTGGGG
GTGGAGTAG
 
Protein sequence
MLPALDDAAV GQWCRAAVAG LSAARGRLDD LNVFPVPDGD TGTNLLATAE AALATLDEAG 
PDRAEPAWAL VARGAVLGAR GNSGTILAQL WRGLADQLAG QPPADGPTLA AALQKAADSA
YGAVADPEEG TFLTVARAGG EAAVAAVAGG HTALGEVVRA AADGARAALE ATPGQLAALR
DAGVVDAGGA GLCLVLDALV TTVTGVEPDR PPLVRRAERG LHAGHHHGHD SGDLPHQPPA
GPGSEVQYLL ADSDEAAVAQ LQDRLAALGD SLVVVGVDTP GGREWNVHVH VSDVGAAIEA
GIEAGRPYRI SVTPLAPVRA PAPDPGARAV VAIVPDGGLA ELFTDEGATV VPCGPGGVAE
DDVLAAVLGS GAAGVVVLPN DPAFTALASR AAERAREEGR DVAVVPTRSP VQGLAALAVA
DPSRRFGDDI VTMAEAAAAT RWAEVTVAEH EALTSAGRCA PGDVLGSAEG DVLLIGGEPA
AVACELLDRM LSAGGELVTV VAGSDTDLAD VVCTHLAAVH PTVEVTRYDG APEGVRLQVG
VE