Gene Gobs_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3003 
Symbol 
ID8754676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp3140408 
End bp3141535 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content81% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003409984 
Protein GI284991430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.394056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGAGC GGCTCCCGCT GCAGGGCCGC CGGGTGGCCG AGGTCCTGGC CACCAGCACC 
GGTGGGGTGG GCACGCACGT GCGGGCGGTG CTGCCCGTCC TGGTCGCGGC CGGCGCCGAC
GTCCGGGTGT GCGGACCTGC GGCCACCGAG CAGCTGTTCG GCTTCACCGC GGCGGGTGCC
GCCTTCGCGC CCGTCGGGAT CTCGGCCGGC CTCTCGCCCG GCGCCGACGC CCGCGCCGTC
GCCGCGCTGC GCCGCGCCAC CGCCGACGCG GACCTGGTGC ACGCCCACGG CCTGCGCGCC
GGCCTGGTCG CCGCGGCGGC CCGCCGGCTC GGCGACCGCA GCCGGCCCCT GGTGCTCACG
CTGCACAACG CCCTGCCCGA GGGCGGAGGC GCGCTGCGCC GGGTGCTGCG GCTGGCCGAG
CGGGCCACGA TCAGCGGCGC CGACGTCGTC CTCGCCGCCT CCGGTGACCT CGCCGAGAAC
GCCTGGCGGC AGGGTGCGCG GGACGTGCGG GTCGCCCCCG TCTCCGCGCC GCCGCTGCCC
GCCGCGGCCC GGACCGCCGC CGAGGTGCGC GCCGAGCTCG GCCTGGCCGA CGGCCGGCCG
CTGGTCCTCG CCGTCGGGCG GCTGCACCCG CAGAAGGGCT ATGACGTCCT GCTCGACGCC
GCCGCCCGGT GGGCCGGCAG CTCGCCGCCA CCGCTGGTGG CGGTCGCCGG CGACGGCCCG
CTGCAGGACG AGCTCGCCGC CCGGATCGCC GCCGAGCGGC TGCCCGTGGT GCTGCTCGGC
CGGCGCAGCG ACGTCGCCGA CCTGCTGGCC GCCGCCGACC TCGCCGTGCT GCCCTCGCGC
TGGGAGGCCC GCTCGCTGAC CGCACAGGAG GCGCTTCGCG CCGGCACCCC GCTGGTCGCC
ACCCGCACCG GCGGGCTGCC CGAGCTGCTC GGGGACGGCG CGCAGCTGGT GCCCGTGGGC
GACCCCGTCG CGCTGGCCGA CTCGGTCACC GGGTTGCTGG CCGACCCCGC GCGCGCCCGG
CGGCTGGCCG AGGCCGGCAG CCGGCAGGCG GCGACCTGGC CGGACGAGGC CGCCACCGCC
CGCCAGCTGG TCGCCCTCTA CCGCGAACTG CTCGGCGCAC CCCGATGA
 
Protein sequence
MAERLPLQGR RVAEVLATST GGVGTHVRAV LPVLVAAGAD VRVCGPAATE QLFGFTAAGA 
AFAPVGISAG LSPGADARAV AALRRATADA DLVHAHGLRA GLVAAAARRL GDRSRPLVLT
LHNALPEGGG ALRRVLRLAE RATISGADVV LAASGDLAEN AWRQGARDVR VAPVSAPPLP
AAARTAAEVR AELGLADGRP LVLAVGRLHP QKGYDVLLDA AARWAGSSPP PLVAVAGDGP
LQDELAARIA AERLPVVLLG RRSDVADLLA AADLAVLPSR WEARSLTAQE ALRAGTPLVA
TRTGGLPELL GDGAQLVPVG DPVALADSVT GLLADPARAR RLAEAGSRQA ATWPDEAATA
RQLVALYREL LGAPR