Gene Gobs_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2054 
Symbol 
ID8753725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2133149 
End bp2134270 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content77% 
IMG OID 
ProductGlucose-6-phosphate dehydrogenase subunit 
Protein accessionYP_003409113 
Protein GI284990559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC TGTGGGACAC CACCGGCTCC GCCGTGGTCA AGGAGCTCGC CGCGCAGCGC 
CGCACCGGCG GCGCGGTGCT CTCCGGCGTC GCGCTGACCC TGGTGGTCGT GGCCGACGAG
AACCGGGTGG CCGAGGCCGA GGAGGCCGCG ACGGCCGCCG CCGAGCAGCA CCCCTGCCGG
CTGCTGATCG TCGTCCGCCG GCAGATCGAG GCGCCGTCGC CGCGGCTGGA CGCCGAGGTG
CTGATCGGCG GGCGGCTGGG CCCCGGCGAG GCCGTGGTCA TGCGCATGTA CGGCCGGCTG
GCGCTGCACG CCGAGTCGGT GGTGCTGCCG CTGCTGGCCG CCGACGCCCC CGTGGTGACC
TGGTGGCACG GCACGCCGCC GGAGCAGCTG TCCACCGACG CGCTGGCGGT GATCGCCGAC
CGGCGGATCA CCGACAGCTC GCTGGCCCAG GACCAGCTGG CCGCGCTGCG CGCCCGTGCC
GAGGACTACG CGCCGGGTGA CACCGACCTG GCCTGGACCC GCAGCACCCC GTGGCGCGCC
ACGCTGACCT CGACGGTGGA CTCGGTGTCC GGGCGCCGCG GCGACGCCGT CCAGGTGCTG
GGCGGCCGGG TCGAGGGGGA TCCGGGGAAC GCCACCGCGC TGCTGCTGGC CGGCTGGCTG
AGCGCGCGCT GCGGGCGCGA GATCCCGGTG GTGTCCGGCG AGCGGCGGGC CGGGCCCAGC
GGGGTCTCGG CAGTGGTGCT GGAGCTGGAC CAGGACGAGC AGGTCCGGGT GGCGGCCGAC
CGGCGCGGCG GGGCGGTCAT CCACCAGCCC TACCGGCCGG AGGCGACGGT TGCCCTGCCC
GACCGGCAGC TCGGCGACCT GGTCGGCGAG GAGCTGCGTC GCCTCGACCC CGACGAGCCC
TTCGCGGAAG CGCTGGAGAG CACCACCGGC GTGCCGGACC TGGCATACCG CTCCGCCGTG
CGCGAGTACG TGTGGTTCGA CCCGGCCGAG GAGAACGCCG GCAGCGGCGG GGAGACGCTC
CCTGCCGTCC CGCCGGACTC GGAGGGCACG GACGAGGTCG CCCAGCAGGG CGTGGACGAC
GCCGACCAGG GCGCCCGCGA CGCCGGGCAG GCCGTTCCGT GA
 
Protein sequence
MTTLWDTTGS AVVKELAAQR RTGGAVLSGV ALTLVVVADE NRVAEAEEAA TAAAEQHPCR 
LLIVVRRQIE APSPRLDAEV LIGGRLGPGE AVVMRMYGRL ALHAESVVLP LLAADAPVVT
WWHGTPPEQL STDALAVIAD RRITDSSLAQ DQLAALRARA EDYAPGDTDL AWTRSTPWRA
TLTSTVDSVS GRRGDAVQVL GGRVEGDPGN ATALLLAGWL SARCGREIPV VSGERRAGPS
GVSAVVLELD QDEQVRVAAD RRGGAVIHQP YRPEATVALP DRQLGDLVGE ELRRLDPDEP
FAEALESTTG VPDLAYRSAV REYVWFDPAE ENAGSGGETL PAVPPDSEGT DEVAQQGVDD
ADQGARDAGQ AVP