Gene Gobs_1007 details

Gene Information

Locus tag: Gobs_1007
Symbol:
ID: 8752668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 1065485
End bp: 1067044
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003408139
Protein GI: 284989585
COG category:
COG ID:
TIGRFAM ID:
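
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1065485
end_bp = 1067044
reported_gene_length = 1560      # bp
reported_protein_length = 519    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, not counting the stop codon."""
    return n_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions both comparisons hold: 1067044 - 1065485 + 1 gives 1560 bp, and 1560 / 3 gives 520 codons, one of which is the stop codon.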


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.299981
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCGTG AGATCCCACA GGTCGTCCGT GCGGTGGTCG GCCTGGCCGC CACCGTCCTC 
GACGACACCC TCCAGCTGCC CCGGACGCTG CCGGGCCTGC CCGTGCGGGT GCTCGGGCTG
GCCATGCAGG CCACGATGAA GCTGCAGCAG CACTACTCCG GTCTGGTCGC CCGCGGCGAC
GAGGTCTTCA CCGGCCTGCG CGGCGAGGCC GAGCCGGGGC TGGCGACCTT CGACGAGGAC
ATGCCCGGGC CGGCCGCCGG CCGCAGCTCC GCCTTCGACC GCGCCCCCGG GTTCGACCAG
CCCACCTCCT TCGACCGCGC CTCCGCGCCG GTCGCCGACG ACGAGGAGAT GGCCGAGGAG
GTCGCCGCGC TGATCGACGA CGAGGTCTCC GGCCTGCCCG CCGACCCGGC GCCCGAGGAG
GTCGTCGAGG CCCTCGCCGA CATCAGCGAC GAGGTGGCCG CCGCGGGGCT CGGCCTCGAC
CGCACCGAGC CCGGTCTCTC CACGGAGGAC GCGCTGGAGA CCGCACTGCT CGAGGCCGAC
GGCGCCCCCG ACAGCGCCCT GGTCGACGGC GGCACCGGGG ACACCGGGAC CGTCGACACC
GACGAGATCG CCGAGAGCCC GACCACCCCG CCGGCGTCCC TGCTGGACGC CGACGCCACC
GAGTTCGACG TGCACGGGTC CGCGGCGACC GACGCACCGG CGCCCAGCGA CCCGGGCGGC
TCGCCGGTCG GTGAGGTGTC CGGCGCCCCC GACGTGACCA CCGACGTCGA CGTCCTCACC
CCGGACGGCG GTGTCGCGAC CGTCGAGGGA ACCGTGACCG ACGAGGGCGT CGCCGCACCC
GAGGCGCCGA CCGACGGGGA CGCCGCCGCG CCCCAGGCGC CGACCGACGA GGGAGCCGGC
GCACCCGGGG CGTCCGCCGA CGAGACCCCC GGCGCCCCGG CCCAGGCCGA CGAGCAGGAC
ACCGGTGTCG ACGAGGCGCT CGGCAGCGAG GCCAGCGACG CCGACACCCG GCCCTCCACC
GAGGGCACGC CCGCCGGGGA CGACCGGGAC GCCACCGGCA GCGGCGACGA GGTCGTCACC
GCCGCGGGCG CGCAGGTGGA CGACGCGATC GGCACGGGTG AGGACACGTC CGCACCGACC
GCGACGGACG ACGCCGGCAC CACCGACGAG GGCGACACCG GCGAAGGCGC CACCGAGGCG
GCCGGCACCG GCGAGGGCGG CACCGCTGAC GTGGTCGCCA CCGACGAGGG CGGCACCGAC
GTGGCCGCCG CGACCGGCGT CCGCACCGAC GTGGACGAGT CGGCCACCGC CGAGGACGCC
GCGGACACCA CCGGCGCCGT GGCGGCGGGC TCCGCGCCGG TCGAGGGCTA CGACAGCTTC
ACCGTCGCCC AGCTGCGTGG TCGGCTGCGC GGTTACCAGC TCGCCACCGT GGCCGACCTG
GTGGCCTACG AGGAGGCCAC CCGGGCCCGC GAGCCGTACC TGCGGATGCT GCGCAACCGG
CTGGAGAAGT TGGAGCGCCA GGCGGTCGAG GACAGCCCGC TGGCCCCGCG CGGCGCCTGA
 
Protein sequence
MSREIPQVVR AVVGLAATVL DDTLQLPRTL PGLPVRVLGL AMQATMKLQQ HYSGLVARGD 
EVFTGLRGEA EPGLATFDED MPGPAAGRSS AFDRAPGFDQ PTSFDRASAP VADDEEMAEE
VAALIDDEVS GLPADPAPEE VVEALADISD EVAAAGLGLD RTEPGLSTED ALETALLEAD
GAPDSALVDG GTGDTGTVDT DEIAESPTTP PASLLDADAT EFDVHGSAAT DAPAPSDPGG
SPVGEVSGAP DVTTDVDVLT PDGGVATVEG TVTDEGVAAP EAPTDGDAAA PQAPTDEGAG
APGASADETP GAPAQADEQD TGVDEALGSE ASDADTRPST EGTPAGDDRD ATGSGDEVVT
AAGAQVDDAI GTGEDTSAPT ATDDAGTTDE GDTGEGATEA AGTGEGGTAD VVATDEGGTD
VAAATGVRTD VDESATAEDA ADTTGAVAAG SAPVEGYDSF TVAQLRGRLR GYQLATVADL
VAYEEATRAR EPYLRMLRNR LEKLERQAVE DSPLAPRGA
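
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts). The names `translate`, `gc_fraction` and `CODON_TABLE` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence listed above.
first_blocks = "ATGTCGCGTG AGATCCCACA GGTCGTCCGT"
print(translate(first_blocks))
```

Applied to the first thirty bases, the routine yields MSREIPQVVR, matching the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value that the record rounds and reports as GC content.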