Gene Gobs_3935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3935 
Symbol 
ID8755620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4126170 
End bp4127405 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003410874 
Protein GI284992320 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGTACC TCTTCCGGCC ACCGGTCACC GACGGCGCGG GCCTGCTGAG CCTGCCCTTC 
CACCTGCCCC TCGAGGAGTG GGAGCCCGAG CTGCTGCTCG AGGTCCCGCA GCGGGGCATC
TCGCGGCACG TCGTCCGGTT CACCGCGCAG GGCGGGCACG TCTACGCCCT CAAGGAGATC
CCCGAGCGGC TGGCCCGCCA CGAGTACGCG CTGCTCGGCC AGTTCGAGGA GGAGGGGCTG
CCGTCGGTCT CGGTGCTGGG CATCTGCGTC GACCGACCCG ACGACCAGGA CGCCGTCCTG
GTGACCCGCT ACCTCGAGTA CTCGATGTCC TACCGGTACC TGTTCTCCCG TCCCCACGGG
GAGCACTCCG AGGAGCAGCT GCTCGACACC ATGGTGGTAC TGCTCGTCCG GCTGCACCTG
GCCGGCGTCT TCTGGGGCGA CTGCTCGCTG TCCAACACCC TGTTCCGGCT CGACGCCGGC
GCCTTCACCG CCTACCTGGT GGATGCCGAG ACCTCCGAGC GGTACCCCCA GCTGAGCCCG
GGGCAGCGGC GCTACGACGT CGACCTCGCA CGGGAGCGGG TGGGCGCCGA GCTGCTGGAC
CTGCAGGCCG GCAGCCTGCT GCCGTCGCAC GTGGACCCCA TCGAGGTCGC CGACAGCCTG
CCCGTTCGCT ATGAGGCGCT CTGGGACGAG GTGAACCGCG AGGAGGTCTT CGCGCTGGCC
GAGCAGCGCC AGCGGGTGGC CGAGCGGCTG CAGCGGCTCA ACGACCTGGG GTTCGACGTC
GGCGAGATCG AGCTCGTCAC CGACCCCGAG GGCGGTGCGC GGCTGCGCGT GGAGACCCGC
GTCTCCGAGC CCGGGCAGAA CCGCCGGGAG CTGTTCCGGC TGACCGGCCT GGAGGTGCAG
GAACGCCAGG CCCGCCGGCT GCTCAACGAC CTGCGCGCCT ACCGCGCCAC GCTGGAGCAG
AAGACCGGCG CCCCGGTCCC GGAGACCGTC GCCGGCTACC GCTGGTTGGC CGAGTCCTAC
CAGCCCGTCG TCGAGGCCAT CCCGCCCGAC CTCGCCGGCC GGCTGGCACC GGCCGAGGTC
TTCCACGAGA TCCTCGAGCA CCGTTGGTTC CTGTCCGAGA AGGCCGGCCA CGACGTCGGG
ACGACGAGGG CCGCGCGCTC CTACTTCGAC ACCGTGCTGC CCCGCACGCC CAAGGAGCTG
ACCACCCCCT CGGCCATCGT CGGCAGCCGC GACTGA
 
Protein sequence
MRYLFRPPVT DGAGLLSLPF HLPLEEWEPE LLLEVPQRGI SRHVVRFTAQ GGHVYALKEI 
PERLARHEYA LLGQFEEEGL PSVSVLGICV DRPDDQDAVL VTRYLEYSMS YRYLFSRPHG
EHSEEQLLDT MVVLLVRLHL AGVFWGDCSL SNTLFRLDAG AFTAYLVDAE TSERYPQLSP
GQRRYDVDLA RERVGAELLD LQAGSLLPSH VDPIEVADSL PVRYEALWDE VNREEVFALA
EQRQRVAERL QRLNDLGFDV GEIELVTDPE GGARLRVETR VSEPGQNRRE LFRLTGLEVQ
ERQARRLLND LRAYRATLEQ KTGAPVPETV AGYRWLAESY QPVVEAIPPD LAGRLAPAEV
FHEILEHRWF LSEKAGHDVG TTRAARSYFD TVLPRTPKEL TTPSAIVGSR D