Gene Gobs_2031 details

Gene Information

Locus tag: Gobs_2031
Symbol:
ID: 8753702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 2108048
End bp: 2109079
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_003409090
Protein GI: 284990536
COG category:
COG ID:
TIGRFAM ID:
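
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `lengths_from_coords` is hypothetical and not part of any database API.

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1   # inclusive interval
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values copied from the Gene Information fields of this record.
gene_length, protein_length = lengths_from_coords(2108048, 2109079)
print(gene_length, "bp;", protein_length, "aa")
```

With the values from this record, the sketch reproduces the listed Gene Length (1032 bp) and Protein Length (343 aa).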


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.916487
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATCC TAGGCGGGCT GGACATTCAC CGGCGGCAGA TCACCTTTGA CTATCTCGAC 
GAGCGCAGCG GCGAGACCCG GCACGGGCGG ATCGCCCCGG CCGACCGGAT GCTGCTGCGC
GGCTGGCTGC AGAAGCTCCT CGCCGACGCG GGCAAGCCGG CGGCGTTCGC GGTGGAGGGG
TGCACCGGCT GGCGGTTCGT GGTCGAGGAG TTGCAGCGTG CCGGGGTCGA GGCGCATCTG
GCCGAGCCGG CCGACACCTC CGCCGCGCGC GGGCCGAAAC GGCGGGCCAA GACCGACCGG
GCCGACGCCC GGCTGCTGCG CGAGCTGCTC GCCGACGGCC GGCTGCCCGA GTCGTGGATC
CCGCCGGCGC AGGTGCTGGA GATGCGCGCC CGGCTGCAGC TGTTCCGCGA CCTGCGCGAG
CAGCACACCG CCTGGGTGCA GCGCGTCCAC GCCATCCTGC TGCACCACGG CGTCCCGGCG
GTCACCGGCG GCCTGCTCGG CGCCGACAAC CGCCGCCGGC TCGAGGTCGG TGAGGGCCTG
TCACCGGCGG GGCGCGAGGC GGTGGCCGCC GCGCTGCGGA TCCTCGACGC CCTGGATGCC
GAGCTCGATC CGCTGCGCAA GCAGATCACC GCGTTCGCCG CCCGCCAGCC CGGCTGCCGG
GCGCTGCAGG CCGACTACGG CATCGGGCCG ATCACCGCGA CCGCACTGTG GACCGAGCTG
GGCGATTCCC GCCGCTTCTC CGCCTCCCGC AAGGCGGTGC GGCACACCGG GCTGGACATC
ACCGTGCACT CCTCCGACGG CAAGCGCAGC GCCGCGCACC TGTCCCGGCA GGGCTCGCCG
CTGCTGCGCT GGGCGCTGTT CGAGGCCGCC CAGTGCGCCG CCCGCCCCGG CTCTCCCGAC
CACGCCTACT ACCGGCGGGT CGCCGAGCGG GTCGGCGGCA ATCGGGCCGC CCTGTCGGTG
GCCCGCAAGA TGGTTCGCCG GGCGCATCAC ACGCTGCGCG CCCTCGGTGA CCAGGCCCTC
GCGCCGGTCT GA
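
The gene sequence above is laid out in space-separated blocks of ten bases, so any script that reads it needs to strip the whitespace first. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first ten-base block of the sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# First ten-base block of the gene sequence in this record (illustration only).
first_block = "ATGGGGATCC"
print(round(gc_fraction(first_block), 2))
```

Passing the full gene sequence, pasted as shown with its spaces and line breaks, gives the gene-wide value.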
 
Protein sequence
MGILGGLDIH RRQITFDYLD ERSGETRHGR IAPADRMLLR GWLQKLLADA GKPAAFAVEG 
CTGWRFVVEE LQRAGVEAHL AEPADTSAAR GPKRRAKTDR ADARLLRELL ADGRLPESWI
PPAQVLEMRA RLQLFRDLRE QHTAWVQRVH AILLHHGVPA VTGGLLGADN RRRLEVGEGL
SPAGREAVAA ALRILDALDA ELDPLRKQIT AFAARQPGCR ALQADYGIGP ITATALWTEL
GDSRRFSASR KAVRHTGLDI TVHSSDGKRS AAHLSRQGSP LLRWALFEAA QCAARPGSPD
HAYYRRVAER VGGNRAALSV ARKMVRRAHH TLRALGDQAL APV