Gene Gobs_3509 details

Gene Information

Locus tag: Gobs_3509
Symbol:
ID: 8755193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 3693885
End bp: 3695063
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: putative transcriptional regulator, PucR family
Protein accession: YP_003410472
Protein GI: 284991918
COG category:
COG ID:
TIGRFAM ID:
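
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3693885
END_BP = 3695063
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.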


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCGACC GGGTCAGCAG GCGCCCGCCG TCCGCCGCCA CGCTGCGGCG GCTGGAGCGC 
GCGTCCGGGT CGCTGGCCAC CCGCGCGGTG GCCCGGATGG ACGACGAGCT GCCCTGGTTC
CGCACCATGC CCGCCGACCA GCGCTCCTGG GTGATGCTCG TGGCGCAGGC CGGACTGGCC
TCCTTCGTCG AGTGGTGCCG CACCCCCGAC CGCCCGCCGC GGCTCACCGG CGAGGTCTTC
GGCGCCGCTC CGCGCGAGCT GGCCCGCGTC GTGCCGCTGA AGCACACCGT CGACCTGGTC
CGGGTGACCG TCGAGGTGAT GGACGACTTC GTCGGCACCG TCGCCGAGCC CGGGGACGCC
GACGTGCTGC GGCTGGCGGT GCTGCAGTAC AGCCGGGAGG TGGCCTTCGC CACCGCGCAC
GTCTACGCCA CCTTCGCCGA GAGCCGCGGC GCCTGGGACG CCCGGCTGGA GGCCCTGCTC
GTCGACGCCC TGGTCCGGGG CGAGCGCCCC GAGGAGCTCT CCGGACGGGC CTCGGCGCTG
GGCTGGGCCG CGATGACCCC GGTGACCGTG CTGGTCGGCG CGGCCCCCCA GGACGGCGAC
GTGCACGCCG CCATCCGGCG AGCGGCGCGG CAGGTGCGCA GCGAGGTGCT GGTCGGGGTG
CACGCCGACC AGTTGGTCGT CGTCCTGGGG GGCTCCCCGG ACCTGCAGTC GGCGGCCGAG
CGGGTCAGCG ACGAGTTCGG CCCCGGCCCG GTCGTCACGG GTCCGGTGGT CGACGGCCTG
GGGCGGGCCG GCGACTCCGC GTCGGCCGCG CTCGCCGGGC TGCGTGCCGC GCCCGCCTGG
CCCGGCGCCC CCCGGCCGGT GGCGGCCGAC GCGCTGCTGC CCGAACGGGC CCTGGACGGG
GACCCGTCGG CACGCACCAC GCTGCAGGAG CGGATCGCGC AGCCGCTGCA GGCCGCCGGC
GGCGAGGTGC TGGAGACGGT GCGCGCTGTG CTCGCCAGCG GGGGCAACTT GGAGGCCAGT
GCCCGCGCGC TGTTCGTCCA CCCGAACACC GTCCGGTACC GGCTCCGGCG GGCCACCGAG
ATGACCGGTC TGCCGGTCAC CGACCCGCGC GGCGCGTGGA CGGTGCAGGT CGCGCTGGCC
CTCGCCGCCC TGGACCAGGA GCGCTCCCTC TGGCACTGA
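
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The following is a minimal sketch, with hypothetical helper names, of one way to strip the formatting and compute a GC percentage. It only demonstrates the method on the first line of the sequence and makes no claim about how the record's GC content field was derived.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "GTGTCCGACC GGGTCAGCAG GCGCCCGCCG TCCGCCGCCA CGCTGCGGCG GCTGGAGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` first would allow the same function to be applied to the whole CDS.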
 
Protein sequence
MSDRVSRRPP SAATLRRLER ASGSLATRAV ARMDDELPWF RTMPADQRSW VMLVAQAGLA 
SFVEWCRTPD RPPRLTGEVF GAAPRELARV VPLKHTVDLV RVTVEVMDDF VGTVAEPGDA
DVLRLAVLQY SREVAFATAH VYATFAESRG AWDARLEALL VDALVRGERP EELSGRASAL
GWAAMTPVTV LVGAAPQDGD VHAAIRRAAR QVRSEVLVGV HADQLVVVLG GSPDLQSAAE
RVSDEFGPGP VVTGPVVDGL GRAGDSASAA LAGLRAAPAW PGAPRPVAAD ALLPERALDG
DPSARTTLQE RIAQPLQAAG GEVLETVRAV LASGGNLEAS ARALFVHPNT VRYRLRRATE
MTGLPVTDPR GAWTVQVALA LAALDQERSL WH
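
The protein sequence can be related to the gene sequence by translating codon by codon. The sketch below is a simplified illustration, not the pipeline used to produce this record. It builds the standard codon-to-amino-acid mapping (which translation table 11 shares with the standard code for internal codons) and, as a labeled simplification, translates a first codon of ATG, GTG, or TTG as M. The names `translate_cds` and `ASSUMED_STARTS` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these initiator codons are handled; each is read as M at position 0.
ASSUMED_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(text):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ASSUMED_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCCGACC GGGTCAGCAG GCGCCCGCCG"))  # MSDRVSRRPP
```

The printed residues match the first ten residues of the protein sequence listed above, including the GTG start codon read as methionine.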