Gene Gobs_0688 details


Gene Information

Locus tag: Gobs_0688
Symbol:
ID: 8752345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 729986
End bp: 731137
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Peptidase M23
Protein accession: YP_003407836
Protein GI: 284989282
COG category:
COG ID:
TIGRFAM ID:
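
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which the following minimal sketch checks. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(729986, 731137)
print(length)                  # 1152
print(protein_length(length))  # 383
```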


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.50956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGGA CGGCGGAAGG CGCGGCCCGG GCGTGGCTGT GGAAGGCCGC CGTCCCTGTC 
GCGGTGCCGA TGTTCGCCGC GCTCTTCCTC CTGCTGGTCC CGCTTGCCGT GCTCGCCGCT
CCGCAGACAT CAGCGGCGTC CGCGGCCTGC TCGACCGGCG GCACCGGCGC CACCGTCGCC
GGCGTCGACC TGGACGCCCT CCAGATGGGC CACGCGCAGA CGATCGCCAC CGTCGCCGCC
CGGCTCGGGC TGGACCCGTA CGCCGCGACC GTGGCGCTGG CCACCGCCTA TCAGGAGTCC
CGCATCCGGA TGCTGGCCAA CGACGGCAGC AGCCCCGAGC TCACCGCCGA CCAGGCCGCC
GTCACCGCCA CCAGCCTGAC CTACCCGCAC GACGGGCTCG GCTCCGACCA CGACAGCGTC
AACACCTTCC AGCAGCGCTG GATCGCCGGC TGGGGCACCG TCGGCGAGCT GATGGACCCC
GTCTACGCCG CCGAGACCTT CTACGCCCGC CTGGTCCAGG TGCCGAACTG GCGGACCATC
CCGCTGACCC GCGCGGCTCA GTCCGTGCAG GTCTCGGCCT ACGGCGGCGC CTACGCCCGC
TGGGAGCCAC TCGCCCGCGA GCTGACCGCG ATGCTGTGGC CGGCCGCCCG GGTCGCCGCA
GCCGACCCGT CCGGCGCCGC AGCCGGGGTC TGCCCGGGTC TGCCGGTGGC CGCCGGGTCG
TGGATCCGGC CGACCGCCGG GCAGGTCACC TCCGGCTTCG GGCCCCGCTG GGGCACCCTG
CACGCCGGTG TCGACATCGC CGGCCCCCGC GATACCCCGG TCTACGCCGC GTCCGACGGC
ATCGTGGTGC GCGCCGAGTG CACCAGCGCC TACTGCAACC GCGACGGCAA CCTGGACCTG
GGCGGCTACG GCAATCTCGT CGAGCTGGAC CACGGCGGCG GGGTGACCAC CCGCTACGGG
CACCTGTCGG CCTACACCGT CACCGCCGGC CAAACCGTCA CCGCCGGGAC GCTGATCGGC
TTCCAGGGCT CCACCGGCAA CAGCACCGGC GTCCACCTGC ACCTTGAGGT CCGCATCGAC
GGCACCCCGG TCGACCCCGT CCCGTGGCTG GCCGACCGCG GCGTCGACCT GCGTGCTGCC
AACCCCGGAT GA
 
Protein sequence
MTGTAEGAAR AWLWKAAVPV AVPMFAALFL LLVPLAVLAA PQTSAASAAC STGGTGATVA 
GVDLDALQMG HAQTIATVAA RLGLDPYAAT VALATAYQES RIRMLANDGS SPELTADQAA
VTATSLTYPH DGLGSDHDSV NTFQQRWIAG WGTVGELMDP VYAAETFYAR LVQVPNWRTI
PLTRAAQSVQ VSAYGGAYAR WEPLARELTA MLWPAARVAA ADPSGAAAGV CPGLPVAAGS
WIRPTAGQVT SGFGPRWGTL HAGVDIAGPR DTPVYAASDG IVVRAECTSA YCNRDGNLDL
GGYGNLVELD HGGGVTTRYG HLSAYTVTAG QTVTAGTLIG FQGSTGNSTG VHLHLEVRID
GTPVDPVPWL ADRGVDLRAA NPG
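
The sequences above are shown in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and the GC content figure, the code below strips the block spacing, computes the GC fraction, and translates codons up to the first stop. The helper names are hypothetical. The codon-to-residue mapping of translation table 11 matches the standard genetic code. The table's alternative start codons are not handled, which is an assumption that holds here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 shares this mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACCGGGA CGGCGGAAGG CGCGGCCCGG GCGTGGCTGT GGAAGGCCGC CGTCCCTGTC"
last_line = "AACCCCGGAT GA"
print(translate(first_line))  # MTGTAEGAARAWLWKAAVPV
print(translate(last_line))   # NPG
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison with the GC content and protein sequence reported in this record.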