Gene Gobs_4752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4752 
Symbol 
ID8756453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4963442 
End bp4964818 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content78% 
IMG OID 
ProductPeptidase M23 
Protein accessionYP_003411663 
Protein GI284993108 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.817263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCAGA GCCGTCCTCG CCGGCGCACG ACCGGGACCG CCACGGTCCC GGTGTACCGC 
TCGCGGACGC GCCCCGGCGT GGTGCGCAGC CTGCTGGCCG GCGTCGTCGC CGGGACGGTG
GTGCTGACCG GGGCGTCTCC CGCGAGCGCC GCTCCCGAGC CGCCGCCCAA CCCCACCGAC
GAGCAGATCG GGCAGGCGCA GTCGGCGCAG GACGCTGCCG CCGCCGAGGT CGGCCGGATC
GCCGCCCTCG TCGCCCAGGC CCAGTCCCAG CTCGAGGGCT ACGCCGTCCA GGCCGAGGCC
GCCGGCGCGG CCTACCTCGC CGCGGAGGAG GCCCTCCTCC AGGCGCAGGC CGAGGCCGAG
CGGACGGCGC TGGAGCTGGA GGCCGCCACC GCGGCCGTGG AGGCGTCCCT CGGCCGCATC
GCCGGCTTCT CCCGCGACAG CTACATGAGC GGCAACACGC TCTCCACCGC CGCGGCCCTC
CTCGATGCCG ACGGGCCGGC CGAGCTGATC GAGCGGGCTG CCATGCTCGA GTACGTCTCG
GCCAACCAGC TCGACGTCCT CGGCCAGCTG GAGGTCGCCC AGGTCAAGCA GGCCAACGCC
GACTCCGCGG CCCGCGCCGC CCGTGACAAG ACCGCCGCGG CCGAGGCCGC CGCGGCCGCC
GCCAAGGCCA CCGCCGACCA GCAGCTGGCC GCCCAGCGCG CCGCCTACGA CCGGGTCGCC
GCGGAGAAGG CCGGCTACGA CCGGCAGCTG CAGGCCGCCG AGATCGAGCT GCTGCGCCTG
CAGGGCGCCC GCGACGCCTT CCAGGCCTGG CAGCAGCAGA AGGCCGCCGA GGAGGCCGCC
GCCGCCGCGG CCGCCCGCCG GGCCGAGGAG GAGGCCGCCG CCGCGGCCGC CGCAGCCCGC
GCCGCCGCCC GCAGCCAGGG CTCCGGGTCG AGCAGCAGCG GCTCCACCGG CGCCGGCAGC
TCCGGCGGCT CGGGCCCCTA CGTCAAGCCG ACCTCGGGTC GCACCTCCAG CTGCTACGGC
TTGCGCTGGG GCGCCCTGCA CGGTGGCGTG GACATCGCCG CCCCGATCGG GACGCCGATC
TACGCCGCGC ACTCGGGCGT CGTCGCCCGG GCCGGGACCG CCACGGGCTT CGGCTACGCC
GTCTACATCC GCGGCGACGA CGGCGCGGTC ACCGTCTACG GGCACGTCAA CGAGTACTTC
GTCCGCGCCG GCGAGCGGGT CGACGCCGGC GAGCGGATCG CCACGGTCGG CAACCGCGGC
CAGTCGACCG GCCCGCACCT GCACTTCGAG GTGCACCCCG GCGGGGCGAT GTACGGCGGC
CAGGTCGACC CGGTGCCGTG GATGCGCGCC CGCGGCGTGT CCATCAGCGG CTGCTGA
 
Protein sequence
MLQSRPRRRT TGTATVPVYR SRTRPGVVRS LLAGVVAGTV VLTGASPASA APEPPPNPTD 
EQIGQAQSAQ DAAAAEVGRI AALVAQAQSQ LEGYAVQAEA AGAAYLAAEE ALLQAQAEAE
RTALELEAAT AAVEASLGRI AGFSRDSYMS GNTLSTAAAL LDADGPAELI ERAAMLEYVS
ANQLDVLGQL EVAQVKQANA DSAARAARDK TAAAEAAAAA AKATADQQLA AQRAAYDRVA
AEKAGYDRQL QAAEIELLRL QGARDAFQAW QQQKAAEEAA AAAAARRAEE EAAAAAAAAR
AAARSQGSGS SSSGSTGAGS SGGSGPYVKP TSGRTSSCYG LRWGALHGGV DIAAPIGTPI
YAAHSGVVAR AGTATGFGYA VYIRGDDGAV TVYGHVNEYF VRAGERVDAG ERIATVGNRG
QSTGPHLHFE VHPGGAMYGG QVDPVPWMRA RGVSISGC