Gene Gobs_3974 details

Gene Information

Locus tag: Gobs_3974
Symbol:
ID: 8755662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 4172804
End bp: 4174168
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase M50
Protein accession: YP_003410912
Protein GI: 284992358
COG category:
COG ID:
TIGRFAM ID:
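
The coordinates and lengths listed above are consistent with one another if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both are assumptions, not statements from the record itself. A minimal sketch of that arithmetic follows; the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 4172804
END_BP = 4174168


def gene_length_bp(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Codon count minus one, assuming the last codon is a stop codon."""
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp, protein_length_aa(length_bp))  # prints "1365 454"
```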


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.17805
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTGCTGA CCGTCCTCGG GATCGTCGCC TTCGCGGCCG GTCTGCTGTT CTCGATCGCC 
TTCCACGAGT ACGGGCACTT CTTCTGGGCC CGGAAGTTCG GCATGCGGGT GCCGCAGTTC
ATGGTCGGTT TCGGCCCGAC GCTGTTCTCC CGCACGCGGG GGGAGACCGA GTACGGGATC
AAGGCCGTCC CGCTGGGCGG CTACATCCGC ATCGTCGGGA TGATCCCGCC GGCCGAGGAG
AACGAGAGCA CGCGGGCCAC CCGCATGCGC TCGTTCATCG CCGAGGTGCG CGGCGCCGCG
CTCGACGACG TCCGCCCCGG CGACGAGGGC CGGGTGTTCT ACGCCAAGCC CTGGTGGCAG
CGGGTCATCG TGATGTTCGC CGGCCCCTTC CACAACCTGG TGCTCGCGGT CCTGCTCTTC
ACGGTGCTGC TCACCGTCGT CGGCACCAGC GTGCTGACCA CGACGGTGCG CGACGTCCCC
GCGTGCGTGC TGCCCGCGGG TGCCGTCACC GCGCTGCAGG ACGACGCCTG CTCGGTGCCG
CTCACGCCCG AGGGGCAGAC CTGCGAGGCG GGGGCGGCAG GCTGCGCGCT GCCGCAGCAG
AGCCCCGCCG CGGCCGCCGG GCTGCGTTCC GGCGACACGA TCGTCGCCAT CGGGGGCCGG
CCGCTGGACC CGACCGCGTA CGACAGCTGG ACGGCGGTGC AGGAGGCGAT CCGCACCAGC
CCCGGTCAGC CGCTGGACGT CACCATCGAG CGGGACGGCG CGCGGCAGCG GCTCACCGTC
ACGCCGATCC CCAACACCGT CTACGCCGAC CCGACCGACC CCACCGAGGG GACGACGACC
GCCGGCTACC TCGGGATCTC GCCGAGCGTC CAGCTGGCCC GGCAGGACGC CGCGGCCATC
CCCGGCTACT TCGGGATGAT CGTGACGAAC GCCGTCGAGC GGCTGGTCGA GATCCCCGAG
CGCATCCCGC AGCTGTTCCG CGCGGCGTTC CTGGGTGAGG AGCGCGACCC CAACGGGCCG
ATCGGCGTCG TGGGCGTCGG CCGCATCTCC GGCGAGGTCT TCGCCATCCC CGAGCTCACC
GGCACGGAGA AGGTCAGCAC GTTCCTGCAG CTGCTGGCCA GCATCAACCT GGTGCTGTTC
CTGTTCAACC TGCTGCCGAT CTACCCGCTC GACGGCGGGC ACGTCGCCGG CGCGCTGTAC
GAGAAGGCGC GCGCGGTCGT CGCCCGGCTG CGTGGCCGGC CCGACCCCGG CCCGTTCGAC
ATCGCCCGGC TGATGCCGGT CGCCTACCTC GTGGCGGGCC TGTTCGTCGT CCTCTCGGGC
CTGCTGCTGA TCGCCGACAT CGTCAACCCG ATCACCCTGC AGTGA
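
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C among the A, C, G and T characters, and that spaces and line breaks in the listing should be ignored. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces, newlines
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the listing above: 5 of 10 bases are G or C.
print(gc_percent("TTGCTGCTGA"))  # prints 50.0
```

Passing the full sequence listing as a single string would give the gene-wide figure to compare with the table.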
 
Protein sequence
MLLTVLGIVA FAAGLLFSIA FHEYGHFFWA RKFGMRVPQF MVGFGPTLFS RTRGETEYGI 
KAVPLGGYIR IVGMIPPAEE NESTRATRMR SFIAEVRGAA LDDVRPGDEG RVFYAKPWWQ
RVIVMFAGPF HNLVLAVLLF TVLLTVVGTS VLTTTVRDVP ACVLPAGAVT ALQDDACSVP
LTPEGQTCEA GAAGCALPQQ SPAAAAGLRS GDTIVAIGGR PLDPTAYDSW TAVQEAIRTS
PGQPLDVTIE RDGARQRLTV TPIPNTVYAD PTDPTEGTTT AGYLGISPSV QLARQDAAAI
PGYFGMIVTN AVERLVEIPE RIPQLFRAAF LGEERDPNGP IGVVGVGRIS GEVFAIPELT
GTEKVSTFLQ LLASINLVLF LFNLLPIYPL DGGHVAGALY EKARAVVARL RGRPDPGPFD
IARLMPVAYL VAGLFVVLSG LLLIADIVNP ITLQ
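
The protein sequence starts with M although the gene sequence starts with TTG, which normally encodes leucine. This fits translation table 11 as listed in the Gene Information table: it uses the standard codon assignments but accepts several alternative start codons, and an initiating codon is read as methionine. A minimal sketch of that translation, assuming the NCBI table 11 start codon set and using a hypothetical `translate` helper:

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq):
    """Translate a coding sequence, reading a valid first codon as M."""
    dna = "".join(seq.split()).upper()  # remove spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("TTGCTGCTGA CCGTCCTCGG GATCGTCGCC"))  # prints MLLTVLGIVA
```

The output matches the first ten residues of the protein sequence above.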