Gene Gobs_2998 details


Gene Information

Locus tag: Gobs_2998
Symbol:
ID: 8754671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 3132520
End bp: 3134856
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: peptidase M28
Protein accession: YP_003409979
Protein GI: 284991425
COG category:
COG ID:
TIGRFAM ID:
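
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are inclusive (an assumption that matches the listed 2337 bp) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3132520
end_bp = 3134856

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 2337 0 778
```

A remainder of 0 indicates the gene length is a whole number of codons, which agrees with the listed 778 aa protein.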


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAGTC GCGACGACAC GACTGCGCGG GCCCCGCGGC CCGTGGTCGG CATCGCCGCA 
TCGGCGGTCC TGCTGCTGGT CGCCGTCCTG GCGGTGCTCG CCATGGCCCC GGCACGACCG
CGGGCGGCGG ACGCGGCGGT GGAGGTGTTC AGCGCTGCAC GAGCCCTCGA CCAGCTGGCG
CACGTCGCCG TCGTCCCGCG CCCGGTGGGG TCGCCCGGCC ACGCCCTCGC CCGTGGACAC
CTGCTCGCCA CCCTCGGGGC CTGGGGCTGG CGCACCGAGG TGCACCAGGC CGTGGGGGCC
ACCGACTTCG AGGAGGCCGG CACCCAGCCG GTGGCGCTCG TGCGCAACGT CGTCGCCACC
TGGCCGGGCA CCGACCCCAC CGGCACGGTG GTGCTCGCCG CGCACTACGA CACCGTCGCC
GGCTCCCCGG GTGCCGCCGA CGACGGCATC GGGGTGGGCA CCGTGCTGGA GGTGGCGCGG
GCGCTGAGCG CCGAGGACGC CGCCCCGCTG CGCAACGACG TCGTGGTGCT GCTGACCGAC
GCCGAGGAGC CCGGCCTGCT CGGCGCCGAG GCCTTCGCCC GTGAGCGCGC GGCCTCGCTC
GGGGAGACGG TGGTGCTCAA CCACGAGGCC CGCGGGGCGT GGGGAGCCCC GACCACCTTC
CGGACGACGT CCCCGAACGG GGTGCTGCTC GAGGCGCTGT CGGGTGCGCC GGGGGCCTCG
GCCGACTCCG CCTCCGAGGC GGCCTTCGAG GCACTGCCCA ACGGCACCGA CTTCACCCCG
CTCACCGGGG CCGGCCTGCA CGCGCTCGAC ACGGCGATCG CCGCCGGCAG CGCCCACTAC
CACTCGCCCG TCGACGACCT GGCGCACCTG AGCCCCGCCT CCGTGCAGCA GATGGGCGAC
ACCAGCCTCG CGGTGGCCCG CGACCTCGCC GCAGCGGACC TGGCCACGGT CGCAGCCGGC
GGCGGGCAGG TGGTCACGAC CCTGCCCTGG GGCCTGCTGC GGTACCCGCA GGCGGCGGAG
GGACCGCTGG CCCTCGCCGC GCTCACCGGC GCGACGGCGC TGGTGGCGCT GCGCCGGCGC
CGGCGCGCAC TGACGCTGCC GCGCACCGCG CTGTCCGCCC TCGTCGCGGT GGTGCTGCTG
GCCGCCGCCG GGACCGCGGG GTGGGCGACG TGGCAGCTGG CGCTGCTGGT CGACCCCGGC
CAGGCCTCCG CGGTGGTCGG GGACCCGTAC CGGCCCACCG CCTACGGGGT GGCGGTGCTG
CTGGCGGCTG CCGGCATGGT CCTGGGTGGG TACGCCTCGG TCCGCTCCCG ACTGGGTCCA
GCCGCGGTCG CCACCGGTGC GCTGACCGCG CTGGCCGTCG CCGGAGCGCT CCTGGTGCTG
GTACCCGGGG TCTCCGGCCT GCTGGTCCTG CCGGTGCTCT GCGCCGTCGT CGGGTCCCTG
GCCGCGGAGG TGATACCCCC GCGGGCGGCC GCCGTGCGCG CCGCCGCGGT CCTGGTCGGC
TCGGCGGGCG CCACCCTGCT GCTCGGGCCG GGCGCCTGGG TCGCCGCGGA CGTCGGGCTG
GCCACGGGCG GGCCCGCCGC TGCGGCGTTC CTCTGCGTGC TCCTGCTGCT GGTGCTGCCC
GTGGTCGACC TCGCCTGGCC GCTGCCGCAC ACGCCCCGGC GCCGGCAGGT GCGGAGGACG
GCCGCGGTGC CGACGGCGGT GCTCCTCCTC GCGGTGGGAA TGACGGCCGC CGGGCTCGTG
ATCAACCGGG CAGGCGCCAC GGCGCCGCGC CAGGAACAGC TGGAGTACGT CCTCGACGCC
GACGCCAGGA CGGCGCTGTG GACCTCGCGC ACCGGCCCGC GCAGCACCTG GAGCGCCGAG
CTGCTGACCC GGTCCCCCGC CCGGCTCGAC GACGTCCTCC CCCGCGCGGG GGACGCCCCG
CTCGCCCACG GGCCCGCACC GTTGGTCGAC CTGGCGGCGC CGGAGGTCGC GGTCGTGGCC
GACACGGCCC GCGACGGGCA GCGCGAGCTG GTCCTGCGCC TGTCCTCCGC TCGCGGCGCT
GCTGCGGTGG GCCTGTGGGT GGACGCCGCC GGCGCCACGG TGCGCGGCGC GCGTGTCGCC
GGCCGGGAGC TGCCGCTCAA CGGCGCGTTC GGCCCCTGGG ACTTCGGGTT CGTCCTCGAG
GGCGCGCCGG CGGACGGTGT CGAGGTCCGC CTGCTGCTGG ACCAGCGCGC CGGTGCGCTG
GCGCTGCGCA TCGCCGACCG CAGCGACGAC CTCGCTGCCG TGCCCGGCGC CATCCCGCCA
CGGGGACGGG TGCTCGTGAC CCCTCACCTG TGGGTGGTGC GCGGGATCGA GCTGTGA
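
The GC content field in the gene information can be checked against the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base blocks separated by whitespace. The helper name `gc_content` is hypothetical and is not part of the database.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used only as a small demo.
first_block = "ATGCGCAGTC"
print(gc_content(first_block))  # prints: 60.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the listed GC content.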
 
Protein sequence
MRSRDDTTAR APRPVVGIAA SAVLLLVAVL AVLAMAPARP RAADAAVEVF SAARALDQLA 
HVAVVPRPVG SPGHALARGH LLATLGAWGW RTEVHQAVGA TDFEEAGTQP VALVRNVVAT
WPGTDPTGTV VLAAHYDTVA GSPGAADDGI GVGTVLEVAR ALSAEDAAPL RNDVVVLLTD
AEEPGLLGAE AFARERAASL GETVVLNHEA RGAWGAPTTF RTTSPNGVLL EALSGAPGAS
ADSASEAAFE ALPNGTDFTP LTGAGLHALD TAIAAGSAHY HSPVDDLAHL SPASVQQMGD
TSLAVARDLA AADLATVAAG GGQVVTTLPW GLLRYPQAAE GPLALAALTG ATALVALRRR
RRALTLPRTA LSALVAVVLL AAAGTAGWAT WQLALLVDPG QASAVVGDPY RPTAYGVAVL
LAAAGMVLGG YASVRSRLGP AAVATGALTA LAVAGALLVL VPGVSGLLVL PVLCAVVGSL
AAEVIPPRAA AVRAAAVLVG SAGATLLLGP GAWVAADVGL ATGGPAAAAF LCVLLLLVLP
VVDLAWPLPH TPRRRQVRRT AAVPTAVLLL AVGMTAAGLV INRAGATAPR QEQLEYVLDA
DARTALWTSR TGPRSTWSAE LLTRSPARLD DVLPRAGDAP LAHGPAPLVD LAAPEVAVVA
DTARDGQREL VLRLSSARGA AAVGLWVDAA GATVRGARVA GRELPLNGAF GPWDFGFVLE
GAPADGVEVR LLLDQRAGAL ALRIADRSDD LAAVPGAIPP RGRVLVTPHL WVVRGIEL
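
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    # Remove layout whitespace; codons with non-ACGT letters raise KeyError.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGCGCAGTC GCGACGACAC GACTGCGCGG"))  # prints: MRSRDDTTAR
print(translate("GGGATCGAGCTGTGA"))                   # prints: GIEL
```

Both outputs match the start (MRSRDDTTAR) and the end (GIEL) of the protein sequence listed above.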