Gene Gobs_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2042 
Symbol 
ID8753713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2120829 
End bp2122901 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content73% 
IMG OID 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003409101 
Protein GI284990547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGACC CGTCCACGTA CCGACCGGCG GTCGGCAGCA TCCCGGAGTC GCCCGGCGTC 
TACAAGTTCC GCGACCCCAA CGGCCGGGTG GTCTACGTCG GCAAGGCCAA GAACCTCCGG
CAGCGGTTGA ACAGCTACTT CGCCGACGTC GCGGGCCTGC ACCCTCGCAC CCGGCAGATG
GTGACGACGG CGGCCAGCGT CGAGTGGACC GTCGTCGGCA CGGAGGTCGA GGCCCTCCAG
CTCGAGTACA ACTGGATCAA GGAGTTCGAC CCGCGGTTCA ACGTCCGCTA CCGCGACGAC
AAGAGCTACC CGAGCCTCGC CGTGACGCTC AACGAGGAGT ACCCGCGGCT GCAGGTGATG
CGCGGCCCGA AGAAGAAGGG CGTCCGCTAC TTCGGCCCGT ACGCGCACGC CTGGGCCATC
CGCGAGACGC TCGACACCCT CACCCGAGTC TTCCCGGCGC GGACCTGCTC CAACGGCGTC
TTCAAGCGCG CCGGCCAGAT CGGCCGGCCC TGCCTGCTCG GCTACATCGG CAAGTGCGCC
GCCCCCTGCG TCGGCCGGGT CTCCGCGGAG GAGCACCGGC AGATCGTCGA CGGCTTCTGC
GAGTTCATGG CCGGGCGCAC CGACCAGATG ATCCGCCGCC TCGAGCGCGA GATGGCCGAG
GCCGCCGAGG CGATGGAGTA CGAGAAGGCC GCCCGGCTGC GCGACGACCT CGGCGCCCTC
CGGCGGGCGA TGGAGAAGCA GGCCGTCGTC CTCGGGAACG GCACCGACGC CGACGTGGTC
GCCTTCGCCC AGGACGAGCT GGAGGCCTCC GTGCAGGTCT TCCACGTCCG CGGCGGTCGG
GTCCGCGGCC AGCGCGGGTG GATCGTCGAC AAGGTCGAGG AGGTCAGCAC CGGCGAGCTG
GTCGAGCAGT TCGTGCTGCA GGTCTACGGC GGCGTCGACG AGGCGACCGG CGTCGGGGAG
GCGGGGGAGG CGGTGCCCAG GGAGGTGCTC GTCCCCGAGC TGCCCGACGA TGCCGACGTC
TACGAGGAGC TGCTCAGCGA GCTCCGCGGC AGCCGCGTGA GCCTGCGGGT GCCACAGCGG
GGCGACAAGC GGGCCCTGCT GGAGACGGTG GAGCGCAACG CCAAGGAGGC CTTCGCCCGG
CACCGGGTCA AGCGGTCCAG CGACCTCACC GCCCGCTCGC TGGCGCTGTC GGAGCTGCAG
GAGGCGCTGG AGCTGCCCGA CGCGCCGCTG CGCATCGAGT GCATCGACGT CTCGCACGTC
CAGCAGACCA ACGTCGTGGC CAGCATGGTC GTCTTCGAGG ACGGTCTGGC GAAGAAGTCC
GACTACCGCC GGTTCTCCGT CACCCACGGC ACCGACGACA CCGCGGCGAT GGCCGAGGTC
GTCCGGCGCC GGTTCGCCCG CCACCTCAAG GAGGAGCAGG ACCGCCGCGA CGAGCAGGGG
GTGGCGGCGG AGGAGGGCCG GCCGCGCCGG TTCGCCTACC CGCCGAACCT GCTCGTGGTG
GACGGCGGTG CACCGCAGGT CGCCGCGGCC GCCCGGTCGC TCGACGAGCT GGGCATCGTC
GACGTGGCCG TCTGCGGGCT GGCCAAGCGG ATGGAGGAGG TGTGGCTGCC CGGGGAGTCC
GACCCGGTCA TCCTGCCGCG CACCTCCGAG GCGCTGTACC TGCTGCAGCG GGTCCGCGAC
GAGGCGCACC GCTTCGCGAT CACCTACCAC CGGCAGAAGC GGTCCACCAG CATGCTGGTC
TCCCTGTTGG ACGACGTCCC CGGGCTCGGG GAGACGCGAC GGAAGGCGCT GATGAAGCAG
TTCGGATCCC TCAAGAGGCT GCGGGCTGCC ACGGTCGAGG AGCTGATGGT CGTGCCCGGC
ATCGGTCGGC GGACGGCGGA GGCAGTGCTC GCGGCGGTGG CGCAGCCGGA GGCGGGGGAG
GCGGGCGCCC CCGAGGCGAG CGGGCCGGCG GACGACGCCG CGCCCGCCGG GGAGACCGTG
GAGACGACGA CCGCCCCGGC CGGTCCGGCC GCGCGCGGCA CCCCCCGCAG CGGCTCCGGC
GCCACCGCCG AGGTCGGGCT GGTGGCGTCG TGA
 
Protein sequence
MPDPSTYRPA VGSIPESPGV YKFRDPNGRV VYVGKAKNLR QRLNSYFADV AGLHPRTRQM 
VTTAASVEWT VVGTEVEALQ LEYNWIKEFD PRFNVRYRDD KSYPSLAVTL NEEYPRLQVM
RGPKKKGVRY FGPYAHAWAI RETLDTLTRV FPARTCSNGV FKRAGQIGRP CLLGYIGKCA
APCVGRVSAE EHRQIVDGFC EFMAGRTDQM IRRLEREMAE AAEAMEYEKA ARLRDDLGAL
RRAMEKQAVV LGNGTDADVV AFAQDELEAS VQVFHVRGGR VRGQRGWIVD KVEEVSTGEL
VEQFVLQVYG GVDEATGVGE AGEAVPREVL VPELPDDADV YEELLSELRG SRVSLRVPQR
GDKRALLETV ERNAKEAFAR HRVKRSSDLT ARSLALSELQ EALELPDAPL RIECIDVSHV
QQTNVVASMV VFEDGLAKKS DYRRFSVTHG TDDTAAMAEV VRRRFARHLK EEQDRRDEQG
VAAEEGRPRR FAYPPNLLVV DGGAPQVAAA ARSLDELGIV DVAVCGLAKR MEEVWLPGES
DPVILPRTSE ALYLLQRVRD EAHRFAITYH RQKRSTSMLV SLLDDVPGLG ETRRKALMKQ
FGSLKRLRAA TVEELMVVPG IGRRTAEAVL AAVAQPEAGE AGAPEASGPA DDAAPAGETV
ETTTAPAGPA ARGTPRSGSG ATAEVGLVAS