Gene Gobs_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2120 
Symbol 
ID8753791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2204759 
End bp2206069 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content78% 
IMG OID 
ProductDNA polymerase LigD, ligase domain protein 
Protein accessionYP_003409176 
Protein GI284990622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0836628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGACCC GGGGGAGCGC CCGGCGGCCC ACCCCGCCGG CCGGGGAGCC GTTCGTCGTC 
ACCGAGCACC CGTCCAGCGG CGCCGAGCTG CGGCTGGAGC GGAACGGCGT GCCGTGCTGC
TGGGACCTGC CGGAGGGGCC ACCCGCCGCG CGGGGCCGGG CGCTGCCCGC CGTCCCCACC
ATGGGTGGGG ACGGGGGCGG GCTGCCGACC TGGGACGCCG GCCGCTACGC CGTCGAGCAG
TGGACCGACG ACCGCGTCGT CGTGGTCCTC GCCGGGCGGC GGCTGCGGGG CCGGCACGTC
CTCTTCCGGT CGCCCGACGG CGGCTGGTCC GTCCGCGCGC TGGACGCCCC GGCGGAGGGG
GCGCCCCTGG TCCCGATGCT CGCCACCGCG GGGGAGCTGC CGCCGTCCGC GCAGGACCAC
GACTGGGGCT ACGAATTCAA GTGGGACGGC GTCCGGGCGC TGGCCGTCGT CGAGGCCGGC
GGGCTCGCGC TCTGGGCCCG CAGCGGCACC GACATCACCG TCCGTTATCC CGAACTGAGC
CTTCCGACAG CCCTGACCGG CCACGACGCC GTCGTCGACG GGGAAGTGGT CGCCCTCGAC
GCGCGCGGCA GGCCGGACTT CGGCGCGCTT CAGGGGCGGA TGCACCGCAC CGGCCCCGAG
GTGCGCCGGA TGGCCGCGAC CACCCCGGTG ACCTACCTGG TGTTCGACCT GCTCGCGTGG
GAGGGCGAGA GCCTGCTCGC GCTGCCGTAC GCGCAACGCC GCGAGCGGCT GGAGGCGCTG
GGCGTCGCCG GCGAGCGGTG GGTGCCCACG CCCTGGTTCC GCGGCGGTGG AGCCGCGGTG
CTGGCCGCCA GCCGCGACAA CGGACTGGAG GGGATCGTCG CCAAGCGGCT GGACTCGCCG
TACCGCCCCG GCCTCCGGGG GCCGGACTGG CGCAAGGTGA AGAACGTCCG GACGCAGGCG
GTCGTCGTCG GCGGCTGGCG GCCCGGGCAG GGCCGGCGGG CGGGGGGAGT GGGGTCGCTG
CTCGTGGGGG TGCACGACGA CGCCGGACGG CTGGTCTACG CCGGGCACGT CGGCACCGGC
TTCACCGCCG CGGCCCTCGC GGAGCTCCAG CCGCTCTTCA CGCCGGCCCA CCGCCCGCCG
TTCGCCGACG CACTGCCCCG CGAGGTCACC CGCGACGCGC GCTGGGTGGC GCCGGAGCTG
GTCGGCGAGG TGGCGTTCGC CGCGTGGACC GCCGACGGCC GGATGCGCCA CCCGTCGTGG
CGAGGGCTGC GCGACGACCT GGCGCCGGAG GACGTGGTCG AGGAGTGGTG A
 
Protein sequence
MPTRGSARRP TPPAGEPFVV TEHPSSGAEL RLERNGVPCC WDLPEGPPAA RGRALPAVPT 
MGGDGGGLPT WDAGRYAVEQ WTDDRVVVVL AGRRLRGRHV LFRSPDGGWS VRALDAPAEG
APLVPMLATA GELPPSAQDH DWGYEFKWDG VRALAVVEAG GLALWARSGT DITVRYPELS
LPTALTGHDA VVDGEVVALD ARGRPDFGAL QGRMHRTGPE VRRMAATTPV TYLVFDLLAW
EGESLLALPY AQRRERLEAL GVAGERWVPT PWFRGGGAAV LAASRDNGLE GIVAKRLDSP
YRPGLRGPDW RKVKNVRTQA VVVGGWRPGQ GRRAGGVGSL LVGVHDDAGR LVYAGHVGTG
FTAAALAELQ PLFTPAHRPP FADALPREVT RDARWVAPEL VGEVAFAAWT ADGRMRHPSW
RGLRDDLAPE DVVEEW