Gene Gobs_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3043 
Symbol 
ID8754719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp3189458 
End bp3190588 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content79% 
IMG OID 
ProductUroporphyrinogen III synthase HEM4 
Protein accessionYP_003410024 
Protein GI284991470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.178966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGATG TCCTGCCCGA GGCCCCGCCG GGGACCGAGG CACCGCTGCC CCTGGCCGGG 
TACACCGTCG CGGTGACCGC CGCCCGGCGC CGCGAGGAGC TGGGTGCGCT GCTGGCCCGC
CGCGGCGCCC GCGTGGTGTA CGCGCCGGCC ATCCGCATCG TGCCGCTGGC CGACGACACC
GAGCTGGTCG CCGCGACGCG CCAGGTACTG GCGCAGCCGG TGGACCTGGT CGTGGCGACC
ACCGGCGTCG GCTTCCGCGG GTGGCTGGAG GCGGCCGACG CGTGGGACCT GCCGCTGGTG
GAGCACCTGC GCGGCGCCCG GGTGCTCGCG CGCGGGCCCA AGGCGCGGGG CGCCATCCGC
GGCGGCGGGC TGGTCGACGC CTGGTCGCCG GCGTCGGAGT CCTCGGCCGA GGTGCTCGAG
CACCTGCTCG CCGGGGCCGA GGGCCCGCTG CAGGGACGCC GCATCGCCGT CCAGCTGCAC
GGCGACCCGC TGCCGGACTT CGTCGAGGCG CTGCGCGCGA CCGGCGCCGA GGTCGTCACC
GTGCCGGTGT ACCGCTGGGT GCTGCCCGAG GACGTCGAGC CGGTGCGCCG GCTGGTGCGC
TCGGTGGTCA CCGGCGCGGT CGACGCGGTG ACCTTCACCA GCGCCCCGGC CGCCGCGAGC
CTGCTGACCG TCGCCGACGA GCTCGGTCAG CGCGCCGAGC TGATCGCCGC GCTGACCGAC
GGCGTCCTGC CGGTGGCGGT GGGGCCGGTG ACCGCCGGGC CGCTGACCGC CGCGGGCATC
CCCTCCGTGC AACCGGAACG CGCCCGGCTC GGCGCCCTGG CCCGCGAGGT GGTCGCCCGG
CTGCCCGAGC GCACCCCGGT GCTGCGGGTG GGCGAGCGGG ACCTGCAGGT GCGCGGGCAC
GCCGTCCTGC TCGACGGGCG GGTGGTGGAG CTGGCGCCGG GCCCGATGGC GGTGCTGCGC
TCGCTGGCCG CGCGGCCGGG CACCGTCGTC GCCAAGGCCG ACCTCGTCGC GGGGCTGCCC
GGCGGCGGCG ACGAGCACGC CGTGGAGATG GCCGTGACCC GGCTGCGCGC CGCGCTCGGC
CGCGGCGTGG TGGAGACCGT GGTGAAGCGG GGCTACCGCC TGGCTGCGTG A
 
Protein sequence
MTDVLPEAPP GTEAPLPLAG YTVAVTAARR REELGALLAR RGARVVYAPA IRIVPLADDT 
ELVAATRQVL AQPVDLVVAT TGVGFRGWLE AADAWDLPLV EHLRGARVLA RGPKARGAIR
GGGLVDAWSP ASESSAEVLE HLLAGAEGPL QGRRIAVQLH GDPLPDFVEA LRATGAEVVT
VPVYRWVLPE DVEPVRRLVR SVVTGAVDAV TFTSAPAAAS LLTVADELGQ RAELIAALTD
GVLPVAVGPV TAGPLTAAGI PSVQPERARL GALAREVVAR LPERTPVLRV GERDLQVRGH
AVLLDGRVVE LAPGPMAVLR SLAARPGTVV AKADLVAGLP GGGDEHAVEM AVTRLRAALG
RGVVETVVKR GYRLAA