Gene Gobs_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_2106 
Symbol 
ID8753777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2191860 
End bp2192981 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content76% 
IMG OID 
Productferrochelatase 
Protein accessionYP_003409162 
Protein GI284990608 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCCGCG CCACCGTCGA CATCAGCCCC GGCCAGCGAC CCGACGAGGA CCCCTCGCTG 
GCCGTCCGCA ACAACGGCGG CGTCCCGCCG TCGGCCGACC CCGCCCCCGC CGGACGACGG
GAGGCGCTGC TGGTGCTCTC CTTCGGCGGT CCCGAGGGCC ACGACGACGT CATGCCGTTC
CTCGAGAACG TCACCCGCGG CCGCGGGATC CCGCGCGAGC GGCTCGAGGA GGTCGCCGAG
CACTACCACC ACTTCGACGG CGTCAGCCCG ATCAACGGCC AGAACAAGGC GCTGATCGCG
GCGGTCGAGG CCGACCTCGC CGCCGCCGGC GTCGAGCTGC CGGTCTACTG GGGCAACCGC
AACTGGGCGC CCTACGTCGA GGACACCTGG GCGCAGATGG CCGACGACGG CATCGAGCAC
GTCTACGTCC TCGCCACCTC CGCCTACGCC TCGTACTCCG GCTGCCGGCA GTACCACGAG
GACATCGCAC GGGCCCGCGT CGCGACCGGC GGCGGGCCGA CCGCGGAGAA GCTGCCGCAC
TACTTCGACG CGCCGGGCTT CGTGCAGGCC AACGCCGACG CCCTGGCCGC CGCGATCGCG
TCGCTGCCCG AGGAGGTCCG CGGCACCGCC CGGCTGGTGG CCACGGCGCA CTCCATCCCC
GACACGATGG CCGCCGTGGC GGGGCCCGAG GGGCACGCCT ACGAGGCCGA GCTCACGACC
GCCGCGCAGC TGGCGGTCGA CGCCGCCGCA CCCGGCCGGT CCTTCGACCT GGTGTGGCAG
AGCCGCAGCG GCCCGCCGTC CGTGCCGTGG CTGGAACCCG ACGTCAACGA CCACCTGCGC
GCGCTGGCCG CGGCGGGGGA GCAGGCCGTC GTCCTGTTCC CGGTCGGGTT CGTCAGCGAC
CACCTCGAGG TCGTCTGGGA CCTCGACAAC GAGGCGAAGG AGACCGCCCG GGAGTGCGGG
CTGGCCTTCG CCCGCGCCGC GACCGCCGGG ACCCACCCGG CGTTCGTCCG CTCGCTGGTG
GAGCTGCTGC GGGAGCGCCG CGCCGGCGGG CAGCCCCGCC TGGGGACCGA CTGCCCCGCG
TCGTGCTGCT TCGTCGCGCG GCCGGCCCGC CCGACCGCCT GA
 
Protein sequence
MPRATVDISP GQRPDEDPSL AVRNNGGVPP SADPAPAGRR EALLVLSFGG PEGHDDVMPF 
LENVTRGRGI PRERLEEVAE HYHHFDGVSP INGQNKALIA AVEADLAAAG VELPVYWGNR
NWAPYVEDTW AQMADDGIEH VYVLATSAYA SYSGCRQYHE DIARARVATG GGPTAEKLPH
YFDAPGFVQA NADALAAAIA SLPEEVRGTA RLVATAHSIP DTMAAVAGPE GHAYEAELTT
AAQLAVDAAA PGRSFDLVWQ SRSGPPSVPW LEPDVNDHLR ALAAAGEQAV VLFPVGFVSD
HLEVVWDLDN EAKETARECG LAFARAATAG THPAFVRSLV ELLRERRAGG QPRLGTDCPA
SCCFVARPAR PTA