Gene Gobs_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4089 
Symbol 
ID8755780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4292727 
End bp4293932 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content76% 
IMG OID 
ProductCysteine desulfurase 
Protein accessionYP_003411025 
Protein GI284992471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCT CCGAGGCGGT CTACCTGGAC CACGCGGCAA CCACGCCGAT GCTGCCCGCA 
GTGCTGGCCG CGATGACCGG GCAGCTGGGC CGCGTGGGCA ACGCCTCCTC GCTGCACGCC
AGCGGCCGCG CCGCGCGCCG GGTCGCCGAG CAGTCGCGCG AGCGGCTGGC GGAGGCGCTG
GGCGCGCGCC CGTCGGAGGT GCTGTTCACC GGCGGCGGCA CCGAGAGCGA CAACCTCGCC
GTCAAGGGCC TGTTCTGGGC CCGGCGCGAC GCCGACTCCC GCCGCCGGCG CATCGTGGTC
AGCCCCGCCG AGCACCACGC GGTGCTCGAC AGCGTCGAGT GGCTGACCAA GCACGACGGC
GCCGACGTCA CCTGGCTGCC CGTCGAGCCG ACGGGCCGCG TCACCCCCGA GGCGCTGCAC
GCGGCCCTGG GCAGCGGTGA GGACGTCGCC CTGGTCAGCG TCATGTGGGC CAACAACGAG
ATCGGCACGG TCAGCGACCT GGCCGCGCTC GCCGAGGTCG CGCACGACGT GGGCGTCCCG
CTGCACACCG ACGCGGTCCA GGCGGTCGGG CAGGTGCCGG TCGACTTCGC CGCCAGCGGC
GTCGACGCGC TGACCATGAC CGGCCACAAG CTCGGCGGGC CGATGGGTGC CGGCGTCCTG
CTGCTGCGCC GCGAGGCTGA GTGCACCCCG TTGCTGCACG GCGGCGGCCA GGAGCGCGAC
GTGCGCTCGG GCACCCTCGA CGTCGCGGCG ATCGTCGGCC TGCAGGTCGC CACCACGCTG
GCCGTCGCCG AGCGGGAGGA CCGCGCCGCG CGGCTGGCCG CCCTGCGCGA CCGGCTGGTG
TCCGGCGTGG TGGCGCAGGT GCCCGACGCC CAGCTCAACG GCCCCCCGCT GGACGACGTC
GTCGCCGGTG GGCCGGGACG GCTGCCGGGC AACGCGCACC TGTCCTTCCC CGGTGCGGAG
GGCGACGCGC TGCTCATGCT GCTCGACGCC CGCGGCGTGG AGTGCTCCAC CGGATCGGCC
TGCAGCGCCG GCGTCGCCCG GCCCAGCCAC GTGCTGCTGG CCACCGGCGC CGACCCCGAC
CGGGCACGCA GCTCACTGCG CTTCAGCCTC GGGCACACCT CGACCGACGC CGATGTCGAC
GCCGTCCTCG ACGTGATCGG CCCGGTGGTG GAGCGTGCCC GCCGGGCCGG GATGGGCAGG
CGATGA
 
Protein sequence
MSSSEAVYLD HAATTPMLPA VLAAMTGQLG RVGNASSLHA SGRAARRVAE QSRERLAEAL 
GARPSEVLFT GGGTESDNLA VKGLFWARRD ADSRRRRIVV SPAEHHAVLD SVEWLTKHDG
ADVTWLPVEP TGRVTPEALH AALGSGEDVA LVSVMWANNE IGTVSDLAAL AEVAHDVGVP
LHTDAVQAVG QVPVDFAASG VDALTMTGHK LGGPMGAGVL LLRREAECTP LLHGGGQERD
VRSGTLDVAA IVGLQVATTL AVAEREDRAA RLAALRDRLV SGVVAQVPDA QLNGPPLDDV
VAGGPGRLPG NAHLSFPGAE GDALLMLLDA RGVECSTGSA CSAGVARPSH VLLATGADPD
RARSSLRFSL GHTSTDADVD AVLDVIGPVV ERARRAGMGR R