Gene Gobs_4101 details


Gene Information

Locus tag: Gobs_4101
Symbol:
ID: 8755792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 4311404
End bp: 4313236
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: trehalose synthase
Protein accession: YP_003411037
Protein GI: 284992483
COG category:
COG ID:
TIGRFAM ID:
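
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted as a residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 4311404
end_bp = 4313236

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1833 610
```

Under these assumptions the result agrees with the 1833 bp and 610 aa shown in the record.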


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGTTC CCGTCCCCGA CCTCCCAGCC AGCCGCCCCG AGGCGCCGGC GTCGGCGCTG 
CCCGCACCGG GCACCGACCC CGAGTGGTAC AAGCGCGCCG TCTTCTACGA AGTGCTCGTC
CGCGGGTTCG CCGACAGCAA CGCCGACGGC GTCGGCGACC TGCGCGGCAT GATCGACAAG
TTGGACTACC TGCAGTGGCT CGGCGTCGAC TGCCTGTGGC TGCCGCCGTT CTTCGCCTCG
CCGCTGCGCG ACGGCGGTTA CGACGTCAGC GACTACACCG CCGTGCTGCC CGAGTTCGGC
GACATCGACG ACTTCACCGA GTTCCTCGCC GCGGCCCACG CCCGCGGCAT CCGCGTGATC
ATCGACTTCG TCATGAATCA CACCTCGGAC CAGCACCCCT GGTTCCAGGC CAGCCGCAGC
GACCCCGAGG GGCCCTACGG CGACTTCTAC GTGTGGTCCG ACGACGACAC CCGCTACGCC
GACGCCCGGA TCATCTTCGT CGACACCGAG AGCTCGAACT GGACGTTCGA CCCGGTGCGC
AAGCAGTACT TCTGGCACCG GTTCTTCTCC CACCAGCCCG ACCTCAACTT CGAGAACCCG
CGGGTGATCG AGGCGATCAT GGACGCCCTG CGGTTCTGGC TCGACCTGGG CATCGACGGT
TTCCGGCTGG ATGCCGTGCC CTACCTGATC GAGGAGGAGG GCACCAACTG CGAGAACCTG
CCGGGGACCC ACGAGATCCT CAAGCAGGTC CGCAAGGTCG TCGACGCCGA CTACCCCGAC
CGCGTGATGC TGTGCGAGGC CAACCAGTGG CCGGCCGACG TCGTCGAGTA CTTCGGCGAG
GACGGCGACG AGTGCCAGAT GGCCTTCCAC TTCCCGGTGA TGCCGCGCCT GTTCATGGCC
GTGCGGCGGG AGCAGCGCTT CCCGATCTCG GAGATCATGG CGCAGACGCC GGAGATCCCG
GACAACTGCC AGTGGGGCAT CTTCCTGCGC AACCACGACG AGCTGACCCT GGAGATGGTC
ACCGACGAGG AGCGCGACTA CATGTGGGCG GAGTACGCCA AGGACCCCCG CATGAAGGCC
AACATCGGCA TCCGCCGCCG GCTGGCGCCC CTGCTGGACA ACGACATGGA CACCCTCGAG
CTGTTCAACG CGCTGCTGCT GTCGCTGCCC GGCTCGCCGG TCCTGTACTA CGGCGACGAG
ATCGCCATGG GCGACAACAT CTGGCTCGGT GACCGTGACG GCGTCCGCAC GCCCATGCAG
TGGACGCCGG ACCGCAACGG CGGCTTCTCC ACCGCCGACC CGCAGCGGAT GAACCTGCCG
CTGAACCAGG ACCCGGTCTA CGGCTACCAG GTCACCAACG TCGAGTCCCA GCTGCGCAAC
ACCAACTCGA TGCTGCACTG GCTGCGGCAG ATGATCCACG TGCGCAAGCA GCACCCGACC
TTCGGCCGGG GCAGCTACGC CGAGATCGGC TCGCGCAACC CGACGGTGCT CTCCTTCGTC
CGCGAGTTCG GCGACGACGT GGTGCTCTGC GTCAACAACC TCTCCCGGTT CCCGCAGCCG
GTGGAGCTGG ACCTGCGCCG CTTCGAGGGC TACACCCCGA TCGAGCTGAC CGGCCGGGTG
GAGTTCCCGC AGATCGGCGT CCTGCCGTAC ATGCTCACCC TGTCCGGGCA CGGCTTCTAC
TGGTTCGAGC TGGCCAAGCC CGCCGAGCCG GTGGAGGAAC CCACCGAGAC CGACGACACC
GCGGTGGCCG ACTCCCTGCT GGCCGCGGGC CTCGTCGGCT CCGGCGAGGG ACCAGCTGCG
GGCGACACGG CCGACACGGG AGGAGCCCGA TGA
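
The gene sequence is printed in groups of ten bases, so the spaces and line breaks need to be removed before any calculation such as GC content. The following is a minimal sketch of that step, not the method used to produce the 68% figure in the record. The helper names `clean` and `gc_fraction` are hypothetical, and the sample is only the first two groups of the sequence.

```python
def clean(block: str) -> str:
    """Remove the spaces and line breaks that group bases in tens."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two groups of the gene sequence, as printed in the record.
sample = clean("ATGACCGTTC CCGTCCCCGA")
print(sample, gc_fraction(sample))  # ATGACCGTTCCCGTCCCCGA 0.65
```

Passing the full cleaned sequence to `gc_fraction` would give the value for the whole gene.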
 
Protein sequence
MTVPVPDLPA SRPEAPASAL PAPGTDPEWY KRAVFYEVLV RGFADSNADG VGDLRGMIDK 
LDYLQWLGVD CLWLPPFFAS PLRDGGYDVS DYTAVLPEFG DIDDFTEFLA AAHARGIRVI
IDFVMNHTSD QHPWFQASRS DPEGPYGDFY VWSDDDTRYA DARIIFVDTE SSNWTFDPVR
KQYFWHRFFS HQPDLNFENP RVIEAIMDAL RFWLDLGIDG FRLDAVPYLI EEEGTNCENL
PGTHEILKQV RKVVDADYPD RVMLCEANQW PADVVEYFGE DGDECQMAFH FPVMPRLFMA
VRREQRFPIS EIMAQTPEIP DNCQWGIFLR NHDELTLEMV TDEERDYMWA EYAKDPRMKA
NIGIRRRLAP LLDNDMDTLE LFNALLLSLP GSPVLYYGDE IAMGDNIWLG DRDGVRTPMQ
WTPDRNGGFS TADPQRMNLP LNQDPVYGYQ VTNVESQLRN TNSMLHWLRQ MIHVRKQHPT
FGRGSYAEIG SRNPTVLSFV REFGDDVVLC VNNLSRFPQP VELDLRRFEG YTPIELTGRV
EFPQIGVLPY MLTLSGHGFY WFELAKPAEP VEEPTETDDT AVADSLLAAG LVGSGEGPAA
GDTADTGGAR
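
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below illustrates this on the first and last printed lines of the gene sequence only. It uses the standard codon assignments, which table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA block, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last printed lines of the gene sequence (both are in frame).
first_line = "ATGACCGTTC CCGTCCCCGA CCTCCCAGCC AGCCGCCCCG AGGCGCCGGC GTCGGCGCTG"
last_line = "GGCGACACGG CCGACACGGG AGGAGCCCGA TGA"

print(translate(first_line))  # MTVPVPDLPASRPEAPASAL
print(translate(last_line))   # GDTADTGGAR
```

The two outputs match the first twenty and last ten residues of the protein sequence listed above.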