Gene Gobs_4320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4320 
Symbol 
ID8756014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4540481 
End bp4542400 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycosyltransferase 
Protein accessionYP_003411253 
Protein GI284992699 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCA CTGCGGTGTC CCCCGACGTG GTGGCCGACC TGGCCGCCGA GGACCTGGCC 
CCCGGCCGGG CCGTGACCCT CCTCCAGCGG GTCCTCATGC CGCGCGGCGC CGACCCGCGC
AAGGTCCGCG CCCTCTACCT CGACGAGGTC CGGTCCGAGA AGGTGCACGT CGCCTCCCGG
TCGACCGGGC ACCTCTCGGC CGGTGACGAG GTCTCCTTCG CCACGTACTT CAACGCCTTC
CCGGCGGGTT ACTGGCGGCG GTGGACCTCC CTGACCAGCG TGACGCTGCA GCTGCGCATC
TCAGGCACCT GCCGCATCGA CGTGTACCGC TCCAAGGGCA ACGGCGACGT GCTGCACGCC
CTGGGCACCA CCGTCGAGGG GCAGGGCCGG ACGGTCGAGC TCGACCTGGA CCTCACCCCC
TTCGTCGAGG GCGGCTGGTA CTGGTTCGAC GTCACCGCCG ACGACGACAC GATCATCGAG
GACGCCGGCT GGTACGCCGA CCGGGAGCCG CTGCGCCCGG GGCGGCTGGC CGTCGGCATC
TGCACCTTCA ACCGTCCGGT CGACTGCGTC GCCGCGCTGC AGACGGTCGC CTCCGACCCG
GTGCTCGACG CCGAGCTCGC CGCGGTGGTC GTCGCCGACC AGGGCAACCT CAAGGTGTGC
GACGAGGCCT CGTTCCCCGA GGTGGCCGAC CAGCTCGGCG ACCGCCTGCA CCTGGTGGAG
CAGGGCAACC TCGGCGGCAG CGGCGGCTTC GCCCGTGCCA TGCACGAGAC GCTGACGACC
ACCGACGCCA CGCACCTGGT CTTCCTCGAC GACGACGTCC AGCTCGAGGC CGACAGCCTG
CACCGGGCGC TGACCTTCGC CCGGTTCACC GACGAGCCCA CCCTCGTCGG TGGGCAGATG
CTCACCCTGC AGGATCGCTC GGTGCTGCAC TCCATGGGCG AGAGCATCGA CCGCCGGCTC
ATGAGGTGGC GTCCGGCCCC CTATGCCGCG GCGGGCCACG ACTTCGCGCA CTGGTCGCTG
CGCGACGCCC GGCACCTGCA CCGTCGCGTG GACGTCGACT TCAACGGGTG GTGGATGTGC
CTGATCCCCC GCGAGGTGGC CGAGCACATC GGGCTGCCGC TCCCGCTGTT CATCAAGTGG
GACGACGCCG AGTACGGCCT GCGGGCCGGC GCTGCCGGAT ACCGCACGGT GACCCTCCCG
GGCGCGGCGA TCTGGCACCT GTCCTGGACC GACAAGGACG ACGTCAGCGA CTGGCAAGCG
TACTTCCACG CCCGCAACCA GCTCATCGTG GCCGCCCTGC ACAGCCCGCT GAAGCGGGCC
GAGGACATCG TCCGCGAGAA CGTCCGGGCC GACATCCGGC ACCTGTTCCG GCTGGAGTAC
TCCGCGGTCG CGCTGCACCT GAAGGCCTAC CGCGACTTCC TCGCCGGCCC GCAGGAACTC
TTCCGGCAGC TGCCCGGCGT GCTGGCCGAG GTGCGTGCCG AGCGGGCGCG GTACTCCGAC
GGTCAGGTCA TCACCGAGCG GGCCCGCATC CCGCTGCCGC AGATGGGCCA GGACGCGACC
GAGGGCATGG TGCACCCGCC GGTGGCCAAG CGCGCGATCG CCCGCGCCGC GCTGCAGGCG
CTGCGCAACA ACGTGCGGCC GGTCCAGGAC GCCGACGGCC GGCCCCAGGT GGAGCTGCCG
GCCCGCGACG CCCAGTGGTT CGTGCTGGCG CAGCTGGACA GCGCCTCGGT CGCCACGGCC
GACGGCCGGG GCGTCACGGT GCGCCGCCGC GACCCCGCGA CGTTCTGGCG GCTGGCCCGG
GAGTCGGTGC GGCTCAACCT GGAGATCGCC CGGCGCTTCC CCCGCGCCAA GCAGCAGTAC
CGCGACTCCT ACGGTGACCT GACCTCGGCG GAGAACTGGG TGAGCGTCTT CCAGGCGTGA
 
Protein sequence
MTTTAVSPDV VADLAAEDLA PGRAVTLLQR VLMPRGADPR KVRALYLDEV RSEKVHVASR 
STGHLSAGDE VSFATYFNAF PAGYWRRWTS LTSVTLQLRI SGTCRIDVYR SKGNGDVLHA
LGTTVEGQGR TVELDLDLTP FVEGGWYWFD VTADDDTIIE DAGWYADREP LRPGRLAVGI
CTFNRPVDCV AALQTVASDP VLDAELAAVV VADQGNLKVC DEASFPEVAD QLGDRLHLVE
QGNLGGSGGF ARAMHETLTT TDATHLVFLD DDVQLEADSL HRALTFARFT DEPTLVGGQM
LTLQDRSVLH SMGESIDRRL MRWRPAPYAA AGHDFAHWSL RDARHLHRRV DVDFNGWWMC
LIPREVAEHI GLPLPLFIKW DDAEYGLRAG AAGYRTVTLP GAAIWHLSWT DKDDVSDWQA
YFHARNQLIV AALHSPLKRA EDIVRENVRA DIRHLFRLEY SAVALHLKAY RDFLAGPQEL
FRQLPGVLAE VRAERARYSD GQVITERARI PLPQMGQDAT EGMVHPPVAK RAIARAALQA
LRNNVRPVQD ADGRPQVELP ARDAQWFVLA QLDSASVATA DGRGVTVRRR DPATFWRLAR
ESVRLNLEIA RRFPRAKQQY RDSYGDLTSA ENWVSVFQA