Gene Gobs_5017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_5017 
Symbol 
ID8756719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp5236798 
End bp5238552 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content74% 
IMG OID 
ProductGNAT-family acetyltransferase TIGR03103 
Protein accessionYP_003411916 
Protein GI284993361 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCCGCG TCGCGGGGCA CCGGCGCCGG ACCCCGGTGT CCGAGACCCC CTCCCCCACC 
AGCCGTCCCT GGCAGCAGAT CTCCGAGCAC CAGCGGCAGG GGATGGAGTC CGACGTCGTC
CTCGACCTGG CCTGGGGCCG GCTGGTGTTC GGGCAGACCT TCTCCAGCCT CGAGGGCATC
GTGAAGGCGC TGCGCTCGGA GGAGAGCGGC CGGCGCGACA TCTGCATCTA CCCCCGCGAC
CCGCACGTGA TGGTCGGGAT GGCGCCCGAC GAGCTGTTCA TCGACCCCAG CTACGTCTAC
CGGCTGGACC TGTACCGCTA CCGGTCGCGT GCCGAGCTGA TCCGCGGGGT CTTCGTGCGC
ACGGTGACCG CGCTGGACGA GATGCGGCAG ATCAACGACA TCTACGCGCG CAACGGCATG
GTCACCGGGG ACGCCGCCAC CATGTGGGCC AACCACCGCA CCCGCTGCTT CACCTACCTG
GTCGCGGAGG ACCGGCGCAC GGGGCGGATC GTCGGCACGG TCACCGGTGT CGACCACGTG
CTCGCCTTCG GCGACCCGGA GGGCGGCGCC AGCCTCTGGT GCCTGGCCGT GGACGCGCAG
GACGCCCCGC CCGGCACCGG TGAGGCGCTG GTGCGGGTGC TGGCCGAGCG CTACGTCGGC
CGCGGCCGCG CCTACCTGGA CCTGTCGGTC ATGCACGACA ACTCCGGGGC GATCGCGCTG
TACCGCAAGC TCGGCTTCCA CCGGGTGCCC GCGCTGTGCG TCAAGCGGAA GAACCCGATC
AACACCCCGC TGTTCTCCTC CCGCCCGCCG GGGCTGGAGG AGCTCAACCC CTATGCGCGG
ATCATCGCCG AGGAGGCGCT CGAGCGCGGG ATCCGGGTCG AGGTCAGCGA CGCCGAGTGG
GGTGAGATGC GGCTGGCGCT GGGCGGGCGC ACGGTGCTCA CCCGGGAGTC GCTGTCGGAG
TTCACCAGCG CGGTCGCGAT GAGCCGCTGC GACGACAAGC GCGTCACCCG GCGGATCATG
GAGCGGGCCG GCGTCCGCGT GCCCCGCGGT GCGACCGTCA CCGAGGACGA CCGGGAGTCC
GCCGGGGCGC TGCTCGCCGA GTGCGGCGAG GTGGTGGTCA AGCCCGCCCG CGGCGAGCAG
GGGAAGGGCA TCACCGTGGG CGTGACCTCG GCCGACGCGC TGGAGCGGGC GGTCGCGCTG
GCCGCCCAGT TCTGCCCCGA CGTGCTGGTC GAGGAGCTCG TCGAGGGCGA CGACCTGCGG
GTGGTCGTCA TCGACCACGA GGTGGTGGCC GCCGCGGTGC GCCGGCCGGC CGAGGTGGTC
GGCGATGGCC GTAACCCGGT GACCGACCTG ATCCGGTCGA CCAGCCGCCG GCGCGAGCGG
GCCACCGGCG GGGAGTCGCG CATCCCGCTG GACGGTGCGA CCGCCGAGGT GGTCGCCGAG
TCCGGGTACG CGATGGACGA CGTCCCGCCC GCGGGCGAGC GGATCCGGGT CCGGCGGACG
GCGAACCTGC ACACCGGCGG CACCATCGAA GACGTCACCG CGCGGCTGCA CCCGGCGATC
GCGCAGGCCG CCGTCCGGGC CAGCCGGGCG ATCGGCATCC CGGTGACCGG GCTGGACTTC
CTGGTGCCCG ACGTCGAGGG GCCCGACCAC GTGTTCATCG AGGCCAACGA GCGGCCCGGG
CTGGCCAACC ACGAGCCGCA GCCGACCGTG GAGCGCTTCG TCGACCTGCT CTTCCCCGAG
ACCCGCCGGC GCTGA
 
Protein sequence
MPRVAGHRRR TPVSETPSPT SRPWQQISEH QRQGMESDVV LDLAWGRLVF GQTFSSLEGI 
VKALRSEESG RRDICIYPRD PHVMVGMAPD ELFIDPSYVY RLDLYRYRSR AELIRGVFVR
TVTALDEMRQ INDIYARNGM VTGDAATMWA NHRTRCFTYL VAEDRRTGRI VGTVTGVDHV
LAFGDPEGGA SLWCLAVDAQ DAPPGTGEAL VRVLAERYVG RGRAYLDLSV MHDNSGAIAL
YRKLGFHRVP ALCVKRKNPI NTPLFSSRPP GLEELNPYAR IIAEEALERG IRVEVSDAEW
GEMRLALGGR TVLTRESLSE FTSAVAMSRC DDKRVTRRIM ERAGVRVPRG ATVTEDDRES
AGALLAECGE VVVKPARGEQ GKGITVGVTS ADALERAVAL AAQFCPDVLV EELVEGDDLR
VVVIDHEVVA AAVRRPAEVV GDGRNPVTDL IRSTSRRRER ATGGESRIPL DGATAEVVAE
SGYAMDDVPP AGERIRVRRT ANLHTGGTIE DVTARLHPAI AQAAVRASRA IGIPVTGLDF
LVPDVEGPDH VFIEANERPG LANHEPQPTV ERFVDLLFPE TRRR