Gene Gobs_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1451 
Symbol 
ID8753116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp1499643 
End bp1500758 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content72% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003408552 
Protein GI284989998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCAGAC CCCATCTGGA GCTCGGCACC CACGGCCGGG TCCGCGTCTA CCCGGATCCG 
GCCGGGTACC GGGCGGTGTG TCTGTACCGG GACTGGGACG GGGCCACCCG GCAGGTGCAA
CGGCAGGCCA AGACCAAGGG GGCGGCGGAG CGGGCCCTCG CTGTGGCGCT GCGAGATCGG
GGACGTCCGG GAACGGGCCA CGAGATCACT CCGGACACCA AGGTCGCCGA CCTCGCGGCG
AAGTGGTTCA GCGAGCTCGA GGGCAAGAGC CCGTCGACGA TGCAGGCCTA CCGTGACCGG
CTCGATCGGC AGGTCCTCCC GGCACTGGGC AGCGTGCGGG TGCGTGAGCT CAGCGTCGGG
TTGCTCGATC GTCACCTGGC GGCCGTGCGG GCGTCGCACG GCCCGGCGCT GGCGAAGATG
ACCAAATCGG TGATCAGCGG CATGTGCGGC CTGGCCTGCC GCCACGACGC CCTGAAGGCC
AACCCCTGCC GGGACGTGGC GCGCATCCCC AGCCAGACCC GGCGGGCGCC GCGGGCGCTG
ACCGCGGACG AGGTCAGGTC GGTGCGGGCA TGGCTGAGCG AGGACGCGAC GGCTCGCGAG
CGGGATATGC CGGACCTCGT GGCGTTCATG GTCGCCACCG GTCTCCGCAT CGGCGAGGCC
TGTGCAGTCA GCTGGCCGGA CGTGGACCTC GATGCCGACA CCGTCACGGT CACGGGGACG
GTGCTGCGGG TCAAGGGTCA GGGCCTGGTC GTCAGCCAGC CGAAGTCGAT GGCGGGGGAG
CGGGTGCTGG AGCTGCCGAG CTGGTGTGTC GCGCTCCTGC GGCGGCGCGG GCCGTCGAGC
GGACCGGTCT TCCCCGCGCC GCGCAGCCGC AAGCTGCGCG ACCCGAACAA CACCCGCCGG
GCCCTTCGCG AGGCGTTCCA TGCAATGGGG ATGCCGGGCG TCACCTCCCA CGCCTTCCGC
AAGACCGTCG CCACGCTCAT GGACGAGGCG GGGTTGTCCG CCAGGAGCGC GGCCGACCAG
CTGGGGCACG CCAAGCCGTC CGTCACGCAG GACGTCTACT ACGGCCGCAG GAGGCGGGCC
ACCGGAGCGG CTCAGGTCCT CGAGCAACTG GCTTGA
 
Protein sequence
MARPHLELGT HGRVRVYPDP AGYRAVCLYR DWDGATRQVQ RQAKTKGAAE RALAVALRDR 
GRPGTGHEIT PDTKVADLAA KWFSELEGKS PSTMQAYRDR LDRQVLPALG SVRVRELSVG
LLDRHLAAVR ASHGPALAKM TKSVISGMCG LACRHDALKA NPCRDVARIP SQTRRAPRAL
TADEVRSVRA WLSEDATARE RDMPDLVAFM VATGLRIGEA CAVSWPDVDL DADTVTVTGT
VLRVKGQGLV VSQPKSMAGE RVLELPSWCV ALLRRRGPSS GPVFPAPRSR KLRDPNNTRR
ALREAFHAMG MPGVTSHAFR KTVATLMDEA GLSARSAADQ LGHAKPSVTQ DVYYGRRRRA
TGAAQVLEQL A