Gene Gobs_2220 details


Gene Information

Locus tag: Gobs_2220
Symbol:
ID: 8753891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 2306165
End bp: 2307772
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: HNH endonuclease
Protein accession: YP_003409274
Protein GI: 284990720
COG category:
COG ID:
TIGRFAM ID:
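
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2306165, 2307772)
print(length)                     # 1608
print(protein_length_aa(length))  # 535
```

Both values agree with the Gene Length and Protein Length fields of this record.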


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.176223
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGTACC CCCGGCCTAG CGTGAGGGCG TGTTCCCTCG GCGGGGGTTT CGGGGTCGGC 
CTGACGGTGG CCGACCGGAC GCCGCCGCCC CCTGGGTTCG GCCCACCGGC CAACTCCCCA
CCGGCGCTGG CCGAGGTGCT GCCGGTGTAC GCGCGGACGG CGGAGGAGAA GGTCGCGGAG
CTGCAACGGG TGCAGCAGCT GGAATCCGGA CTCGCCGCCT ACAAGCTGGA GCTGATCGCG
TCGTTCGCTG CCGACCGCCC GGCTCAGCTC GATCGCCGGC CCGGGCAACC GGGCGCCGCC
GCCGGAGATG ACTCGACGCC GGACGGTGTG TCGGAGTTCT TCGCCGACGA GCTGGCGTTG
ACGCTGAACT GCGCCCGGGC GTCGGCGACC ACGCTGACCG AGCACGCGCT CACGCTCACC
GGCTCGCTGC GGGCCACGCT GGAGGAGCTG GCGCAGAGCC GACTGGACTG GCCCCGCGCC
CGGACCATGG CCGAGGAGCT GGGCGAGAAG GTGGGCGGCA CTCACCCGCA GGTGATCGCC
GCGGTCGAGG CCGCGGTGCT GCCCGAGGCG CCGTCGCTGT CCGTCCGCCG GCTCAAGGAC
CGGCTGCGCC AGGAGCTGGC CGCCCGGGAC GCCGCGGCCT CCGACCGGGC GCGCGAGGAT
GCCCAACGGG CGGTGACCGT GCGCCGCCGG CCGGTGGGCG GCGGCGTCAG CGAGCTGATC
GCCGGCATGC CCGACGAGCT GGCCGCGGCG TGTCAGGCGA CGATCGACGA GCTGGCCTGG
AGGGCGAAGA AGGCCGGCGA CGACCGTCCG ATCGGGATGC TGCGGGTCGG GGTGCTCGCC
GACCTGATCC AGCGGCCCTG GCTGGTGCCC GAGCCGGTGG CCGCCCACGT CGAGGTGCAG
GTGCCGCTGC GTGCGCTCAC CCCCGGCGGG TTCCTGGCGC AGGGCTCCCC GCTGCCGCCG
GCCTACACCC GGCCGGGGTC GGTGGCCGGA CCCACCGGCG CGGTGGCGGG GGTACCGATC
ACCGCCGCGC ACGTCCGCAA CCTGCTCGCC CAGTTTGACG CGATCGGCCT GCAGGCCCCG
CCGGGTGGGT CGATCAGCTT CTCCTTCGCC GACGACCGCG GGGCGCTGCG GGCGGTCGCG
ACCCTGCGCG AACTGCGGCA GGCTGCCAGC CGGGGTTGCC CTGTCCACCG CGACGGCGCC
TGTGACTGCG CGGTCATCGA CCGCCCCGAG GCCACCGACG CCTACGCACC CACCGCCGCG
CAAGGCCGCT TCCTCACCAC CCGCGACCGC ACCTGTCGGC ACCCGGGCTG CAGCAACCGC
GCCGGGTGGG CCGACGCCGA CCACGTCATC CCCTACGCCC AGGGCGGAGA GACCGACTGC
GCCAACCTGT GCTGCCTGTG CCGCCGGCAC CACCGGCTCA AGACCTTCGC CCCGGGCTGG
ACCTACGCCA TGACCGCCGA CGGCATCCTC ACCGTCACCA CACCCGCCGG CGTGACCCGC
ACCAGCCGAC CACCTGGCCT GCACCTCACC GGCCCCCGAG TGCTCACCCG GCCACCGGAC
CAGCCGCCAG CGGCACCCGA CCCCGCCGAC GACCCACCAC CGTTCTGA
 
Protein sequence
MSYPRPSVRA CSLGGGFGVG LTVADRTPPP PGFGPPANSP PALAEVLPVY ARTAEEKVAE 
LQRVQQLESG LAAYKLELIA SFAADRPAQL DRRPGQPGAA AGDDSTPDGV SEFFADELAL
TLNCARASAT TLTEHALTLT GSLRATLEEL AQSRLDWPRA RTMAEELGEK VGGTHPQVIA
AVEAAVLPEA PSLSVRRLKD RLRQELAARD AAASDRARED AQRAVTVRRR PVGGGVSELI
AGMPDELAAA CQATIDELAW RAKKAGDDRP IGMLRVGVLA DLIQRPWLVP EPVAAHVEVQ
VPLRALTPGG FLAQGSPLPP AYTRPGSVAG PTGAVAGVPI TAAHVRNLLA QFDAIGLQAP
PGGSISFSFA DDRGALRAVA TLRELRQAAS RGCPVHRDGA CDCAVIDRPE ATDAYAPTAA
QGRFLTTRDR TCRHPGCSNR AGWADADHVI PYAQGGETDC ANLCCLCRRH HRLKTFAPGW
TYAMTADGIL TVTTPAGVTR TSRPPGLHLT GPRVLTRPPD QPPAAPDPAD DPPPF
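
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. The following is a minimal sketch of that translation, not the database's own method. It assumes the NCBI table 11 codon assignments and start-codon set as written out in the constants below, and it reads an initiating start codon (here GTG) as M. The name `translate_cds` is hypothetical.

```python
# Codon table built in TCAG order; the amino-acid string is assumed to be
# the NCBI translation table 11 assignment ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for table 11; in the first position they encode M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_block = "GTGTCGTACC CCCGGCCTAG CGTGAGGGCG"
print(translate_cds(first_block))  # MSYPRPSVRA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in the same way allows the whole listing to be compared.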