Gene Gobs_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3785 
Symbol 
ID8755470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp3965252 
End bp3966748 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content74% 
IMG OID 
ProductHNH endonuclease 
Protein accessionYP_003410732 
Protein GI284992178 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0713491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTCAG CCGGTCCCGC GCGTACGCCG CTGGAGGCCG AGCTGGTCGG CCGGCTGCTC 
GAGCGGCCGC CGACCAGCGC GCGGCTGCCT GTGGGGCTGC TGACCCGGGC GGAGAAGGCC
GCCGAGCTGC AGCGCCTGCA GGCCCGCAAG GCGATGGACG CCGCCTACGA GGCCGAGCTC
GTCATGGGCC TGGCCGACGA CACCCCGGAC TCCCTCGACC CGCCGCCGGG CCACCCCGGC
GCCAGGAAGG GCTCGTGGGC CCCGGATCCC GAGCTGCCCG GGGTGAGCGA GTTCTTCACC
TCCGAGCTGG CAGTGGTGCT CAACTGCGGC CGGGGCACCG CCTCCCACCT GGCGCACCGC
GCCTGGACCT ACCGGGGGAA CCTGCCGGCC ACCTGGGCCG CGCTGGCCGA TGGGGTTCTG
GACGAGCCCC GCGCGAAGGT CCTCGCCGAC GTCCTCACCC ACACGACACC GGCGATCGCC
CGGGGAATCG AGTCGCGGCT GCTGCCCGAG GCGCCCGGCC TGTCCACCGG CCGGTTGCGG
GCCCGGGCGC TGGCACTTCT GCTGGAACTC GATACCGACG CCGTCGACGC GCGGCGCAAG
GACGCTCGTC GGCAGGCCGA CGTGCGCTCC TATCCCTCAC ACCTGGAGGG CATGAGCACG
CTGGCTGCGG ACCTGCCCAC CCCGGTGTCG GCCGAGTGCC TCGACGTGGT CGACCGGTTG
GCAGCGATGC TCAAGACCGA TGGCGACCCC CGGCCGATCG GCGAGCTGCG CGCCGTGGTG
CTGGCTGACC TGATCCGCCG TCCCTGGGAC ACCAGCCGGA CGCCGGTCAC AGCTCAGCTG
ACGATCACCG CCGCACTCGA CGCGCTGGCC GGCCGGACCG ACCAGCCCGG GGAGGTCAAC
GGGCAGCCGA TCACCGCCGC CCAGCTGCGC GAGCTGCTCA TCCGGCTCGG TGCCCTGGGG
CTGCAGACAC CCGAGGGCGG CACGGTGACC CTCGCGGTCA CCGACGACGG CGCTCTGGTG
GCCACCACCA CCCTCGACCA GCTGCGCCGT CTGGCCCGCC GTGGCTGCGC CACCCACCAC
GAGCAGGACT GCGGCTGCCC GGTGCTCGAC CGACCGGCAC CCACCGACGC CTACCCACCC
ACCGCCGCCC AGGACGCCTT CGTCACCACC CGCGACCGCG CCTGCCGCTT CCCCAACTGC
GGCCAGCGCG TCGGCTGGAC CGACCGCGAC CACGTCGTCC CGCACGCCGA CGGCGGCGCC
ACTGACTGCG CCAACCTGTG CTGCCTGTGC CGCAGCCACC ACCGCCTCAA GACCCACGCC
CGCGGCTGGC GATTCGCCAT GGACAACGAC GGCGCCCTGC ACGTCACCAC ACCATCGGGC
GTCACCCGCA CCACCCGACC ACCCGGCCTG CGACCATCCC AGCCACCCGG ATCAACAGCG
GCCGCCTCGA CTCCGCCAGC GGTGTCCATC TCGGACGACG ATCCGCCACC CTTCTGA
 
Protein sequence
MRSAGPARTP LEAELVGRLL ERPPTSARLP VGLLTRAEKA AELQRLQARK AMDAAYEAEL 
VMGLADDTPD SLDPPPGHPG ARKGSWAPDP ELPGVSEFFT SELAVVLNCG RGTASHLAHR
AWTYRGNLPA TWAALADGVL DEPRAKVLAD VLTHTTPAIA RGIESRLLPE APGLSTGRLR
ARALALLLEL DTDAVDARRK DARRQADVRS YPSHLEGMST LAADLPTPVS AECLDVVDRL
AAMLKTDGDP RPIGELRAVV LADLIRRPWD TSRTPVTAQL TITAALDALA GRTDQPGEVN
GQPITAAQLR ELLIRLGALG LQTPEGGTVT LAVTDDGALV ATTTLDQLRR LARRGCATHH
EQDCGCPVLD RPAPTDAYPP TAAQDAFVTT RDRACRFPNC GQRVGWTDRD HVVPHADGGA
TDCANLCCLC RSHHRLKTHA RGWRFAMDND GALHVTTPSG VTRTTRPPGL RPSQPPGSTA
AASTPPAVSI SDDDPPPF