Gene Sked_35950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSked_35950 
Symbol 
ID8635228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSanguibacter keddieii DSM 10542 
KingdomBacteria 
Replicon accessionNC_013521 
Strand
Start bp4021060 
End bp4022946 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content73% 
IMG OID 
ProductHNH endonuclease 
Protein accessionYP_003316315 
Protein GI269796860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.391714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.627541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAG ACCTCTCCTG GCGGGACGCG TGCCGAGAGG AGCTGTCCGA GCGCGTGGAC 
GAGCTCGAGG CTGCCGAACG GGCGATCGCC TCCGCGCACG CGCGCAGGAT CAGGGCGATC
GAGGCGGTCC GCGTGCTCGC GGTCGAGACC GTGCGGGGCC TGGACCCGAC GCTCGGACCG
GTCGACCCTG CCGACGCCGT GCGGGTCCGC GAGATCCGGA GGCAGCAGGC GCTCGCGCGC
CGTGCTGCGC ACGCGGAGGT CGCGTGCGTC CTGCGGACGC CGGAGATGAC CACGACCTCC
TTGGTGCACG AGGCCCAGGT CCTGGTGGAC CACCACCCCG CGACCTTGGC GGCCCTGACG
CGCGGCGAGA TCTCGCGCAC GCACGCCAGG GTGGTGCTCG AGAACACCGC TGCCCTGGAA
CCTGACGAGT GCCGCGACCT CGAGGTGACG CTCGTCGAGC GGGCGAAGGA CAGCACGGTC
GCAGCCCTGC GCAGGTACGC GCGACGCCAG CGCGAGCGTT CCCACCCCCG GCCTCTCGTC
GAGCGTCACC GCGAACGGCT GGCAGAGCGT CGCGTCGAGA TCGAGCCGGC GCGCGACGGC
ATGATGTGGC TGCACCAGTA CCTGCCAGCC GTCCAGGCGA CGGCGATCTA CAACAGGCTG
ACGGACGTCG CGGTGACCTT CCAGGGGAAG GACGAGCAGG GGATGAGCGA GGACAGGACG
CTGGCGCAGC TCCGCGTCGA CGTCTTCAGC GCGCTCCTCC TCGACGACGA CGCGGCACGG
CTCGTCCACG GGGACACCGG GCCGGGGCTC TCGGCTGAGC CTGGACCGGG CGCAGCCGCG
AAAGCGCCGG GCGCAGCCGC GAAAGCGCCG GGCGCAGCAG CGGAAGCGCC GGGCGCAGCA
GCGGAAGCGC CGAGCGCATC CGAGGCAGCC CCGAGTCCGG GAGAGGTAGA CCGGGGTGTG
GGAGAGGTAG CCCGGGGTGC GGGAGAGGTG GCCCCGAGTG CGGGAGGGGT AGCCCCGAGC
AGGGCAGGGG CACTCACGGG CAGGGGAGAG CCAGCCTCGA GCACGGCTGG GTCAGTCCCG
AGCGGGGCAA GGACAGCCCC GAGTCCGCCA GGGGTGGCTC CGAGCGTGAC CGACGGAGTG
ATCGGCGGTC GGCACCGAGC CGTGGGACCG TCGCTGCGCG GTGTCCAGCC GACCGTGGCG
GTCACCGTCC CGGTCATGAC TCTCCTCGGC CACGGAGACG AGCCCGGGCA CCTCGAGGGC
TACGGACCGG TCGACGCGGA CACGGCGAGG GAGATCGCGG CTCGTGCCGC GTCGTTCACC
AGGATCCTCA CCCATCCGGA GACGGCTGTC GTGCTGTCGG TGGGACGGCA GAAGTACGCC
GTGCCGGCCG ATCTCAAGGC CTGGCTGAGG CTCCGAGACG AGACCTGCCG CTTCCCGGGC
TGCGGCCGAC GGGCGGCGAG GTGCGACATC GACCACGTCG CGCCCTGGCA GCTCGGCGGA
GGGACGGACC ACGACAACCT GATCCACCTG TGCCGACATC ATCACCGGCT CAAGCACGAG
ACAGGCTGGT CGGTCGCCAG CGCCACCTCG GGGGGCGAGA CGACAGGTGT GTTCGCGCGC
CCTGAGGCGG TGACGTGGAC CTCGCCAGCC GGCCGTCGGT ACGTCGACCA CCCGGCGCTG
CCCCGGCCAG CCCATGATCT GCCGCGGCAC CCCACCGGCC TCGTCGACCT CGGCGAACGC
AGGAGTGGTC CGTCCGGCGG CGTCGCCGCA GGAGATGACG CCGCAGGAGG TATCGATGGC
GCTAAGTCGT CGTCGTCGTC CGACGAGGAG AACGCGGAGG CTCCCGGGCT GACGCCCGAT
CCCTTCCCCG ACGAGCCGCC GTTCTGA
 
Protein sequence
MARDLSWRDA CREELSERVD ELEAAERAIA SAHARRIRAI EAVRVLAVET VRGLDPTLGP 
VDPADAVRVR EIRRQQALAR RAAHAEVACV LRTPEMTTTS LVHEAQVLVD HHPATLAALT
RGEISRTHAR VVLENTAALE PDECRDLEVT LVERAKDSTV AALRRYARRQ RERSHPRPLV
ERHRERLAER RVEIEPARDG MMWLHQYLPA VQATAIYNRL TDVAVTFQGK DEQGMSEDRT
LAQLRVDVFS ALLLDDDAAR LVHGDTGPGL SAEPGPGAAA KAPGAAAKAP GAAAEAPGAA
AEAPSASEAA PSPGEVDRGV GEVARGAGEV APSAGGVAPS RAGALTGRGE PASSTAGSVP
SGARTAPSPP GVAPSVTDGV IGGRHRAVGP SLRGVQPTVA VTVPVMTLLG HGDEPGHLEG
YGPVDADTAR EIAARAASFT RILTHPETAV VLSVGRQKYA VPADLKAWLR LRDETCRFPG
CGRRAARCDI DHVAPWQLGG GTDHDNLIHL CRHHHRLKHE TGWSVASATS GGETTGVFAR
PEAVTWTSPA GRRYVDHPAL PRPAHDLPRH PTGLVDLGER RSGPSGGVAA GDDAAGGIDG
AKSSSSSDEE NAEAPGLTPD PFPDEPPF