Gene AnaeK_4158 details


Gene Information

Locus tag: AnaeK_4158
Symbol:
ID: 6784173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 4692533
End bp: 4693612
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 72%
IMG OID: 642765625
Product: HNH nuclease
Protein accession: YP_002136490
Protein GI: 197124539
COG category: [V] Defense mechanisms
COG ID: [COG1403] Restriction endonuclease
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.416299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATCG CACTCGAGTT CACGAAGCGT CTCGTCTCTC TTCTCCGCTC GGAGCGGCAC 
GCCATGGCGG AGTTCCTGGT CGCCCTGGCG GAGTTCGACG AGCGCGGGCT GTGGCGGGCG
CGCGGGCACA CGTCGCTCTT TTCCTTCCTG CACCGTGAGC TGAAGCTCTC GGCCGGGTCC
GCGCAGCTTC GCAAGACAGC CGCCGAGCTC ATCCAGCGCT ATCCGGCGGT CGAGGGCGCG
CTCCGGGAGG GCAAGCTCTG CCTCTCCTCC GTCTGTGAGC TGGCGAAGGT GGTGACCACC
GAGAACTGCG CGGAGATCCT GCCGCGGTTC TTTGGGCTAT CGAGCCGCGA CGCCGCTGCG
GTGGTTGCCT CCATCCGGCC GGTCGAGAAC CCTCCGCAGC GAGAGGTCGT CGTGCCGGTC
CGGATCGCGG CCCCTCCTGC GGTTCCGGCG TCGTCATCGC GTGATGCCGC GGCGCCGCTA
CCCGCACGGA GCGTTCTATT TCATGCGCAT GAAACGCCGC CTGCGTCCGC TGCCACCGAG
CCGGCTACGC CGGCTCCGCG GGCGGCCTGC CTCGCGGTAC CGATCGCGAA GCCCAGCTCG
GTGGACTGGC TCGCCTCCGA TCAGGCCCGG ATGCACCTCA CGGTCTCGAA GGCGTTCCTG
AGGAAGCTCG ACGCGGCGCG GGACGCGCTT TCGCACGCCA TGCCCGGTGC CACCCGCGAG
GACGTCCTCG AGGCGGCGCT GGATCAGCTC CTTGCCGAGC GCTCACGCCG CAAGCGCCTC
ACGGCGAAGC CGCAGAAGAC GGTCCGCGCC TCGCAGCGGC GGGAGCACAT CCCTGCGCAG
GTCCGGCGCG AGGTCTGGGA GCGCGACGGC GGGCGGTGCA CCTTCGCCCT CGCCTCGGGC
GAGCCCTGCG GCTCCACGCA CCGGCTCGAG CTGGACCACA TCGTCCCGCT CGCGCGCGGC
GGGCCCTCGA CGGCGGACAA CCTCCGCATC CGTTGCCGGG GCCACAACCT CGAGGAGGCG
CGGCGGGTCC TCGGGGACGC GCTCGTGGAC GCCTATGCCA GCAGGCGGCG CTGGGGATGA
 
Protein sequence
MDIALEFTKR LVSLLRSERH AMAEFLVALA EFDERGLWRA RGHTSLFSFL HRELKLSAGS 
AQLRKTAAEL IQRYPAVEGA LREGKLCLSS VCELAKVVTT ENCAEILPRF FGLSSRDAAA
VVASIRPVEN PPQREVVVPV RIAAPPAVPA SSSRDAAAPL PARSVLFHAH ETPPASAATE
PATPAPRAAC LAVPIAKPSS VDWLASDQAR MHLTVSKAFL RKLDAARDAL SHAMPGATRE
DVLEAALDQL LAERSRRKRL TAKPQKTVRA SQRREHIPAQ VRREVWERDG GRCTFALASG
EPCGSTHRLE LDHIVPLARG GPSTADNLRI RCRGHNLEEA RRVLGDALVD AYASRRRWG
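
The length fields in this record are consistent with each other: End bp - Start bp + 1 = 1080 bp, and 1080 / 3 = 360 codons, which gives 359 amino acids once the terminal stop codon (TGA) is excluded. A minimal Python sketch of that check is given here, assuming the sequences are pasted in the space-separated blocks of ten used on this page. The function names (clean, gene_length, gc_fraction, translate) are hypothetical and not part of any database tooling. The codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons; alternative start codons are not handled.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = ("ATGGACATCG CACTCGAGTT CACGAAGCGT "
              "CTCGTCTCTC TTCTCCGCTC GGAGCGGCAC")

print(gene_length(4692533, 4693612))  # 1080
print(translate(first_line))          # MDIALEFTKRLVSLLRSERH
```

Passing the full gene sequence to translate and gc_fraction in the same way allows the Protein Length and GC content fields to be compared against the sequence itself.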