Gene AnaeK_3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_3550 
Symbol 
ID6784778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp4012301 
End bp4013551 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content73% 
IMG OID642765021 
ProductHNH nuclease 
Protein accessionYP_002135892 
Protein GI197123941 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00633328 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACACCG CACTCGAGTT CACGAACCGC CTCGTCACCC TGCTCCGCTC CGAGCGCCAC 
GCCATGGCCG AGTTTCTGGT TGCCCTGGCC GAGTTCGAAC GGCGCGGGCT CTACCGGCAG
CGGGGGCACA CCTCGCTGTT CTCGTTCCTG CATCGGGAGC TGAAGCTCTC GGCGGGCTCC
GCTCAGCTCC GCAAGACGGC GGCGGAGCTC ATCAACCGTT TGCCGGCGGT CGAGGGCGCG
CTCCGGGAGG GCAAGCTGTG CCTGTCCTCG GTCTGCGAGC TGGCGAAGGT GGTGACCACC
GAGAACTGCG CGGAGATCCT GCCTCGGTTC TTCGGGCTGT CGAGCCGGGA TGCTGCCGCC
GTGGTCGCTT CCATCCGGCC GGTGGAGAAC CCGCCCCGTC GCGAGGTCGT CGTGCCGATC
CGGGCGACGT CTGCGCCGGC CGCGGTCACC ACCTCTGCCG CGGCTCCGGC GGCGGCCTCG
CGCGACGCTG CATCGCCGGC GCCCGCGCGG GTCGCCTTGT TTCATGCGCA TGAAGTGAGA
GCGCCGCCCT CCGGTCGCGC TGAGCCTGTA GAACGGCGAG CCTCCGAGGC CAGACCCGTC
GCGAAGCCCA CCTCCGTCGA CTGGCTCGAC GCCGACCAGG CGCGGATTCA CCTCACCGTG
TCCAAGGCGT TCCTGAAGAA GCTCGACGCG GGCCGTGATG CGCTCTCACA CTCCATGCCG
GGCGCCTCCC GCGAGAACGT CCTCGAGGCC GCTCTCGACC TGCTCCTCGC CGAGCGCGCG
CGTCGGAAGG GGCTCACCGC GAAGCCGCAG AAGACGGTTC GTCCTTCCCG GCCGGACCAC
GTCCCGGCCC ACGTTCGCCG CGAGGTCTGG GCGCGCGACG GCGGGCGTTG CACCTTCCCC
CTCCCGTCCG GCGAGCCGTG TGGCGCCACG CACCAGCTCG AGCTCGACCA CATCGTGCCG
CGGGCGTGTG GAGGCGCCTC GACGGCCGAC AACCTCCGGA TCCGTTGCCG AGGGCACAAC
CTGGAGGAGG CGCGACGGGT CCTCGGGGAC GAGGTGATGA ACGCGTACGC ACCGAGGAGC
ACGGCCAGCA GGGAGGGGCC CCGGCCGCAG GCCGGGGAGG GACGCAATCC CTCCCCGCGG
GGACCGCTCG CGCGCCCGCG CATGCGGGCG CGAGCGGTCA CGGGCCCCGC GACGCAGGAG
TGGGGCCCCG ACGGCTCCGC CGGCGGGGGG AGGGCGCAGC CCTCGACCTA A
 
Protein sequence
MDTALEFTNR LVTLLRSERH AMAEFLVALA EFERRGLYRQ RGHTSLFSFL HRELKLSAGS 
AQLRKTAAEL INRLPAVEGA LREGKLCLSS VCELAKVVTT ENCAEILPRF FGLSSRDAAA
VVASIRPVEN PPRREVVVPI RATSAPAAVT TSAAAPAAAS RDAASPAPAR VALFHAHEVR
APPSGRAEPV ERRASEARPV AKPTSVDWLD ADQARIHLTV SKAFLKKLDA GRDALSHSMP
GASRENVLEA ALDLLLAERA RRKGLTAKPQ KTVRPSRPDH VPAHVRREVW ARDGGRCTFP
LPSGEPCGAT HQLELDHIVP RACGGASTAD NLRIRCRGHN LEEARRVLGD EVMNAYAPRS
TASREGPRPQ AGEGRNPSPR GPLARPRMRA RAVTGPATQE WGPDGSAGGG RAQPST