Gene AnaeK_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_2101 
Symbol 
ID6788106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp2359492 
End bp2361222 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content78% 
IMG OID642763561 
ProductHNH nuclease 
Protein accessionYP_002134457 
Protein GI197122506 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGCGC TCGTTCGCCA GCAAATCCAG CCGCTTGACG CCCCGTCAGG GCTGGAGGCG 
CTCGCGGCGC AGGCCTGGGT GCTGGAGGTG CCCCGGCCGA CCGAGCGGCG GTTCATCCTG
CGGCCGGAGG CGGCGGAGCT GGTGGACGGG CTGCTGGCCC GGGTGGCCCG CGGCGCGGGG
GCACTGGACG TGGCGCTGGG CCGTGGCCTG CGCGCGGTCG AGAAGGCCGG CGGTCCGCTT
CGCCTGGGGT ACTCGAGCCT GGGCGACTAC GCGCGCGAGC GGCTGGGGCT CCCGGAATCG
ACCTCCCGGC GGCTGGCGCG GCTCTCGGCG GGTCTGGACG AGCGGCCGCT GCTCGATGCG
GCGGTCCGGG CGGGCGAGGT GAGCCTGCGG AAGGCGCAGG TCATCCTCGG CGTGGCCCGG
GGGGCGGACG AGGCGCGCTG GGTGGCGCAG GCGCGGGACG CGACGGTCCG GGCGCTCGCC
GCCGCGGTGC GCGCGGAGCG GGGCGGCGAT GGCGGGACGG CGGACGCGGG CGAGGGCGCC
GAGCCGCTGG TGCCGCTCGA GCTGGAGATC TCCGAGGACG ATCGGGTGGC GCTGCGCGAG
GCGCTGTCGC TGGCGGGCAC GACGCTGGGC GCGACGGCGC CGCCCTGGCA GCGGCTCGAG
GCGCTCTGCC AGGAGTACCT CGCCTCGCAC CCGGAGCCGG AGCGGCTCCG GCTCGAGGAC
CTCGACGCGG ACGCGGCGAC GGCCGCGGGC GCGCTGGAGG GCGCGCCGCG GGGGCGGGGC
AGCGAGTGGT GGGACGCGGC GCGGCTCGCG CTGGAGGAGG AGACGGAGCG CTGGAGCTAC
CTCGAGCGGC TCGAGCCGCT CCCGGCGCCG GATCCGGCGG GCGGGCTCGC GCCAGGCGAC
CTGCAGGCCC TCGATGCGCG TCTGTGCGAG CTCGCCGCGA TGCGCGCGCG CTGGGACGAG
CTGGTGGGCC ACCTGGGCCT GCTCATGCGC TCCCTTGGCC TCTGGCGGGA GGCGGGCTTC
GCCTCCTTCG GCCACTACTG CGCCGAGCGC CTGGGCCTCT CGCTCCGGGC GGTCGAGCAG
CGCATCGCGC TGGAGCGACG CCTTCACGAG CTGCCGCCGC TCCGCGCGGC GCTGGCGTCG
GGGCGGGTCT CCTACGGCAA GGCGGTCGTG GTGGCGGCGG CTGCGGACGA GGACACGGTC
GAGGCCTGGA TCGCGCGCGC GGAGACCACG CCGTGCGCGG CGCTCCGGCG CGAGGCGGAG
TCGGCGGAGG ACGCGCAGAT GTGTGCGCGG CGCGCGTGGA AGGCCCGGCT GCCCGCGCGG
GTGGTGAACC TGCTCGACGC CGCCCTGGGC GCGGCCCGCC TGGCCGCGGG GAGGCCGGTC
CGGGACGGCG AGTGCCTGGG GATCATCGCG CGGCACTTCA TCGACACCTG GAAACCGTCG
TTGCGCGGCC GGCGCACGCT GGCCCACCGG GTCCTGGAAC GCGACGGCGG GCTCTGCCTT
GCGCCGGGGT GCACCCGCGC GGCGGACCAC GCGCATCACC TGTGGCAGCG CGCGCACGGT
GGACCGGACG TTCCGTGGAA CCTCGCCTCG CTGTGCGCGC CGCACCACCT CGTCGCGATC
CACGGGGGCT TCCTGCGCGT GCGCGGGAGG GCGCCGCACG CGCTGGAGTG GAAGTTCGCG
GGGTCGGCGC CGGTGGGGAG CGGGCGCGGC GGAGGCGTCG GGTCCGGCTA G
 
Protein sequence
MDALVRQQIQ PLDAPSGLEA LAAQAWVLEV PRPTERRFIL RPEAAELVDG LLARVARGAG 
ALDVALGRGL RAVEKAGGPL RLGYSSLGDY ARERLGLPES TSRRLARLSA GLDERPLLDA
AVRAGEVSLR KAQVILGVAR GADEARWVAQ ARDATVRALA AAVRAERGGD GGTADAGEGA
EPLVPLELEI SEDDRVALRE ALSLAGTTLG ATAPPWQRLE ALCQEYLASH PEPERLRLED
LDADAATAAG ALEGAPRGRG SEWWDAARLA LEEETERWSY LERLEPLPAP DPAGGLAPGD
LQALDARLCE LAAMRARWDE LVGHLGLLMR SLGLWREAGF ASFGHYCAER LGLSLRAVEQ
RIALERRLHE LPPLRAALAS GRVSYGKAVV VAAAADEDTV EAWIARAETT PCAALRREAE
SAEDAQMCAR RAWKARLPAR VVNLLDAALG AARLAAGRPV RDGECLGIIA RHFIDTWKPS
LRGRRTLAHR VLERDGGLCL APGCTRAADH AHHLWQRAHG GPDVPWNLAS LCAPHHLVAI
HGGFLRVRGR APHALEWKFA GSAPVGSGRG GGVGSG