Gene Anae109_4158 details

Gene Information

Locus tag: Anae109_4158
Symbol:
ID: 5376315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4874520
End bp: 4875686
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 73%
IMG OID: 640845685
Product: HNH endonuclease
Protein accession: YP_001381320
Protein GI: 153006995
COG category:
COG ID:
TIGRFAM ID:
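
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4874520
END_BP = 4875686
GENE_LENGTH_BP = 1167
PROTEIN_LENGTH_AA = 388


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the reported gene length (1167 bp) and protein length (388 aa) agree with the coordinates.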


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0493415
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGA TCGCCCCTTC CGCCCTCGAC TCGACCCTGC TCGCCCAGCG CCTGCGCGAG 
CTCGCAGGCC AGGAGCGCGA CGTCCAGGTC GAGTTCCTCC TCCACCTCGA GGAGTTCGAT
CGCCGCCGCG CCTACGTGGA GGCCGGCTAC CCCTCGCTCT GGGCGTATTG CCTGGAGGTG
CTCCACCTGC GCGAGGGCGC TGCCGGGCGA CGCATCCAGG CGATGCGGGT GCTGTGCCGG
TTCCCCAGCC TCGAGGACGC CCTGCGCGAC GGGCGCCTGG GTTTGTCCAC CGTCCAGCTG
CTCGGCCAGG TGCTGACCGA GGAGAACCTG CCCGACCTCG TCGGCCGTGC CGCCTACCGC
ACCAAGGCCG AGGTGGATCA CCTCGTCGCC TCGCTCCAGG CGCGCACGGC TCCGCGGACG
GGCCTGCGCA AGCTGCCCGA CCGCGCCTCA GCCGCGAGCG CCCCGGCGCT GCCGCTGGCG
ACAGTCCATG CCGGACCTGC CGAGCCGCAG GAGGCGATCC CCGCGCCGGC GGCGGCTGGT
GGGTCGCTGC CGCCCACGGT CTCCGCGCTG CCCGACGTTC CTCGCCCGAA GGCGCGGGCG
GAGACCCGCG CCGTGAGCGA GAGCGGCTGG TCGCTGCGGG TCACCATCGA CCGGGGCTGC
AAGGAGGACC TCGAGACGCT CACCGCGCTG CTCTCGCACA AGATCCCGGA CGGCGATCTC
GCGGCGGTGC TCCGGGAGGC CATCCGCTGC GCCGTCGAGA AGCACGGCAA GCGCAAGGGC
GCGATCGCGC CGGAGCGGCA GCGGAAGGCC GACCGGGAGA CACGTCCCTC CGCCGAGCCC
GCCGCGCCCA CGAGCACGAT CCCGGCGATA GTGCGGCGCG AGGTCTGGAA GCGCGACGGC
GGACGCTGCG CCTGGGTCGC TCCGGACGGG CGGCGCTGCG ACAGCCGCTG GCAGCTGGAG
CTCGACCACA TCCAGCCGCT CGCTCTGGGG GGGCTCTCGA CGCTCGACAA TCTCCGGGTC
GCCTGCAAGC CCCATAACCT GTTGCACGCC GAACAGACCT ATGGGCGCGA GCACATGGAT
CGTTTCCGGC GTGAGAGCGT CTCCGAGCGG ACGGGGCATG CCGGCACCGC GCCAGCTGCC
ATTCAGCAGG GCTTGTGGGC AACGTGA
 
Protein sequence
MPAIAPSALD STLLAQRLRE LAGQERDVQV EFLLHLEEFD RRRAYVEAGY PSLWAYCLEV 
LHLREGAAGR RIQAMRVLCR FPSLEDALRD GRLGLSTVQL LGQVLTEENL PDLVGRAAYR
TKAEVDHLVA SLQARTAPRT GLRKLPDRAS AASAPALPLA TVHAGPAEPQ EAIPAPAAAG
GSLPPTVSAL PDVPRPKARA ETRAVSESGW SLRVTIDRGC KEDLETLTAL LSHKIPDGDL
AAVLREAIRC AVEKHGKRKG AIAPERQRKA DRETRPSAEP AAPTSTIPAI VRREVWKRDG
GRCAWVAPDG RRCDSRWQLE LDHIQPLALG GLSTLDNLRV ACKPHNLLHA EQTYGREHMD
RFRRESVSER TGHAGTAPAA IQQGLWAT
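
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of that cleanup, with helper names that are hypothetical; it uses only the last line of the gene sequence as sample input, and it assumes the reading frame starts at the first base of the gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def split_codons(seq: str) -> list[str]:
    """Split a cleaned sequence into consecutive triplets from position 0."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


if __name__ == "__main__":
    # Last line of the gene sequence as printed in this record.
    last_line = "ATTCAGCAGG GCTTGTGGGC AACGTGA"
    tail = clean_sequence(last_line)
    print(split_codons(tail))
    print(round(gc_fraction(tail), 2))
```

Pasting the full gene sequence into `clean_sequence` would allow the same helpers to be used to recompute the GC content field; the final triplet of the sample line is TGA, a stop codon in translation table 11.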