Gene Anae109_3850 details


Gene Information

Locus tag: Anae109_3850
Symbol:
ID: 5378032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4491072
End bp: 4492457
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 73%
IMG OID: 640845375
Product: HNH endonuclease
Protein accession: YP_001381013
Protein GI: 153006688
COG category:
COG ID:
TIGRFAM ID:
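
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon (both assumptions are consistent with the values in this record, but the record does not state them).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4491072, 4492457)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1386 bp) and Protein Length (461 aa).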


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGAGGT TCGAGGCGAA ACGTGCTTCG ACGTCCGCGG CACGTCGAGA GCACGCTCCT 
CGCGAGCGCA GGCAGGGCGC GCCGTGCGCG TCCGGCGCGC CGCGTGGACG GCGAAGGGGC
GGGCCCGGAA AGCGGCGCAG GCGCGATCCT GGCGCGTCTC GGGAGGTGCG GAAATCGCGT
GTTACGAAGG GTGCACAAGG AGGTGCACCG TGCCTGCGAT CGCCCCTTCC GCCCTCGACT
CGCCCCTGCT CGCCCAGCGC CTGCGCGAGC TCGCAGGCCA GGAGCGCGAC GTCCAGGTCG
AGTTCCTCCT CCACCTCGAG GTGTTCGATC GCCGCCGCGC GTACGTGGAC GCCGGGTACC
CCTCGCTCTG GGCGTATTGC CTGGAGGTGC TCCACCTGCG CGAGGGCGGC TGCGGGGCGA
CGCATCCAGG CGATGCGGGT GCTGCGCCGG TTCCCCAGCC TCGAGGACGC CCTGCGAGAT
GGCCGCCTTT GCATCTCCAC CGTCCAGCTG CTCGGCCAGG TGCTGACCGA GGAGAACCTG
CCCGACCTCG TCGGCCGTGC CGCCTACCGC ACCAAGGCCG AGGTGGATCA CCTCGTCGCC
TCGCTCCAGG CGCGCACCGC TCCGCGGGCG GGCCTGCGCA AGCTGCCCGA CCGCGCCTCA
GCCGCGAGCG CCCCGGCGCT GCCGCTGGCG GCAGCCGAAC CTTCACGTGC CGAGCCGCAG
GAGTCGCCGC TCGCGCCGCC GTCGTCGGCC GCTGCTGCCG GGGTGTCCCC CGCCACGATG
CCCGCGCCGT CCGACCCGTC TCGCCAGAGG ACGCGGGCGG TCACCCGTGC GGTGAGCGAG
AGCGGCTGGT CGCTGCGGGT CACCATCGAC CGGGCCTGCA AGGAGGACCT CGAGACGCTC
ACCGCGCTGC TCTCGCACGA GTTCCCGGAC GGCGATCTCG CGGCGGTGCT CCGGGAGGCC
ATCCGCTGCG CCATCGAGAA GCACGGCAGG CGCAAGGGCG CGGTCGCGCC GCAGCGGCAG
CGGGGGACCG ACCGGGAGCC ACGTCCCTCC GCCGAGTCCG CCGCGCCCAC GAGCACGATC
CCGGCGATAG TGCGGCGCGA GGTCTGGAAG CGCGACGGCG GACGCTGCGC CTGGGTCGCT
CCGGACGGGC GGCGCTGCGA CAGCCGCTGG CAGCTGGAGC TCGACCACAT CCACCCGCAG
GCCCTGGGCG GACCCTCGAC GGTCGAGAAC CTCCGAGTCG CCTGCAAGTC GCACAACCTG
TTGCACGCCG AACAGACCTA CGGGCGCGAG CACATGGATC GTTTCCGGCG TGAGAGCGCC
TCCGAGCGGA CGGGGTACGC CGGCGCCGCG CCAGCTGCCA TTCAGCAGGG CTTGTGGGCA
ACGTGA
 
Protein sequence
MTRFEAKRAS TSAARREHAP RERRQGAPCA SGAPRGRRRG GPGKRRRRDP GASREVRKSR 
VTKGAQGGAP CLRSPLPPST RPCSPSACAS SQARSATSRS SSSSTSRCSI AAARTWTPGT
PRSGRIAWRC STCARAAAGR RIQAMRVLRR FPSLEDALRD GRLCISTVQL LGQVLTEENL
PDLVGRAAYR TKAEVDHLVA SLQARTAPRA GLRKLPDRAS AASAPALPLA AAEPSRAEPQ
ESPLAPPSSA AAAGVSPATM PAPSDPSRQR TRAVTRAVSE SGWSLRVTID RACKEDLETL
TALLSHEFPD GDLAAVLREA IRCAIEKHGR RKGAVAPQRQ RGTDREPRPS AESAAPTSTI
PAIVRREVWK RDGGRCAWVA PDGRRCDSRW QLELDHIHPQ ALGGPSTVEN LRVACKSHNL
LHAEQTYGRE HMDRFRRESA SERTGYAGAA PAAIQQGLWA T
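
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal pure-Python illustration, not the database's own pipeline; `translate_cds` and the constant names are hypothetical. Table 11 assigns the same amino acids as the standard code but permits additional start codons, which is why the leading TTG of this gene is read as M. The sketch assumes the input is the coding strand, beginning at the start codon.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by table 11; an initiating codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGACGAGGT TCGAGGCGAA ACGTGCTTCG"))
```

Passing the full gene sequence (the 10-base grouping and line breaks are ignored) should give a string that can be compared directly with the listed protein sequence after removing its spaces.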