Gene Anae109_1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1449 
Symbol 
ID5374050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1644355 
End bp1645530 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content74% 
IMG OID640842960 
ProductHNH endonuclease 
Protein accessionYP_001378640 
Protein GI153004315 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.964003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCGA TCGCCCCTTC CGCCCTCGAC TCGACCCTCC TCGCCCAGCG CCTGCGCGAG 
CTCGCAGGCC AGGAGCGCGA CGTCCAGGTC GAGTTCCTCC TCCACCTCGA GGTGTTCGAT
CGCCGCCGCG CGTACGTGGA CGCCGGCTAC CCCTCGCTCT GGGCGTATTG CCTGGAGGTG
CTCCACCTGC GCGAGGGCGC GGCCGGGCGA CGCATCCAGG CGATGCGGGT GCTGCGCCGG
TTCCCCAGCC TCGAGGACGC CCTGCGAGAT GGCCGCCTTT GCATCTCCAC CGTCCAGCTG
CTCGGCCAGG TGCTGACCGA GGAGAACCTG CCCGACCTCG TCGGCCGGGC CGCGTACCGC
ACCAAGGCGG AGGTGGATCA CCTCGTCGCC TCGCTCCAGG CGCGCACCGC TCCGCGGGCG
GGCCTGCGCA AGCTGCCCGA CCGCGCTGCA GCCGCGAGCG CCCCGGCGCT GCCGCTGGCG
GCAGTGGATG CCGGACCTGC CGAGCCGCAG GAGTCGCCGC TCGCGCCGCC GTCGTCGGCC
GCTGCTGCCG GGGTGTCCCC CGCCACGATG CCCGCGCCGT CCGACCCGTC TCGCCAGAGG
ACGCGGGCGG TCACCCGTGC GGTGAGCGAG AGCGGCTGGT CGCTGCGGGT CACCATCGAC
CGGGCCTGCA AGGAGGACCT CGAGACGCTC ACCGCGCTGC TCTCGCACAA GTTCCCGGAC
GGCGATCTCG CGGGGGTGCT CCGGGAGGCC ATCCGCTGCG CCATCGAGAA GCACGGCAGG
CGCAAGGGCG CGGTCGCGCC GCAGCGGCAG CGGGGGACCG ACCGGGAGCC ACGTCCCTCC
GCCGAGTCCG CCGCGCCCAC GAGCACGATC CCGGCGATAG TGCGGCGCGA GGTCTGGAAG
CGCGACGGCG GACGCTGCGC CTGGGTCGCT CCGGACGGGC GGCGCTGCAA CAGCCGCTGG
CAGCTGGAGC TCGACCACAT CCACCCGCAG GCCCTGGGCG GACCCTCGAC GGTCGAGAAC
CTCCGAGTCG CCTGCAAGTC GCACAACCTG TTGCACGCCG AACAGACCTA CGGGCGCGAG
CACATGGATC GCTTCCGGCG TGAGAGCGTC TCCGAGCGGA CGGGGCATGC CGGCACCGCC
CCAGCTGCCA TTCAGCAGGG CTTGTGGGCA ACGTGA
 
Protein sequence
MPAIAPSALD STLLAQRLRE LAGQERDVQV EFLLHLEVFD RRRAYVDAGY PSLWAYCLEV 
LHLREGAAGR RIQAMRVLRR FPSLEDALRD GRLCISTVQL LGQVLTEENL PDLVGRAAYR
TKAEVDHLVA SLQARTAPRA GLRKLPDRAA AASAPALPLA AVDAGPAEPQ ESPLAPPSSA
AAAGVSPATM PAPSDPSRQR TRAVTRAVSE SGWSLRVTID RACKEDLETL TALLSHKFPD
GDLAGVLREA IRCAIEKHGR RKGAVAPQRQ RGTDREPRPS AESAAPTSTI PAIVRREVWK
RDGGRCAWVA PDGRRCNSRW QLELDHIHPQ ALGGPSTVEN LRVACKSHNL LHAEQTYGRE
HMDRFRRESV SERTGHAGTA PAAIQQGLWA T