Gene Anae109_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1431 
Symbol 
ID5376333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1619086 
End bp1620252 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content74% 
IMG OID640842942 
ProductHNH endonuclease 
Protein accessionYP_001378622 
Protein GI153004297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.535031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCGA TCGCCCCTTC CGCCCTCGAC TCGACCCTGC TCGCCCAGCG CCTGCGCGAG 
CTCGCCGGCC AGGAGCGCGA CGTCCAGGTC GAGTTCCTCC TCCACCTCGA GGTGTTCGAT
CGCCGCCGCG CCTACGTGGA GGCCGGCTAC CCCTCGCTCT GGGCGTATTG CCTGGAGGTG
CTCCACCTGC GCGAGGGCGC GGCCGGGCGA CGCATCCAGG CGATGCGGGT GCTGCGCCGG
TTCCCCAGTC TCGAGGGCGC GCTTCGGGAT GGCCGCCTTT GCATCTCCAC CGTCCAGCTG
CTCGGCCAGG TGCTGACCGA GGAGAACCTG CCCGACCTCG TCGCCCGGGC CGCCTACCGC
ACCAAGGCCG AGGTGGATCA CCTCGTCGCC TCGCTCCAGG CGCGCACGGC TCCGCGGACG
GGCCTGCGCA AGCTGCCCGA CCGCGCCTCA GCCGCGAGCG CCCCGGCGCT GCCGCTGGCG
ACAGTCCATG CCGGACCTGC CGAGCCGCAG GAGGCGATCC CCGCGCCGGC GGCGGCTGGT
GGGTCGCTGC CGCCCACGGT CTCCGCGCTG CCCGACGTTC CTCGCCCGAA GGCGCGGGCG
GAGACCCGCG CCGTGAGCGA GAGCGGCTGG TCGCTGCGGG TCACCATCGA CCGGGGCTGC
AAGGAGGACC TCGAGACGCT CACCGCGCTG CTCTCGCACA AGATCCCGGA CGGCGATCTC
GCGGCGGTGC TCCGCGAGGC CATCCGCTGC GCCATCGAGA CGCACGGCAA GCGCAAGGGC
GCGATCGCGC CGGAGCGGCA GCGGAAAGCG GACGGGGACC CACGGCCCTC TGCCGAGCGC
GCCGCGCCCA CGGGCACGAT CCCGGCGATA GTGCGGCGCG AGGTCTGGAA GCGCGACGGC
GGACGCTGCG CCTGGGTCGC TCCGGACGGG CGGCGCTGCA ACAGCCGCTG GCAGCTGGAG
CTCGACCACA TCCACCCGCA GGCCCTGGGC GGACCCTCGA CGGTCGAGAA CCTCCGAGTC
GCCTGCAAGT CGCACAACCT GTTGCACGCC GAACAGACCT ACGGGCGCGA GCACATGGAC
CGCTTCCGTC GCATGGGCGT CGCCGGGGTG ACGCCAGATG CCAGCGGGGC GCCACCAGCG
CCGCAGCAGG CCCTGTGGGG ACCGTGA
 
Protein sequence
MPAIAPSALD STLLAQRLRE LAGQERDVQV EFLLHLEVFD RRRAYVEAGY PSLWAYCLEV 
LHLREGAAGR RIQAMRVLRR FPSLEGALRD GRLCISTVQL LGQVLTEENL PDLVARAAYR
TKAEVDHLVA SLQARTAPRT GLRKLPDRAS AASAPALPLA TVHAGPAEPQ EAIPAPAAAG
GSLPPTVSAL PDVPRPKARA ETRAVSESGW SLRVTIDRGC KEDLETLTAL LSHKIPDGDL
AAVLREAIRC AIETHGKRKG AIAPERQRKA DGDPRPSAER AAPTGTIPAI VRREVWKRDG
GRCAWVAPDG RRCNSRWQLE LDHIHPQALG GPSTVENLRV ACKSHNLLHA EQTYGREHMD
RFRRMGVAGV TPDASGAPPA PQQALWGP