Gene Anae109_1700 details

Gene Information

Locus tag: Anae109_1700
Symbol:
ID: 5375997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1912003
End bp: 1913169
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 74%
IMG OID: 640843209
Product: HNH endonuclease
Protein accession: YP_001378888
Protein GI: 153004563
COG category: [V] Defense mechanisms
COG ID: [COG1403] Restriction endonuclease
TIGRFAM ID:
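
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1912003
END_BP = 1913169
GENE_LENGTH_BP = 1167
PROTEIN_LENGTH_AA = 388


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the reported fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span from 1912003 to 1913169 covers 1167 bp, and 1167 bp corresponds to 389 codons, one of which is the stop.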


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00106635
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGA TCGCCCCTTC CGCCCTCGAC TCGACCCTCC TCGCCCAGCG CCTGCGCGAG 
CTCGCCGGCC AGGAGCGCGA CGTCCAGGTC GAGTTCCTCC TCCACCTCGA GGTGTTCGAT
CGCCGCCGCG CGTACGTGGA CGCCGGGTAC CCCTCGCTCT GGGCGTATTG CCTGGAGGTG
CTCCACCTGC GTGAGGGCGC TGCCGGGCGA CGCATCCAGG CGATGCGGGT GCTGCGCCGG
TTCCCCAGCC TCGAGGACGC CCTGCGAGAT GGCCGCCTTT GCATCTCCAC CGTCCAGCTG
CTCGGCCAGG TGCTGACCGA GGAGAACCTG CCCGACCTCG TCGCCCGGGC CGCGTACCGC
ACCAAGGCGG AGGTGGATCA CCTCGTCGCC TCGCTCCAGG CGCGCACCGC TCCGCGGACG
GGCGTCCGCA AGCTGCCCGA CCGCGCTCCA GCCGCGAGCG CCCCGGCGCT GCCGCTGGCG
GCAGTGGATG CCGGACCTGC CGAGCCGCAG GAGGCGATCC CCGCGCCGGC GGCGGCTGGT
GGGTCGCTGC CGCCCACGGT CTCTGCGCTG CCCGACGTGG CTCGCCAGAA GGCGCGGGCG
GAGACCCGCG CCGTGAGCGA GAGCGGCTGG TCGCTGCGGG TCACCATCGA CCGGGGCTGC
AAGGAGGACC TCGAGACGCT CACCGCGCTG CTCTCGCACA AGATCCCGGA CGGCGATCTC
GCGGCGGTGC TCCGCGAGGC CATCCGCTGC GCCGTCGAGA AGCACGGCAA GCGCAAGGGC
GCGATCGCGC CGGAGCGGCA GCGGAAGGCC GACCGGGAGA CACGTCCCTC CGCCGAGCCC
GCCGCGCCCA CGAGCACGAT CCCGGCGATA GTGCGGCGCG AGGTCTGGAA GCGCGACGGC
GGACGCTGCG CCTGGGTCGC TCCGGACGGG CGGCGCTGCA ACAGCCGCTG GCAGCTGGAG
CTCGACCACA TCCACCCGCA GGCCCTGGGC GGACCCTCGA CGGTCGAGAA CCTCCGGGTG
GCCTGCAGGT CGCACAACCT GTTGCACGCC GAACGGACCT ACGGGCGCGA GCACATGGAC
CGCTTCCGGC GCGGAAACCT CGCCGAGCGG ACGGGGCATG CCGGCACCGC GCCAGCTGCC
ATTCAGCAGG GCTTGTGGGC AACGTGA
 
Protein sequence
MPAIAPSALD STLLAQRLRE LAGQERDVQV EFLLHLEVFD RRRAYVDAGY PSLWAYCLEV 
LHLREGAAGR RIQAMRVLRR FPSLEDALRD GRLCISTVQL LGQVLTEENL PDLVARAAYR
TKAEVDHLVA SLQARTAPRT GVRKLPDRAP AASAPALPLA AVDAGPAEPQ EAIPAPAAAG
GSLPPTVSAL PDVARQKARA ETRAVSESGW SLRVTIDRGC KEDLETLTAL LSHKIPDGDL
AAVLREAIRC AVEKHGKRKG AIAPERQRKA DRETRPSAEP AAPTSTIPAI VRREVWKRDG
GRCAWVAPDG RRCNSRWQLE LDHIHPQALG GPSTVENLRV ACRSHNLLHA ERTYGREHMD
RFRRGNLAER TGHAGTAPAA IQQGLWAT
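
The relationship between the two listings above can be checked by translating the gene sequence and comparing the result with the protein sequence. The sketch below is one way to do this in plain Python, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in their permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first line of the gene sequence is embedded to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence above, with its 10-base spacing preserved.
FIRST_LINE = (
    "ATGCCCGCGA TCGCCCCTTC CGCCCTCGAC TCGACCCTCC TCGCCCAGCG CCTGCGCGAG"
)


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(round(gc_content(FIRST_LINE), 2))
```

Passing the full gene sequence to `translate` and `gc_content` instead of `FIRST_LINE` would allow a comparison with the complete protein sequence and the reported GC content.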