Gene Anae109_2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2144 
Symbol 
ID5374376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2431314 
End bp2434334 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content80% 
IMG OID640843656 
ProductDNA repair exonuclease, SbcC 
Protein accessionYP_001379330 
Protein GI153005005 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.214587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.18739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCGC TCAGCCTGGA GCTCCAGGCG TTCGGGCCGT ACGCCCGGGC GCAGAAGGTG 
GACTTCGACG CCCTCGGGGG CGCGGACCTC TTCCTCATCC ACGGCCCCAC CGGCGCGGGG
AAGACGACGC TCTTCGACGC CATGACGTTC GCTCTCTACG GCGACGTCCC CGGCACGCGC
GGCACCGGGC GCCTGCGCGC CGACCGCGCC GCCGAGGGGG CCGCGCCGAG GGTGGTCTTC
CGGTTCCGCC TCGGCGCCGC CGTGTACCGC GCAGAGCGGA CGGCGGCGTG GCAGCGCCCG
AAGAAGCGCG GGGAGGGGAC CATCGAGGAG GCACCCACCG CGAGCCTCTG GCGCGAGGGC
GCCGAGGCGC CCCTCGCGAC GAAGCCGACG GCCGTGACCG AGAAGGTGAC GGAGCTGCTC
GGCATGGGCG CCGAGCAGTT CACGCGCGTC GTGCTGCTCC CCCAGGGCGA CTTCAAGCGG
CTCCTCTGCG CGGACGCGCG CGAGCGGGAG GAGCTGCTCC AGCAGCTCTT CGGCACCGCC
GTCTACAAGG ACGTGGAGGA GCTGCTCGTC CGCAAGAAGA ACGAGCTCGA GGCGGCGGCG
CGGCGGCTCG CGGAGCGCCG CGAGGAGGTG CTCGGGGGAG CGGACGCGGG CGCGCTGGCG
TCGGGGCGCG AGGCGCTCGA GGCGCGGCTG GCCGAGGCGC GAGGCGAGGC GGCCGCGCGC
GCGGTCGACG ACGCCGCGGC GCTCGACGCG CTCGCCGCGG GCCGGCAGCT CGCGGCCCGC
CTCGAGGCGC TCGCGCGCGC ACGCGAGGAG TCGGCGCGGG CGCAGGCGGG CGCGGCGGAC
CTCGCGCGCG ATCGCGAGCG GCTCGCGCGG GCGAGCGCCG CGGAGCGCGT CCGGGAGAAG
CTCGCCGCCG CGCGGAAGGC CGAGGTCGCC CGGGTCGCGC GGGAGAAGGA CGAGGCGCTC
GCAGCGGGAC GGCGCGACGA GGCGGTCGCG GCGCTCGCGA GGGCCGGCGA GGCGCTGGCG
AAGGCCGAGG CGGAGGCGCC CCGGCGGACG GAGCTGTCGG CGCGGGTGCA GCTCCTCGAG
CGCGTGCTCC CGGAGCTGGA GCGGCTCGCC GCGGCGGACA GGGCCGTGGC GGAGTCGCGC
CGGGCCGCGG CCAGCGCCCG CGCGGAGGAC GAGCGCGCCC GGGCGGCGCT CGCGAGCGCC
GAGGCGCGTC CCGCGGCGCT GGAGGCGGAG GCGGCCCGGC TCCGGCCGGT GGCGGCGGAG
GAGGGCGCCT GCGCGGAGCG CTCGGCGAGG CTGGAGACGG GGCTCCGGGC CGCGCGGGAG
CGCGACGCCC GGCGCGCGGA GATCTCGAAG CTCGAGCAGG CGCTCGCGGG TGAGGAGCGC
GAGGCGCGCC ACGCCGCGGA GGCCGCGAGG CGGGCCACGG CCCACGCGGA CGGGCTCGCC
GCGGCGCGCG AGCGTGAGCT CGCGGCCTGG CTCGCGAAGA AGCTGGCCCC CGGCCATCCG
TGCGCGGTGT GCGGCTCGGC GGAGCACCCG GCTCCGGCGC GCTCGCGCGA GCGGGTGCCG
GAGCGAGAGG AGATCGACGA GGCCCGGGCC GCAGAGCGGC GGCTCTCCGA GCGCGCGGCG
GAGGCGGAGA AGCGCCGCGC GACCACGGCC GGGCTCCTCG CCGAGGCCCA CCAGCGCGCG
ACCGGCGCCG CCGAGGGGGA CGCGCGAGCG ACGGCCGCCA TCGAGGGCGA GTCGCGCGAG
GCCGCCGCCG CGCTGAAGCG GGCGCGCGAG GCCGCCGCCC GGGTGCGCGC GCTGGACGTC
GAGGCGACGA GCGCGCGCGG CGAGGTGGAG GCGGCGCGCG CCCGGCAAGA GGCGGCCGGC
GCGAGGCGCG CCGCGGCCGA GCAGCACGTC GCGGCCGCCG AGGCGGCCGG CGCCGAGCTG
GCGCGGCAGG TTCAGGCGGC GGGGGCGGGG CCGGACACGC CCGCCGAGCT CGCGGCGGCG
CGGAGGGAGC TCGACGCGCT CGAGGCGGCC GCTGCGACAG CCCGCCGCGC CGCGGGCGAG
GCCGGGGCCG GGCACGCGTC CGCGCTCGAG CGGCTCGCCT CGTGCTCCGA GGAGGCGGCG
AGGGCCGAGG CCGCCGCGAC CGAGGCGCAC GCGGCGGCGG CGGAGGCCTG CGCGGCCGCG
GGGTTCGACG GGCTCGCCGC CTGCGAGGCG GCCCTGCTCG CGCCCGAGCA GCGGTCGGCG
CTGGAGGAGT CGCTCGAGGC CCGCACGGTG GCCGCCAGGG CCGCCGCCGA GCGGGCCGGG
GCGCTCGAGG CCGAGCTCTC GGGCGCCTCC GCCCCGGACC TCCCGGCGCT CGAGGCCCGG
CGGGCGGCCA CGGCCGGCGC GGCGGGCGCC GCGCGCGACG CGGTCGTCCG GCTCGAGAAG
GACCACGAGC GGCTGCGCCA GCTCGAGGCG CGCCTCGGGG AGCTCGAGGC GCGCGCGGCG
GAGCTCGCGC GCGAGCTCGC GGTGGCGGGC AAGGTCGCCG AGATCGCGCG GGGGCACAAC
GCGCTCAACA TGAGCCTGCA GCGCTTCGTG CTCGCCGCGC GGCTCGAGGA GGTCGCCGAG
GCCGCGAGCC GCCGCTTGCT GCAGATGTCG CGTGGCCGCT TCAGGCTGCG CCACGACACC
GCCGTCGCTC GGCGGAACCA GGCGTCGGGG CTCTCGCTCG TGGTGGAGGA CGCCTGGACC
GGCGTGACGG ACCGGCCCGT GGGCGCGCTC TCCGGGGGCG AGAGCTTCCT CGCGAGCCTC
GCGCTCGCGC TCGGCCTCTC CGACGTGGTG CTGCGCCGCT CCGGAGGGCT GCGGCTCGAC
GCGCTCTTCG TGGACGAGGG CTTCGGCTCG CTCGACGAGG AAACCCTCGA CGACGCGGTG
CGCACGCTCG AGGACCTGCG GAAGAGCGGC CGGGTGGTGG GGGTCATCTC GCACGTGGCC
GAGCTGCGCC GGCGCATCCC GGCGCGCATC GAGATCCAGC GCAAGGCCGA AGGATCGGTG
GCGGTGGTAC GCGCCGGGTG A
 
Protein sequence
MRPLSLELQA FGPYARAQKV DFDALGGADL FLIHGPTGAG KTTLFDAMTF ALYGDVPGTR 
GTGRLRADRA AEGAAPRVVF RFRLGAAVYR AERTAAWQRP KKRGEGTIEE APTASLWREG
AEAPLATKPT AVTEKVTELL GMGAEQFTRV VLLPQGDFKR LLCADARERE ELLQQLFGTA
VYKDVEELLV RKKNELEAAA RRLAERREEV LGGADAGALA SGREALEARL AEARGEAAAR
AVDDAAALDA LAAGRQLAAR LEALARAREE SARAQAGAAD LARDRERLAR ASAAERVREK
LAAARKAEVA RVAREKDEAL AAGRRDEAVA ALARAGEALA KAEAEAPRRT ELSARVQLLE
RVLPELERLA AADRAVAESR RAAASARAED ERARAALASA EARPAALEAE AARLRPVAAE
EGACAERSAR LETGLRAARE RDARRAEISK LEQALAGEER EARHAAEAAR RATAHADGLA
AARERELAAW LAKKLAPGHP CAVCGSAEHP APARSRERVP EREEIDEARA AERRLSERAA
EAEKRRATTA GLLAEAHQRA TGAAEGDARA TAAIEGESRE AAAALKRARE AAARVRALDV
EATSARGEVE AARARQEAAG ARRAAAEQHV AAAEAAGAEL ARQVQAAGAG PDTPAELAAA
RRELDALEAA AATARRAAGE AGAGHASALE RLASCSEEAA RAEAAATEAH AAAAEACAAA
GFDGLAACEA ALLAPEQRSA LEESLEARTV AARAAAERAG ALEAELSGAS APDLPALEAR
RAATAGAAGA ARDAVVRLEK DHERLRQLEA RLGELEARAA ELARELAVAG KVAEIARGHN
ALNMSLQRFV LAARLEEVAE AASRRLLQMS RGRFRLRHDT AVARRNQASG LSLVVEDAWT
GVTDRPVGAL SGGESFLASL ALALGLSDVV LRRSGGLRLD ALFVDEGFGS LDEETLDDAV
RTLEDLRKSG RVVGVISHVA ELRRRIPARI EIQRKAEGSV AVVRAG