Gene Anae109_4230 details


Gene Information

Locus tag: Anae109_4230
Symbol:
ID: 5374219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4958149
End bp: 4960518
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 66%
IMG OID: 640845758
Product: hypothetical protein
Protein accession: YP_001381392
Protein GI: 153007067
COG category:
COG ID:
TIGRFAM ID:
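
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4958149, 4960518)
print(length)                     # 2370
print(protein_length_aa(length))  # 789
```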


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.828103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACGCAG GTGGCGGGAA GCCGCGGGTC AAGGCACAGG ATCGCGGCGC TCACGCGCGC 
GATCTGGAGG CGGAGCTCGA TACCGCCGCT GCTGCGGCGG AAGACGCCGA CCGACCGGTC
AGGATCGACT TTCATAGCGA ACCCGGCTTC GAGCTTGCCC TCGCGAGCCT CGACTCGACG
CGGCCTGGCG CGTACGAGCT GCTGGCCGTT CGGCACGATG GTGACGTGAC AATCGCGACG
GTCCTGGTCG AACGAAAGAG CATCAAGCAC TTCCGAAAGG CGATTGAGGA ATACGCGGCG
AAGGACACGC GGAAGGGCCT ACCGGCGCAC GAGCGGCTGA TCGCGAACAT CTCGGCGATT
CACCGCGCAT CCGTGCGGTC CATCTGGACG GATGAAGGCG CGCCGTTCCC AGCTAACGGG
GAGGCGATGT GGTGGGAAGC GTGGCTCCGC CGACGTGAGG GAGCGATCGC CGGTTTCAGG
GCTCGCGCCG TCGAGCAGGG ACTGCGGCCC GGTCCTCGAA CGCTGGCCTT CATCGACCGG
CTCGTCACCA CCGTGTACGG CACGGTCGAG CAGATGGGAG CTCTGTTCGA GGAGGACGAC
GCCCTCGCGG AGCTCAGACG CGCGAAGGAA CTCTCAACGG AGTTCTTAAG CCTCACGCCC
CGGGAACAAG CTGAGTGGGC GCGAGATCTT CGGCGCAGGG TGGTCCCCCC AGGACCCGAT
GCGCCTTCCG TGTGCCTCCT CGATACGGGC GTCAATCGGG GGCACATCCT CTTGGAGCCG
CTCCTCGATG CGAGGTCCAC GCTGTCCTGC GACGAACAAT GGGGCGCGAA CGACCACAAT
GGCCACGGGA CGGAGATGGC GGGCCTGGCC GGGTTCGGAG ACCTCGCACC GCTTCTGATG
TCTGGCAGCC TCGTTCCAAT CCGTCACCGG CTCGAGAGCG TGAAGATCCT TCCTCCCGCC
GGGAACAACG AACGCGAAGT CTACGGCGCC CTCACTCAGG AGGCAGTGGC ACGCGCAGAG
GCTGCGAATC CGGACCGCGC TCGCGCCGTA TGCCTCACGG TGACGACCAG GGACGGCCGC
GACCAAGGAA AGCCGTCTTC CTGGTCCGCC GCGATCGATC AGTTCGCCGC GGGGGCGCTC
GATGATCAGC GACGCCTGTT CTGTGTTTCG GCGGGGAACG CGGACGTCGA CGACGCGATG
AACTACCCGA CGAGCAACGA AACCGACTCG ATTCACGACC CTGGGCAGTC GTGGAACGCC
ATCACAGTGG GCGCGCTCGC GGACCGCGTC GATATTACAG AGCCCGAATT CCACGGCTGG
TCCGCGGTGG CGCCAGCCGG TGATCTCGGA CCCTGCAGCA CGACCTCGTC CACGTGGCCC
GGGCAGAGGC GGTGGCCGAT CAAGCCCGAA ATCCTGATGC CGGGCGGCAA CATCGCAATC
AACCCGGACC GGACCGTCGT GGATGCGACG GACAGCCTGT CGCTCCTGAC GACGCACTGG
ATGCCCATCG AGCGGCAGTT CACGACCTCT GGCGACACGA GCGCGGCCGC TGCCTTGGCG
GCGCGCTTGG CTGCGCGTAT CCAAGCGGAC TACCGTCAGC TCTGGCCCGA GACGGTTCGG
GGCCTGATCG TCCACGCCGC CGAGTGGACC GAGGCGATGC GAAGGCGATT CCCCGCCAAG
AACGACGTCG AGAAGCGGCT CAGGTTCTAC GGGTTCGGTG TCCCCGACGA AGCTGCCGCG
GTTCGAAGCG CGGACGATGC GCTGACGCTC ATCGCCCAGA ACACGATTCA GCCATTCATA
AAGGAGAAGA AGGGCAAGAG CACACGGTTC GTCACCGCGG ATATGCACGT TCATCGGCTG
CCATGGCCGA CGGACGTTCT CACCGAGCTC GGAGAGAGGG ACGTGGACCT CCGTGTGACG
CTCTCGTATT TCATCGAGCC AAGCCCGGGT GAGCGGGGAT GGAAACAGAG GCACCGGTAC
GCGTCGCACG GCCTCCGGTT CGAGCTTAAG ACCGCCACCG AGACATTCGA GCAGTTCCGA
ACCCGTATCA ACCGTGCGGC GCGCGCCGAG GATGAGAAAC CTACGAGCAA GGGCGACCAA
CGGGGGTGGA CGCTTGGCCC CGATCTGCGC ACCGCCGGAT CGATTCACTC CGACACATGG
ACCGGCCCGG CGATCGACCT CGCGCGCCGC AGTGCGATCG CCATCTACCC CGCGATCGGT
TGGTGGCGGG AGCGGCACCA TCTAGGTCGA TGGAACCAGA AGACGAGGTA CTCCCTCGTG
GTGTCGATCC GGACACCAGG GATCGAGACC GACATCTACA CCCCGGTTGC GATCCAACTC
GGCATCCCCG TCCCGACCGA GATCGCGTGA
 
Protein sequence
MHAGGGKPRV KAQDRGAHAR DLEAELDTAA AAAEDADRPV RIDFHSEPGF ELALASLDST 
RPGAYELLAV RHDGDVTIAT VLVERKSIKH FRKAIEEYAA KDTRKGLPAH ERLIANISAI
HRASVRSIWT DEGAPFPANG EAMWWEAWLR RREGAIAGFR ARAVEQGLRP GPRTLAFIDR
LVTTVYGTVE QMGALFEEDD ALAELRRAKE LSTEFLSLTP REQAEWARDL RRRVVPPGPD
APSVCLLDTG VNRGHILLEP LLDARSTLSC DEQWGANDHN GHGTEMAGLA GFGDLAPLLM
SGSLVPIRHR LESVKILPPA GNNEREVYGA LTQEAVARAE AANPDRARAV CLTVTTRDGR
DQGKPSSWSA AIDQFAAGAL DDQRRLFCVS AGNADVDDAM NYPTSNETDS IHDPGQSWNA
ITVGALADRV DITEPEFHGW SAVAPAGDLG PCSTTSSTWP GQRRWPIKPE ILMPGGNIAI
NPDRTVVDAT DSLSLLTTHW MPIERQFTTS GDTSAAAALA ARLAARIQAD YRQLWPETVR
GLIVHAAEWT EAMRRRFPAK NDVEKRLRFY GFGVPDEAAA VRSADDALTL IAQNTIQPFI
KEKKGKSTRF VTADMHVHRL PWPTDVLTEL GERDVDLRVT LSYFIEPSPG ERGWKQRHRY
ASHGLRFELK TATETFEQFR TRINRAARAE DEKPTSKGDQ RGWTLGPDLR TAGSIHSDTW
TGPAIDLARR SAIAIYPAIG WWRERHHLGR WNQKTRYSLV VSIRTPGIET DIYTPVAIQL
GIPVPTEIA
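
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), whose amino acid assignments are the same in translation table 11. As a simplification, the first codon is always reported as M, which covers the GTG start codon of this gene; a full implementation would check the codon against the table's list of permitted start codons. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        # Simplification: the initiator codon is always written as Met.
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above, with the page's block spacing.
print(translate_cds("GTGCACGCAG GTGGCGGGAA GCCGCGGGTC"))  # MHAGGGKPRV
```

The output matches the first ten residues of the protein sequence listed above.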