Gene Anae109_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1978 
Symbol 
ID5377504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2239013 
End bp2240719 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content70% 
IMG OID640843488 
Producthypothetical protein 
Protein accessionYP_001379165 
Protein GI153004840 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.308445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCCGG CCGTCAACAG GGCGCTCGCC GTAGGCATCC TCGCAGCCGT CACCGGCCTC 
GCGTTCCTCG TCGCCATCAC CTTCTTCCGC AAGGGTGGGT TCTCCGATCG CGACAGCTAC
CTCGTGCACG CGTACTTCCG GGACGCGACC GGCCTCACCT GGAAGAGCCG CGTGCAGATC
GCCGGCATCC AGATCGGCGA GGTGGACGAC ATCCGGCTCG AGGGCGCCCG CGCGCGCCTC
GACATCCGGG TGAAGAACGA CGTCGAGCTC CACAAGGACG CCTGCCTCAC CAAGATCTAC
CCGTCCGCGT TGCTCCCGGA CGCCGTCCTC GAGGCGGTCC CCGGCTCCGA CGCGTCCCCG
AAGCTCAAGG ACCTGCCCGA GGAGGAGCGC GAGATCGCGT GCGCGCGGGC GGCGGCGACG
GCGCAGGAGC TCATGGACTC GATGGCGAAG ATCGCCTCGG ACGTCCAGAC GATCACGGGC
GACCTGGCGG ACACGGTCAA GGGAGACCAG GGGAGCCTCC GCGAGATCGT GGAGAACCTC
GCCCGCATCA CGCGCCAGGT GGACCAGGTC GTCGCCGAGG AGGGCCAGAC GATCTCGGAG
ATCCTGGCCA ACACCCGGGA GTTCACGGGC GCGCTCGCGG AGATCTCGGC CCGCGACAAG
GAGCGGATCC ACAACATCGC CCGCAACGTC GAGGACCTCA CCTCGCGCCT CAACGTGGTC
CTCGCCAGCG TGCAGGACAT CCTCGACCCG CAGGGCGGCG GGGCGCCGGG GACGCCGGGG
GCGAAGGGCG TGCCGGGAGC GCCTGGTGCG CCGGGCACGC CTGGGGCGCC GGGTGCTCCG
GGGACGCCGG GGGCTCTCGC CGGCACTCCC GAGCAGCGCG CGCAGGCGCA GGCCGAGGCG
CGCGGCGTGA AGCAGGCGGT GGATCGGCTG AACACCAGCC TCGCGAGCCT CGACAGCATG
CTCGAGAAGG TGAACGAGGG GAAGAGCCCG GCGGGCAAGC TCCTCGTGGA CGAGCGGCTC
GGGCGCAAGG TCGGGGAGGC CGTCGAGGGG ATCAGCGACT ACGTCGATCG GCTCGTGAAG
CTGCAGATCC AGCTGCAGCT CCGCTCCGAG TGGCTGCTCA ACCAGACCGT CTCCGAGGGT
CGCCCCGGCT CCAAGATCTA CTTCGGCGCG CGGCTCATGC CCCGGCCGGA CAAGTACTAC
CTCATCGAGC TGGCCTCGGA TCCGCGCGGC GTGAACACCG TCACGACGGA GACGATCACC
ACGCGCCCTC CGGGCACGAA CGTCGAGACC ACCACGCTCG TCACGCGCAC GCTGAACGAG
GAGAAGCTCA CCTTCTCGCT GCAGCTCGCG AAGCGCTACG GAATGGTCAC CTTCCGCGCC
GGCCTCATCG AGGGCTCGGG CGGCGTCGGC TCCGACCTGC ACCTCCTGAA CGACGCGCTT
CAGCTGTCGG TGAACATGTA CCAGTTCTCG CGCCCCGGCC GCTCCGTGTA CCCGCGCGCG
AAGATCTGGG CGAACTACCA CTTCTTCCAG CACTTCTACG TGACCACGGG CGTCGACGAC
TTCCTGAACC AGTGGGAGAC GGGCAACTAC CCCGGCGGCC GGAGCTTCAA CATCGGCAGC
GACGTCTTCT TCGGCGGCGG CCTGTACTTC ACCGACGACG ACCTCAAGAC GCTCATCTCC
ACGGGCGCGG CGTCCGCGGT GCCGTGA
 
Protein sequence
MKPAVNRALA VGILAAVTGL AFLVAITFFR KGGFSDRDSY LVHAYFRDAT GLTWKSRVQI 
AGIQIGEVDD IRLEGARARL DIRVKNDVEL HKDACLTKIY PSALLPDAVL EAVPGSDASP
KLKDLPEEER EIACARAAAT AQELMDSMAK IASDVQTITG DLADTVKGDQ GSLREIVENL
ARITRQVDQV VAEEGQTISE ILANTREFTG ALAEISARDK ERIHNIARNV EDLTSRLNVV
LASVQDILDP QGGGAPGTPG AKGVPGAPGA PGTPGAPGAP GTPGALAGTP EQRAQAQAEA
RGVKQAVDRL NTSLASLDSM LEKVNEGKSP AGKLLVDERL GRKVGEAVEG ISDYVDRLVK
LQIQLQLRSE WLLNQTVSEG RPGSKIYFGA RLMPRPDKYY LIELASDPRG VNTVTTETIT
TRPPGTNVET TTLVTRTLNE EKLTFSLQLA KRYGMVTFRA GLIEGSGGVG SDLHLLNDAL
QLSVNMYQFS RPGRSVYPRA KIWANYHFFQ HFYVTTGVDD FLNQWETGNY PGGRSFNIGS
DVFFGGGLYF TDDDLKTLIS TGAASAVP