Gene Anae109_4201 details

Gene Information

Locus tag: Anae109_4201
Symbol:
ID: 5376455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4927197
End bp: 4928333
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 76%
IMG OID: 640845728
Product: putative glycosyl hydrolase
Protein accession: YP_001381363
Protein GI: 153007038
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3934] Endo-beta-mannanase
TIGRFAM ID:
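
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
start_bp = 4927197
end_bp = 4928333


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate span."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1137 378
```

Both values match the Gene Length and Protein Length fields listed in the record.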


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGCCG GGGCCCCCGC GATCATCTTG CTCTTCGTGG CCGCGTGCGA GGACGGACGG 
GATCTGGAGG CGGAGGCCGC CGCGTGCGCG GAGGGGCCGC CGCTCGTCCT GCCCGGCGTG
GACGGCCACC TCGTCGCGCT GAACGCCTAC TACCTCCAGG AGGAGGCGAC CCGCGCGCTG
CGCCGCGGCG ATCCGGAGGC CGCCGCCGTG GAGGAGACGC TCGCGAAGGC CTCCGCCCTG
GGCGTTCGGG CCCTGCGCAC GAACGCGTTC AACGACGGCG CGCAGGACAC CGCGATGCAG
GTGGTGCCGC TGGTCTACGA CGAGGTCGCC CTGCGCGGGC TGGACCTCGT CCTCGCGCGG
GCCCGCGTGC ACGGGCTCCG GCTGGTGCTG CCGCTCGCGA ACCGCTGGGA CGCCTACGGC
GGGCAGCGCC AGTACGTCGC CTGGGCGGGG CTCCCCGCGC CGCGCGAGGG CGATCCGCGC
TTCTTCACCG AGCGCGCGGT GGTCGAGCAC TTCCGCGCGC ACGTCGCCAC GCTCCTCGAC
CGCGTCAGCA CGGTGGACGG CCTCCGCTGG GGCGACCACC CGGCCGTGCT CGCCTGGGAG
CTCGTGAACG AGCCGCGCGC GGACGGCGTC TCGCGCGAGG CGCTCCGCGC CTGGGTGGAC
GAGCTCGCCG CCCTCGTGAA GGCGAAGGCG CCCGGGCACC TCGTCGGCTC GGGCGAGGAG
GGGCTCGACG CCGAGGAGTT CGCGCTCCTC ACCGCCTCGC CGCACCTGGA CTACGCGACC
GTGCACCTCT ACCCGGAGGC GTGGGGCGTG CCCGCGGACT GGGCCGCGTT CTTCGGGGCG
GGGTTCCTCT CCGAGCGGAT CGCGACCGCC CGCCGTCTCG GGAAGCCGCT CCTCGTGGGG
GAGCTGGCGC TCCGCAACGA CGGGCTGCCG CTCGAGGATC GCCGCGCCAT CTACCGCGGC
TGGTTCCGCT GCATCCGCGC GGCCGGCGGC GCCGGCGTCG CGCCCTGGCT GTTCGCCTAC
GACGGCCGCC CCGACGCGTG GGACCCGCAC ACGTTCTACT GGCGCGACGG CACCGACCCG
GCCGACCCGG TGAACCGCTA CGCGGACCTG GTCCGCGACG CGGCGGCGGT GCCGTAG
 
Protein sequence
MRAGAPAIIL LFVAACEDGR DLEAEAAACA EGPPLVLPGV DGHLVALNAY YLQEEATRAL 
RRGDPEAAAV EETLAKASAL GVRALRTNAF NDGAQDTAMQ VVPLVYDEVA LRGLDLVLAR
ARVHGLRLVL PLANRWDAYG GQRQYVAWAG LPAPREGDPR FFTERAVVEH FRAHVATLLD
RVSTVDGLRW GDHPAVLAWE LVNEPRADGV SREALRAWVD ELAALVKAKA PGHLVGSGEE
GLDAEEFALL TASPHLDYAT VHLYPEAWGV PADWAAFFGA GFLSERIATA RRLGKPLLVG
ELALRNDGLP LEDRRAIYRG WFRCIRAAGG AGVAPWLFAY DGRPDAWDPH TFYWRDGTDP
ADPVNRYADL VRDAAAVP
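
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction, using only the first row of the gene sequence as input; the helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "GTGCGCGCCG GGGCCCCCGC GATCATCTTG CTCTTCGTGG CCGCGTGCGA GGACGGACGG"

seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))  # 60 0.75
```

Passing the complete gene sequence through the same two functions gives a value that can be compared with the GC content field in the Gene Information section.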