Gene Anae109_1697 details


Gene Information

Locus tag: Anae109_1697
Symbol:
ID: 5375994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1908512
End bp: 1909855
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 77%
IMG OID: 640843206
Product: extracellular solute-binding protein
Protein accession: YP_001378885
Protein GI: 153004560
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
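
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1908512, 1909855)
print(length)                  # 1344
print(protein_length(length))  # 447
```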


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00303465
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTGGCC TCTCCGTGGC CCCGACCCGG GCCGCACCCG CCGGAAACGG CTCCCGCGCG 
CTGCGACCCC GTTACGCTCC CGGCGTGACT TCATCCCGCG CGCTCGTCCC GCTCCTGGCC
GCCGCTGCGC TCGTGGCGGG GTGCCGGACG CACGAGAGCG CTCCGGCAGC GAAGACCCGC
CTCGTCTTCC GCTACCAGCC CCTTGGGCCG GATCCCGAGC CGCTGCGTGC GCTGGTCGCC
GCCTTCGAGC GCGAGACCGG CTTCGAGGTC GCGCTGCAGG CGCTCCCGAA CGCGGCCGAT
CTGGCCCATC AGCTCCTCGT CACCTCCCTC GGCGCGGGGG GCGAGGATCT GGACGTCTTC
GTCCTCGACG TCGTCTGGGT GGCGGAGCTC GCACGCGCCG GGTGGCTCGC CGACGTCTCG
GGGGCGTTCC CGCCCGCTCG CGTCCGCGCC GACTTTCTCG GAGGCGCCGC CGCCGCGGTC
GTGCAGGGCG AGCGCACCTT CGCGGTGCCC TGGTACGTGG ACGTGGGGCT CCTCTACTAC
CGGACGGACC TCGTGCCGGA GCCGCCGCGC ACCTACGCGG CGCTCGCGGC GGCGATCCGC
GCCGCGCAGG CCCGCTCGCC CGGAATCGCC GGCTACCTGT GGCAGGGCCG CCAGTACGAG
GGGCTCTCCT GCGTCTTCTT CGAGGCGCTG TGGGGGCACG GCGGCGCCGC GCTCGAGGGC
GAGCGGCTGC GGCTCGACAC GGCCGAGGCG CGGGCGGCGC TCTCGTGGCT GCGGGGGCTC
GTGGCGGAGG GGCTCTCGCC GCCGTGGGTC GCCACCGCCG CGGAGGAGGA GGCGCGGCGC
GCGTTCCAGG AGGGCCGCGC GGCGTTCATG CGCAACTGGC CGTACGCGTG GCCCTTGCTG
CAGGAGCCGG CCTCCCCGGT GCGCGGCCGC GTCGGCGTCG CGCCTCTCCC CGGTGCGCAG
GGGCCGTCGC CCGGTGCGCT GGGGGGCTGG CAGCTCGGGA TCTCCTCGGG GGCGCCCCCC
GCGCGCCGCG CCGCCGCGGA GCGGCTGGTC GCGCACCTCA CCTCGCCCGA GGCGAACGCG
GTGCTGGCGA TCGCCTACGG CCGGAACCCC GCTCGGCGTA CGGCCTACGA CGACGCGCGC
GTCCGCGGCG AGGCGCCGTT CATCGCGGCG CTCCTCCCCC GCGTGGAGGA CGCCCGTCCG
CGGCCGGTCA CGCCCTATTA CATGCTGGCG GCGGACGCGC TTCAGGGCGA GCTCTCCGCT
GCGGTGTCCG GGCTCCGGCC GCCCACCGAG GCGCTCGCGC GCGCGCAGGC GCAGCTGGAC
GCGATCGCGG GGGTGACGCG GTGA
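
The GC content reported in the gene information is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as whitespace-separated blocks like the ones above. The function name `gc_content` is hypothetical, and only the first line of the sequence is used here, so the printed value will not necessarily match the figure for the whole gene.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only; paste the full sequence
# to compute the value for the whole gene.
first_line = "TTGCGTGGCC TCTCCGTGGC CCCGACCCGG GCCGCACCCG CCGGAAACGG CTCCCGCGCG"
print(f"{gc_content(first_line):.0%}")
```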
 
Protein sequence
MRGLSVAPTR AAPAGNGSRA LRPRYAPGVT SSRALVPLLA AAALVAGCRT HESAPAAKTR 
LVFRYQPLGP DPEPLRALVA AFERETGFEV ALQALPNAAD LAHQLLVTSL GAGGEDLDVF
VLDVVWVAEL ARAGWLADVS GAFPPARVRA DFLGGAAAAV VQGERTFAVP WYVDVGLLYY
RTDLVPEPPR TYAALAAAIR AAQARSPGIA GYLWQGRQYE GLSCVFFEAL WGHGGAALEG
ERLRLDTAEA RAALSWLRGL VAEGLSPPWV ATAAEEEARR AFQEGRAAFM RNWPYAWPLL
QEPASPVRGR VGVAPLPGAQ GPSPGALGGW QLGISSGAPP ARRAAAERLV AHLTSPEANA
VLAIAYGRNP ARRTAYDDAR VRGEAPFIAA LLPRVEDARP RPVTPYYMLA ADALQGELSA
AVSGLRPPTE ALARAQAQLD AIAGVTR
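
The protein sequence corresponds to the gene sequence translated with translation table 11. The gene begins with TTG, which that table accepts as an alternative start codon, so the first residue is read as M rather than L. The sketch below is one way to reproduce this, assuming the codon assignments and start codon set of NCBI table 11. The function name `translate_cds` is hypothetical and is not the database's own tooling.

```python
# Codon assignments of NCBI translation table 11, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons accepted by table 11; any of them is translated as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGCGTGGCC TCTCCGTGGC CCCGACCCGG GCCGCACCCG CCGGAAACGG CTCCCGCGCG"
print(translate_cds(first_line))  # MRGLSVAPTRAAPAGNGSRA
```

The printed residues match the first 20 residues of the protein sequence listed above.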