Gene Anae109_1638 details


Gene Information

Locus tag: Anae109_1638
Symbol:
ID: 5375632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1842379
End bp: 1843416
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 74%
IMG OID: 640843147
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001378826
Protein GI: 153004501
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
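
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database itself: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1842379, 1843416)
print(length_bp)                  # 1038
print(protein_length(length_bp))  # 345
```

Both values match the Gene Length and Protein Length fields of the record.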


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.749782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGCGA GGCGCAGCTC CGCGGCGCTC GCGGCCTTCG GCGGCCTGGC GCTCCTCGCG 
GCGCTCGGCG CGTGCCGCGG CGCCACGTCG GAGAAGCCCG CGCGCGCCGA GCTGCTCAAC
GTCTCCTACG ACTCGACCCG CGAGCTCTAC GGGGAGGCGA GCGCGGCGTT CGCGCGTCGA
TGGAGGTCGC GCACGGGCCA GGAGGTCACC GTGAGGCAGT CGCACGGCGG CTCCGGCAAG
CAGGCCCGCT CGGTGATCGA CGGGCTGGAG GCGGACGTGG TGACGCTCGC GCTCGGCTAC
GACGTCGACG CGCTCGGCAG GGCCGGGCTC GTCGCGCCGG ACTGGGCCGG GCGGCGCCCC
GGCGGCGCCG CGCCCTTCAC CTCCACGATC GTCTTCCTCG TCCGCGAGGG GAACCCGAAG
GGCATCCGCG CCTGGGACGA CCTCGTGAAG CCGGGCGTCC AGGTGATCAC GCCGAACCCG
AAGACGTCGG GCGGCGCGCG CTGGAGCTAC CTCGCCGCGT GGGCCCACGC CCTCGAGAAG
GGCGGCGGCG ACGAGGCCCA GGCGCTGGCG TTCGTCCGCG CCCTCTACGC GAACGTGCCG
GTGCTCGACT CGGGAGCGCG TGGCTCCACG ACGACGTTCG TCGCGCGCGG GATCGGCGAC
GTGCTCCTCG CCTGGGAGAG CGAGGCCCTG CTGGCGATGG AGAAGCTCGG TCACGGCGGC
CTCGAGCTCG TCGTGCCCGA GGAGAGCATC CTCGCCGAGC CGCCGGTGGC GGTGGTGGAT
GCCGTCGTGG AACGGCATGG CACGCGCGAG CTCGCGGAGG CGTACGTGCA GTTCCTGCAC
TCGGAGGAGG GCCAGGAGAT CGCCGCGCGA CACCACTATC GACCGCGCCT GGCCTCGGTG
GCGGCCCGGT ACGAGGACCG CTTCCCCAAG CTACGGCTGT TCACCGTGGA CGCCGTCTTC
GGAGGCTGGG CGAGCGCGCA GGCGAGGCAC TTCGCGGACG GCGGCCTGTT CGACCAGATC
CACGCGCCGG GCCGCTGA
 
Protein sequence
MTARRSSAAL AAFGGLALLA ALGACRGATS EKPARAELLN VSYDSTRELY GEASAAFARR 
WRSRTGQEVT VRQSHGGSGK QARSVIDGLE ADVVTLALGY DVDALGRAGL VAPDWAGRRP
GGAAPFTSTI VFLVREGNPK GIRAWDDLVK PGVQVITPNP KTSGGARWSY LAAWAHALEK
GGGDEAQALA FVRALYANVP VLDSGARGST TTFVARGIGD VLLAWESEAL LAMEKLGHGG
LELVVPEESI LAEPPVAVVD AVVERHGTRE LAEAYVQFLH SEEGQEIAAR HHYRPRLASV
AARYEDRFPK LRLFTVDAVF GGWASAQARH FADGGLFDQI HAPGR
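
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. The gene begins with GTG rather than ATG; under translation table 11 (bacterial, archaeal and plant plastid code) GTG is an accepted initiation codon and is read as methionine at the first position. The code below is an illustrative sketch only: the helper name `translate_cds` is hypothetical, the set of start codons lists just three common initiators rather than the full table 11 set, and only the first 60-base row of the gene sequence is translated.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code
# and differs in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                    # initiator codon is read as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "GTGACGGCGA GGCGCAGCTC CGCGGCGCTC GCGGCCTTCG GCGGCCTGGC GCTCCTCGCG"
print(translate_cds(first_row))  # MTARRSSAALAAFGGLALLA
```

The output matches the first 20 residues of the protein sequence above.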