Gene Anae109_4410 details

Gene Information

Locus tag: Anae109_4410
Symbol:
ID: 5376205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 5159616
End bp: 5160644
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 73%
IMG OID: 640845938
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001381572
Protein GI: 153007247
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
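
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5159616, 5160644)
print(length, protein_length(length))  # 1029 342
```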


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.476755
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAA GCCGGTGGAA GGTCCTTGGG CTCGCGGCGC TGGTGGCCGT CGCATTCGCG 
GGCGGCGGCG AGACGGCGCG GGCGGTGGAG GGAGGCGAGG CGAAGCTGCT CAACGTGTCG
TACGACCCCA CGCGCGAGCT CTACGACGAC GTCAACCAGG CGTTCGCGGC GCGCTGGAAG
GCGAAGACCG GTCAGGCCGT CACCGTGCGG CAGTCCCACG GGGGCTCCGG CAAGCAGGCG
CGGGCGGTGA TCGACGGGCT CGAGGCGGAC GTGGTCACGC TCGCCCTCGC CTACGACGTG
GACGCGATCG CCGCGCGCGG GCTGCTCCCC GCCGACTGGC AGAAGCGGCT GCCGGAGCGC
GCGGCGCCGT ACACCTCGAC CATCGTGTTC CTCGTGCGCA AGGGGAACCC CAAGGGCCTG
CGCGACTGGG ACGACCTCGT GAAACCCGGG GTCCAGGTCA TCACCCCCAA CCCGAAGACG
TCCGGCGGCG CGCGCTGGAA CTACCTCGCG GCCTGGGCGC ACGCCCTCGA GAAGGGCGGC
GGCGACGAGG CCAAGGCGCG CGAGTTCGTG ACGGCCCTGT TCCGGAACGT CCCGGTGCTC
GACTCCGGCG CTCGGGGCTC CACGACCACC TTCGTCGAGC GCGGCCTCGG CGACGTCCTG
CTCGCCTGGG AGAACGAGGC GTTCCTGGCG ATCGAGCAGC TCGGCAAGGG TCGGTTCGAG
ATCGTCGCGC CGCGCACCAG CATCCTCGCG GAGCCGCCCG TGGCGGTGGT CGAGAAGAAC
GCGGACCGGC ACGGCACGCG CGCCCTCGCC CAGGCGTACC TCGAGTTCCT CTACACGCCG
GAGGGCCAGG AGCTCGTCGC GAAGCACTTC TACCGCCCGC GCGACCGCGC CGTCGCGGCC
CGCCACGCCG GCCGCTTCCC CGCCATGCGC CTCGTGACGA TCGACGCGTT CGGCGGCTGG
CAGAAGGCGC AGGCCGCCCA CTTCGCGGAC GGCGGCGTCT TCGACCAGAT CTACGCGCCC
GGCCGCTGA
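
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. As a minimal sketch, the helpers below (hypothetical names) strip that formatting and compute the fraction of G and C bases, which is how a figure such as the GC content field could be checked against the full sequence.

```python
def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAAGGCAA GCCGGTGGAA GGTCCTTGGG CTCGCGGCGC TGGTGGCCGT CGCATTCGCG"
print(len(clean(first_line)))            # 60
print(f"{gc_fraction(first_line):.0%}")  # GC share of this line only
```

Passing the whole gene sequence to `gc_fraction` gives the value for the complete CDS.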
 
Protein sequence
MKASRWKVLG LAALVAVAFA GGGETARAVE GGEAKLLNVS YDPTRELYDD VNQAFAARWK 
AKTGQAVTVR QSHGGSGKQA RAVIDGLEAD VVTLALAYDV DAIAARGLLP ADWQKRLPER
AAPYTSTIVF LVRKGNPKGL RDWDDLVKPG VQVITPNPKT SGGARWNYLA AWAHALEKGG
GDEAKAREFV TALFRNVPVL DSGARGSTTT FVERGLGDVL LAWENEAFLA IEQLGKGRFE
IVAPRTSILA EPPVAVVEKN ADRHGTRALA QAYLEFLYTP EGQELVAKHF YRPRDRAVAA
RHAGRFPAMR LVTIDAFGGW QKAQAAHFAD GGVFDQIYAP GR
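
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own procedure: it uses the standard codon assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (a non-ATG start would need to be read as M). `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()  # remove block spacing and line breaks
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGGCAA GCCGGTGGAA GGTCCTTGGG"))  # MKASRWKVLG
```

The output matches the first ten residues of the protein sequence; the same call on the full gene sequence can be compared against the full protein sequence once its spacing is removed.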