Gene Anae109_1661 details


Gene Information

Locus tag: Anae109_1661
Symbol:
ID: 5376056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1867158
End bp: 1868489
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 66%
IMG OID: 640843170
Product: extracellular solute-binding protein
Protein accession: YP_001378849
Protein GI: 153004524
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
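
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function name `gene_and_protein_length` is hypothetical and used only for illustration.

```python
def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a single terminal
    stop codon that is not counted as a residue.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 1867158, End bp 1868489.
print(gene_and_protein_length(1867158, 1868489))  # (1332, 443)
```

Under these assumptions the result matches the listed Gene Length (1332 bp) and Protein Length (443 aa).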


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.407934
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGG ACGGATCGCG CGGCCTGACG CGCAGGGATC TCTTGAAGGC GGCCGGAGCG 
GGCGCCATCG CGGCGGCGGC GGGGAGTGCC GGGCGCGCGC GCGCGCAGCC GAAGACGCTC
AAGATCGTGC AGTGGAGCCA CTTCATCCCA GCCTACGACA AGTGGTTCGA CGGCGTGTTC
TGCAAGCGGT GGGGGGAGAA GAACGGCACG CAGGTGATCG TCGATCACAT CGCGATCGGC
GAGATCAACG CGCGGGCGGC GGCCGAGGTG TCGGCGCAGC GCGGTCACGA TCTGTTCATG
TTCCTGTCGC CGCCCGCCGC GTACGAGAAG CAGGTCATCG ACCACTCGGA GATCTACCAG
GCGGTGGAGA AGAAGTGGGG CAAGGTCATC GACCTCGGCC ACAAGTCCAC CTTCAACCCG
AAGACGAAGA AGTACTTCGC CTTCTCCGAC AGCTACGTGC CGGATCCGGG CAACTACCGT
CAGGACCTCT GGTCGCAGGT CGGGTTCCCG AAGGGACCCG ACACCTGGGA GGACGTGCGC
AAGGGCGGCA AGGCCATCAA GGACAAGTTC GGCAACCCCG TCGGCATCGG GCTCTCGCAG
GAGCTCGACA CGAACATGGC CATGCGCGCG CTGATGTGGT CGTTCGGCGC GTCGGAGCAG
GACGCCGAGG GGCGCGTGAC GATCAACTCG CCGCAGACCA TCGAGTCGCT CAAGTTCATG
CGCGCGCTCT TCAAGGAGGC CGAGACGAGC GAGGTCTTCA CCTGGGACCC TTCGTCCAAC
AACCGCGGGA TCCTCGCGGG CAAGCTGTCC TTCGTCTGCA ACGCCATCTC GGTGACGCGC
ACCGCCGAGA AGGAGAACCC GGACATGTCG AAGAAGCTCC AGATCGTGCC CGCGCCGAAG
GGTCCGGTGC GCCGCATGGC GGCCGAGCAC GTGATGGACT GCTACGCGAT CTGGAAGTTC
GCCGAGAACA AGGAGGGCGC GAAGCAGTTC CTGGCCGACT ACATCGACGC GTTCGGCGAG
GCGTTCAAGC AGAGCGAGTT CTACAACTTC CCGTGCTTCC CGAAGACCGT CCCCGACCTG
AAGCAGCAGA TCGCGAACGA TCCGAAGGGC GTCCCGCCCG ACAAGTACAA CGTGCTCGGC
GACGTGCTCG AGTGGGCGAC CAACGTCGGC TATCCCGGCT ACGCGTCCGC CGCGGTGGAC
GAGGCGTTCA ACACGTTCGT CATCCCCACC ATGTTCGCGA AGGTGGCGCG TGACGAGCTG
TCGCCCGAGG ACTCGGTGCG GGCGGCGGAG AAGGAGCTGA AGCGCATCTG GGACAAGTGG
AAGACGGCCT GA
 
Protein sequence
MAKDGSRGLT RRDLLKAAGA GAIAAAAGSA GRARAQPKTL KIVQWSHFIP AYDKWFDGVF 
CKRWGEKNGT QVIVDHIAIG EINARAAAEV SAQRGHDLFM FLSPPAAYEK QVIDHSEIYQ
AVEKKWGKVI DLGHKSTFNP KTKKYFAFSD SYVPDPGNYR QDLWSQVGFP KGPDTWEDVR
KGGKAIKDKF GNPVGIGLSQ ELDTNMAMRA LMWSFGASEQ DAEGRVTINS PQTIESLKFM
RALFKEAETS EVFTWDPSSN NRGILAGKLS FVCNAISVTR TAEKENPDMS KKLQIVPAPK
GPVRRMAAEH VMDCYAIWKF AENKEGAKQF LADYIDAFGE AFKQSEFYNF PCFPKTVPDL
KQQIANDPKG VPPDKYNVLG DVLEWATNVG YPGYASAAVD EAFNTFVIPT MFAKVARDEL
SPEDSVRAAE KELKRIWDKW KTA
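
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The following is a minimal sketch, not the database's own tooling: it strips the spacing, translates DNA codon by codon up to the first stop codon, and computes the GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon assignments are the standard ones, which translation table 11 shares for elongation codons; alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGCCAAGG ACGGATCGCG CGGCCTGACG"))  # MAKDGSRGLT
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree; `gc_fraction` on the same input can be compared against the GC content reported in the record.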