Gene AnaeK_4101 details

Gene Information

Locus tag: AnaeK_4101
Symbol:
ID: 6785571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 4632776
End bp: 4634056
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 74%
IMG OID: 642765569
Product: major facilitator superfamily MFS_1
Protein accession: YP_002136434
Protein GI: 197124483
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
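
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4632776
end_bp = 4634056
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is
# assumed to be the stop codon and is not part of the protein.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, remainder, residues)  # 1281 427 0 426
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop matches the listed protein length.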


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCACTCCA CGCCCGGCGC CGCCCGGCTC TTCCATGGCC CCGGCGGCCG CGCGCTGCTC 
GTCCTCACCT TCATCAACCT GTTCAACTAC CTCGATCGCT TCGTGGTCTC GGCGCTCGTC
GAGAGCCTGC GCGCGGACCT GTGGCTCACC GACACGCGCC TGGGCTGGCT CATGACGAGC
TTCACCATCG TCTACGCGCT CGCCTCCCCG GTGTTCGGCG CGCTGGGCGA CCGCCGGTCC
CGTCCCCCGC TGGTGGCGCT CGGCGTCCTG CTCTGGAGCG CCGCCACCAT GCTCTCCGGC
GCCGCGCGCG GCTTCTACAC GCTGCTCCTC GCGCGCGCGG CGGTCGGCGT GGGCGAGGCC
GCCTACGGCA CGCTCTCCCC CGGCCTGCTC GCCGACTACT TCGGAAAGGA CCGGCGCGGT
CGGGCGTACG CCACCTTCTT CGCCGCCATC CCCATCGGCT CGGCGCTCGG CTACATCGTG
GGCGGCCTGG TGGAGCACCG CTTCGGCTGG CGCACCGCGT TCGTGATCTC CGGCGCGCCC
GGCGTGCTGC TCGCGTACTG GTGCCTGCGG CTCCCCGACC CGCCGCGCGG CGCGAGCGAG
CGCCCCTGGC TCGAGCTCGG GAAGCGCGGC CTCGCCGCCA CCTACCGGCG CCTGCTCGCG
AACCGGCCCT ACGTTCTCGC CGTGGCCGGC TATGCGGCCT ACACCTTCGC GGTCGGCGGC
ATGGCGTTCT GGATGCCCGC GTTCCTGGAG CGCTCGCGCG GCGTGCCCCG CGCCATCGCC
ACCGTCCAGT TCGGCGCGGT GGTGGTGATG ACCGGGTTCG CCGGCACGTT CGCCGGCGGC
TTCTTCGCCG ACTGGCTCCG CCGCCGCCGC CGCGAGGCCG ACCTGTGGGT CTCCGGCATC
GCCACGCTGC TCGCCGCCCC GCTGTCACTG ATGGTGTTCC TCACCTGGCG GCCCGGGTTC
TACCTCTCCG CGCTCATCGG CGCGCAGCTC CTGCTGTTCG CGTCCTCAGG ACCCATCAAC
GCCGCGCTCA TGAACGTGGT CCCGCCCGCC GAGCGGGCCA CCGCCGCCGC GCTCTCGATC
CTCGCCATCC ACGTGTTCGG CGACCTGCCC TCGCCCACCA TCATCGGCGC GCTCTCCGAC
CACAGCTCGC TGGGGCGCGC CGTGCTCATC GTGCCCGCCG CGATCCTCGT CTCGGGCGCG
ATCTGGACCT GGGCGGCGTG GCGCGGGGAG CGGGCGGCGA CGTTGGCGGG CGCCGGCCCA
GGCGACGATC GCGCCGCCTA G
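
A GC content figure like the one listed under Gene Information can be computed from a sequence by counting G and C bases. The sketch below is one possible way to do this; `gc_percent` is a hypothetical helper, not part of any tool used to build this record. It is shown on the first line of the gene sequence only, so its result is not the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence (60 bases), with its block spacing kept.
first_line = (
    "GTGCACTCCA CGCCCGGCGC CGCCCGGCTC "
    "TTCCATGGCC CCGGCGGCCG CGCGCTGCTC"
)
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

To reproduce the whole-gene figure, pass the full sequence text (spaces and line breaks included) to the same function.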
 
Protein sequence
MHSTPGAARL FHGPGGRALL VLTFINLFNY LDRFVVSALV ESLRADLWLT DTRLGWLMTS 
FTIVYALASP VFGALGDRRS RPPLVALGVL LWSAATMLSG AARGFYTLLL ARAAVGVGEA
AYGTLSPGLL ADYFGKDRRG RAYATFFAAI PIGSALGYIV GGLVEHRFGW RTAFVISGAP
GVLLAYWCLR LPDPPRGASE RPWLELGKRG LAATYRRLLA NRPYVLAVAG YAAYTFAVGG
MAFWMPAFLE RSRGVPRAIA TVQFGAVVVM TGFAGTFAGG FFADWLRRRR READLWVSGI
ATLLAAPLSL MVFLTWRPGF YLSALIGAQL LLFASSGPIN AALMNVVPPA ERATAAALSI
LAIHVFGDLP SPTIIGALSD HSSLGRAVLI VPAAILVSGA IWTWAAWRGE RAATLAGAGP
GDDRAA
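
The protein sequence follows from the gene sequence under translation table 11, in which GTG can act as a start codon and is then read as methionine. The sketch below illustrates this on the first 30 and last 21 bases of the gene. `translate_cds` and the constants are hypothetical names, and the start codon set is the one commonly listed for NCBI table 11, stated here as an assumption.

```python
# Amino acids in NCBI order, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: alternative start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, initiator: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    If initiator is True and the first codon is a start codon,
    it is translated as M regardless of its usual amino acid.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and initiator and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate_cds("GTGCACTCCA CGCCCGGCGC CGCCCGGCTC")
tail = translate_cds("GGCGACGATC GCGCCGCCTA G", initiator=False)
print(head)  # MHSTPGAARL
print(tail)  # GDDRAA
```

Both fragments agree with the start and end of the protein sequence above. The final TAG codon is a stop and contributes no residue.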