Gene Cagg_0732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0732 
Symbol 
ID7268051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp909694 
End bp910932 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content55% 
IMG OID643565583 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002462092 
Protein GI219847659 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000169813 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGA GCACTGCCCT CAGCTCATCT CGTCATCGCG CAGCAATATT CCGGGCTTTA 
CGCCATCGAA ATTATCGCCT CTTCTTCATC GGACAACTGA TCTCACTGAC CGGGACGTGG
ATGCAGAGTG TTGCGCAGGG CTGGCTGGTA CTCCGTCTTT CCGATTCGCC GTTCTTGCTC
GGTGCAGCGG CAGCGGCCAA CTCGCTGCCG GTATTGTTGT TATCACTTTT TGCCGGTACG
ATAGCCGATC GTTTTCCAAA ACGCCGCATT TTACTAATCA CCCAATCGAC GGCGATGGTG
TTGGCGGCGA TATTGGCCTT CCTAACGTTC AGTAGTGTTG TACAAATTTG GCATGTGTTG
ATCCTGGCAC TCCTGTTAGG GGTCGTCAAT GCGTTTGATG CACCGGCCCG ACAGGCCTTT
ACCGTTGAGA TGGTGGGGCG CGAAGATCTG CTCAATGCTA TCGCTCTCAA TTCGTCAATC
TTCAATGGGG CGCGCACCGT CGGGCCGGCG TTGGCCGGTA TGGTAGTAGC CTGGATCGGT
GAAGGACCCG CATTCCTGTT TAATGCCTTA AGCTTTGGGG CCGTTCTGAC CAGCCTCTTA
CTGATGCGTC TTGATACGCA GTTGCACCGC GGATCGCAGC GGGGTGGGAT GTTACGCGCC
GGATTGGCGT ATATTGCCGG TGAACCACAT GTACGCGCCC TGTTATTACG GGCCGGCGCC
GTTAGTTTCT TCTGTTTCGT TCATATTCCG CTTTTGCCAA TATTTGCCCG TGATATTCTT
CAGATCGGAG CTACGGGTCT TGGATGGCTA TCGGCAGCCA GTGGTTGTGG TTCGCTGGTC
GCAGCACTCA TCCTCGCCCA ACTCCGCGAT GATGCACCGC GCGGTAAGTT GCTATCAATC
GCAGCCACAA TGTATGCACC TCTCCTTATT ATGTTCACCC AAGTTCGTAG CCTGTCGTTC
GCTTTGCTCT TCATCGGTCT CTGCGGCTGG GCCGGGGTCA CCACGATGGC GCTCACCAAC
ACGCTGATTC AGCTTACCGT ACCCGATGAA TTGCGTGGAA GGGTTATGAG CGTCTTTACG
CTGCTTTTGA TGGGACTTAG CCCCTTGGGC GGTATGCTGG CCGGCAGTAT CGCCGAACTG
GTTGGAAGTG TACCCACAGT CATAGCCGGC AGTGCTGTCA TCGGCTGGCT ACTCGTGCTG
CTCGTCGAAT GGCAAACACC GCAATTACGC CGGTTATGA
 
Protein sequence
MSTSTALSSS RHRAAIFRAL RHRNYRLFFI GQLISLTGTW MQSVAQGWLV LRLSDSPFLL 
GAAAAANSLP VLLLSLFAGT IADRFPKRRI LLITQSTAMV LAAILAFLTF SSVVQIWHVL
ILALLLGVVN AFDAPARQAF TVEMVGREDL LNAIALNSSI FNGARTVGPA LAGMVVAWIG
EGPAFLFNAL SFGAVLTSLL LMRLDTQLHR GSQRGGMLRA GLAYIAGEPH VRALLLRAGA
VSFFCFVHIP LLPIFARDIL QIGATGLGWL SAASGCGSLV AALILAQLRD DAPRGKLLSI
AATMYAPLLI MFTQVRSLSF ALLFIGLCGW AGVTTMALTN TLIQLTVPDE LRGRVMSVFT
LLLMGLSPLG GMLAGSIAEL VGSVPTVIAG SAVIGWLLVL LVEWQTPQLR RL