Gene Cagg_1102 details

Gene Information

Locus tag: Cagg_1102
Symbol:
ID: 7268555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1359320
End bp: 1360858
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 59%
IMG OID: 643565944
Product: major facilitator superfamily MFS_1
Protein accession: YP_002462448
Protein GI: 219848015
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
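
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function names (span_length, coding_length_aa, gc_percent) are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1359320
END_BP = 1360858
GENE_LENGTH_BP = 1539
PROTEIN_LENGTH_AA = 512


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: 1360858 - 1359320 + 1 = 1539 bp, and 1539 / 3 - 1 = 512 aa. Passing the gene sequence from the Sequence section to gc_percent would allow the GC content field to be checked in the same way.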


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000316091
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAGAATC CAGCTTTGAC CAACGTTGAA ACCCGTACCG TTGACCCGCG CTTGACACTG 
GCGCTGGTCT GTCTCGCGAT CTTTATCGGT GCGGTCGATC TCACCGTGAT CAGTGCAGCG
TTGCCCAAAG TGATGATCGA CCTCCGCCTC GCTCTTGATA CCGAGCTGAA CCGTGCGTCG
TGGGCGGTGA GTGGGTACTT GTTGGCGTAT ACCGTCAGTA TTACCTTCAT GGGGCGGTTG
TCCGACCTAT TCGGACGGCG GAAGGTCTAT TTTCTCTGCC TCATTACCTT CTTGGTTGGA
TCGGCGGTCG TGGCGGCGGC GCCCAACTTG ACGATATTGA TTGTTGGCCG GGTTATCCAG
GCGCTTGGAG CCGGAGCGAT GGTACCGGTA TCGATGGCAC TGGTCGGCGA TCTCTTTTCG
GTAGGGCAGC GGGCGGCTGC CCTCGGTGTG ATCGGGGCCG TCGATACTGC CGGCTGGATG
GTTGGTCATC TCTACGGCGG CGTCTTGATG CGTCTGTTCG ACGACTGGCG GCTGCTCTTT
TGGCTCAATC TCCCGATCGG TGCGGTGGCG CTTGGGCTGA CGTGGTACGC CCTGCGAAAT
GTACCAACCC CGCCGCGCGT AGGTTCGTTC GATTGGCCGG GAACCGTGTT GTTGAGTGCC
GGTCTGGTGG TATTGAACGT TGGTTTGGCG GCCGGGAGTG AGTTGGGGGC GACCGACTTC
TACGGTGAGC GGTTGGGGCC TCCACCGTAT GCCGGGCCGC TGGTCGGATT AGCGTTGATG
TTGTTCGCAC TGTTTGTCTG GGTCGAGCGA CGCAGCGCCG ATCCGTTGAT CGGCTTAGAA
CTGTTTACGC GCCGTGATAC GGCGATGGCG TGTATCATCA ATGTGATGGT TGGTTTTGGC
TTGGCCATCG CGATCACGAA TGTACCACTG TACATTAACA CTCGTCTGCT GCTTTACCAC
CCAACCGATA GCGATATTCT GCGGATTGCA GCGTGGGATG CCGGTTGGAT GTTGTCGGCA
TTGACCTTGA CGATGGCTGT CGCCGCATTG CCCGGTGGCC TATTGACGGC ACGCTTTGGG
GCGCGCTTGC CGACCATCCT CGGCTTAGGC TTAGCGCTCG TTGGCTATCT CTTGATGACG
TTCTGGGGGC CAGAGGCAAC CTATCTGCGG ATGGGGTTGG AATTGGCCCT AACCGGTATT
GGTCTCGGCT TGGTGATCGC ACCGGTCGCC GATACCGTTG TAGCGGCTGC CGGCGGAGAC
CAGCGTGGGG CAGCTTCGGC ATTGGTGATT GCTCTGCGTT TGGTTGGGAT GACGGTCGGT
GTCGCGTTGC TCACATTGTG GGGCGTGCAT CGGCAAGATG TGTTACGGCG GGCCGGCGCC
GATAACCCGC TGGCAATGAC CGACCCCGCC CGGTTTCTGA TGGAGATTGC CGCCAACGTG
ATCGGCGAAA CCTTTCTCTT TGGCGCCGCA GCGTGTGTCA TCGGACTGGT GGCCGGTTGG
TTAATGCGAA GATGGGTGGT AACACATCAC ACCGGATAA
 
Protein sequence
MKNPALTNVE TRTVDPRLTL ALVCLAIFIG AVDLTVISAA LPKVMIDLRL ALDTELNRAS 
WAVSGYLLAY TVSITFMGRL SDLFGRRKVY FLCLITFLVG SAVVAAAPNL TILIVGRVIQ
ALGAGAMVPV SMALVGDLFS VGQRAAALGV IGAVDTAGWM VGHLYGGVLM RLFDDWRLLF
WLNLPIGAVA LGLTWYALRN VPTPPRVGSF DWPGTVLLSA GLVVLNVGLA AGSELGATDF
YGERLGPPPY AGPLVGLALM LFALFVWVER RSADPLIGLE LFTRRDTAMA CIINVMVGFG
LAIAITNVPL YINTRLLLYH PTDSDILRIA AWDAGWMLSA LTLTMAVAAL PGGLLTARFG
ARLPTILGLG LALVGYLLMT FWGPEATYLR MGLELALTGI GLGLVIAPVA DTVVAAAGGD
QRGAASALVI ALRLVGMTVG VALLTLWGVH RQDVLRRAGA DNPLAMTDPA RFLMEIAANV
IGETFLFGAA ACVIGLVAGW LMRRWVVTHH TG
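
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two differ in their permitted start codons). The names CODON_TABLE and translate are illustrative, not taken from any particular library.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed
    before translation.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGAAGAATC CAGCTTTGAC CAACGTTGAA"))  # MKNPALTNVE
```

The first 30 bases give MKNPALTNVE, matching the start of the protein sequence, and the final nine bases (ACC GGA TAA) give TG followed by a stop, matching its end.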