Gene Cagg_0769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0769 
Symbol 
ID7268088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp953611 
End bp954849 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content56% 
IMG OID643565620 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002462129 
Protein GI219847696 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCACA TCACATCCAA CCTGCGCGTA TCGATCCTCA AGATTCCGGC TGCGGTTTGG 
CGTATTTTGG CCCACAGTTT TATTTACGGT TTCGCCCTTA GTATTGCCGA TATTTTGTTC
AATTTTTACC TCGTTAGTCA AGGTTATACC ATCAATGACG TGGGTTTGCT GTCGATGGTG
AGCCGGGCTG CCGGTATGGT GATGGGGTTA CCGATCGGTT GGCTTATCGA CCGGTTCGGG
TCACAACGCG CAATGATCGG TGGTGTGATA GGGTATGCGC TTGGCTGGGC GGCACTCTTA
CAGGCGCCGG CATTGCCGTG GCTCATTGCT GCCCAATTTG TCGTTGGGGC TTGCTATCTG
CTGGCTGCTA CTGCAGTTAC TCCGTTGCTG GCGTTGGTGA CTACCGAAGA GCTGCGTCCT
CTCGTCTTCG GTATGAATGC TTCGGCGACA TTGATAGTCG GCTTGCTCGG AAGTGCTGTT
GGTGGGGTGT TGCCGATGGG AGCAGCCTTG ATGATGGCCG TTGAGCCGCA ATCTACTGTG
GCTTACCGGG TGGCTCTCAC GACGGTGATA GGGTTGAGTA TTGTGGCGTT ATGGCCGGTA
CTGGTGCAGC TCCCGGCGGT GGCGGAGAGA CGTGCCGCCG GTGAGGAGGA ATCGGTTGGA
TCGCGCCGGC TCTCGTGGTT CATGCTGCTC TGGATGGCTT TGCCCTCGTT TCTGCTCGGT
GTTGGTGGTG GCGCAATTTT GCCGTTTCAG AATCTCTTTT TTCGCGATCA GTTTGGTTTG
AGCGACGCCG GAGTCGGGTT GACTCTATCG CTGGCTTCGC TTGGCGCCGG CGTTGGGGCA
TTGCTGGGTG CGCCGGTGGT CCGTCGTATC GGCTTGCAAC GCGGTGCCGC ATTGCTACGG
TTAGGTGCGA CACCGGCAAT GTTGCTGATG TTAACACCAT GGTTGCCGCT CGCCATTATT
GGGTTTTTTG CCCGTGGCTT TTTCATTGCT GCCAGTTATC CGATGAACGA TGCCTTGGTG
ATGGGGGCTA CGCCCACCAC TCAGCGTGGG CTTGCTATGA GCTTGATGAG TTTGCTCTGG
GCCGGTGGTT GGGCTATTTC GGCGGTGATT TGGGGGTGGG TAACACCGAT CTTTGGGTAT
GGGCCGCAGA TTGTTGCTGC TGCTTTGGCC TATGCCCTCT CGGCGTTGGT GATTTGGAGT
CTGCGTCTGC AACGATCGGC AGAGCAGACG GCTGCATAG
 
Protein sequence
MTHITSNLRV SILKIPAAVW RILAHSFIYG FALSIADILF NFYLVSQGYT INDVGLLSMV 
SRAAGMVMGL PIGWLIDRFG SQRAMIGGVI GYALGWAALL QAPALPWLIA AQFVVGACYL
LAATAVTPLL ALVTTEELRP LVFGMNASAT LIVGLLGSAV GGVLPMGAAL MMAVEPQSTV
AYRVALTTVI GLSIVALWPV LVQLPAVAER RAAGEEESVG SRRLSWFMLL WMALPSFLLG
VGGGAILPFQ NLFFRDQFGL SDAGVGLTLS LASLGAGVGA LLGAPVVRRI GLQRGAALLR
LGATPAMLLM LTPWLPLAII GFFARGFFIA ASYPMNDALV MGATPTTQRG LAMSLMSLLW
AGGWAISAVI WGWVTPIFGY GPQIVAAALA YALSALVIWS LRLQRSAEQT AA