Gene Cagg_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1298 
Symbol 
ID7268589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1597428 
End bp1598636 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content57% 
IMG OID643566141 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002462642 
Protein GI219848209 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.167522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTAA ACCATCATCG CTACATTGTC GCCGGTTCGG TCCTGGTTGA TATGCTGGGA 
TACGGCTTGA TTATGCCGTT ACTCCCGTTT ATGGTACAGA AGTGGGGAGG TAACGCGACG
ATCATCGGGT TGCTCGGCTC ACTGTATGCG CTGATGCAAT TGCTGGCGGC ACCGCTGCTC
GGGGCGCTTT CCGACCGCGT CGGACGACGA CCGGTTATTT TGGGCTGCTT ATTCGGATCG
GCATTGGCAT ATAGCTGGCT AGCCCTCGCC GACTCACTCC CTTTACTGGC GGCCGCCATC
GCTCTTGGCG GGGTAGCCGG TTCGAGCATG CCGGTGGCGC AGGCTTACAT TGCCGATGTC
ACGTCGCCAA CCGAACGAAC TCACGGCTTC GGGCTGCTAG GTGCGGCATT TGGGCTTGGC
CTGATCGGTG GAGCTGCTAT CGGCGGCTTC CTCAGCCAGT TCGGATTAGC CCTGCCCCCA
CTCGTCGCCG CTACCATTGC CCTGAGCAAC GCTATCTACG CCAGCATTGT CTTACCCGAG
TCGTTACCAC CAATGCGCCG CCATGCATCG TCGTTGCCCT TCCGCAACCT GTTCGGATCG
GCGCTCATTG CGTTACAAGC CTCATCGGTG CGACCGTTGT TGATCGCGGT GATGCTACTC
AACATCACAT TCGCCGGTTT ACAAAGTAAT ATTGCACTCT ACACACTCAC CCGTTTCGGC
TGGGGACCAG ATCAGAATGC GATCTTGTTT GTCTTTGTGG GGCTGTGTGC TGCCCTAACG
CAAGGTGTGC TCATCGGCAA GTTACAATCA CGGCTTGGCG ATGCATGGTT AGCCAGCGGT
GGTCTGGGGC TGATGGCCCT CGCCTTTGCC TTTGTCAGTG GTGTATATAC GGATTGGCTT
CTCTTTCCGC TCGCCGCGCT TCTCGCCATC GGGATGGGGC TAGCCGTACC GGCCATCACA
AGCCTCGTCT CACGACAGGC CGGTGAAAAC CGGCAAGGAA TTGTGCTAGG CGGGATGCAG
GCATTGATCA GTGTAGCTCT CCTGATCGGG CCGGCCAGTT TCGGTTTCCT CTTCGACCGG
TTTGGCAGCA CTACACCCTA TCTCGCAGGA GGCATGCTGT TGATCGGCGC ATGGCTTATG
ACAACAATAA CCATCGCCAA CCCGTCTATT CTTATTCGTT CTGCTGAAGA TCCAGAGGCT
CAGGTATGA
 
Protein sequence
MTLNHHRYIV AGSVLVDMLG YGLIMPLLPF MVQKWGGNAT IIGLLGSLYA LMQLLAAPLL 
GALSDRVGRR PVILGCLFGS ALAYSWLALA DSLPLLAAAI ALGGVAGSSM PVAQAYIADV
TSPTERTHGF GLLGAAFGLG LIGGAAIGGF LSQFGLALPP LVAATIALSN AIYASIVLPE
SLPPMRRHAS SLPFRNLFGS ALIALQASSV RPLLIAVMLL NITFAGLQSN IALYTLTRFG
WGPDQNAILF VFVGLCAALT QGVLIGKLQS RLGDAWLASG GLGLMALAFA FVSGVYTDWL
LFPLAALLAI GMGLAVPAIT SLVSRQAGEN RQGIVLGGMQ ALISVALLIG PASFGFLFDR
FGSTTPYLAG GMLLIGAWLM TTITIANPSI LIRSAEDPEA QV