Gene Cagg_1664 details


Gene Information

Locus tag: Cagg_1664
Symbol:
ID: 7268966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2031101
End bp: 2032321
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 57%
IMG OID: 643566506
Product: major facilitator superfamily MFS_1
Protein accession: YP_002463001
Protein GI: 219848568
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
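
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2031101
end_bp = 2032321

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1221 bp
print(protein_length, "aa")  # 406 aa
```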


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCA CCCAGCACGC TCGCTATCGG GGCTTAAGCA TCCTCATCGT GATTAACTTC 
ATGATGTACG CCGGCTTTTT CATGGTCATC CCGCTCGTAT CAGTCCACTA TGTCCAAACG
ATGGGTTTTG CCGCAGTGAC GGTCGGGATG GCGCTCGCGT TGCGCCAACT CGTTCAGCAA
GGGGTGAGTG TCGGCGGTGG GGTGCTCTCA GATCGCTTCG GCGGACGTAA CCTGATTACC
GCCGGCGTCT TGATCCGCGC TCTTGGATTC GTCAGCCTTG CCTTTGCCAA CACACCATTG
CTGCTCTTCG CCGCGATGCT ACTCTCGGCG CTTGGTGGAG CACTCTTTGA AGCACCGAGT
CGAGCCGGGA TTGCTGTGTT GACAACCCCT GACGAACGCG CCCGTGCCTT TTCGATCAAC
GGGGTGGGCG GTGGTTTAGG GATGGTAGTC GGGCCTTTCG TCGGTTCGCT CTTACTCGAT
TTTGGCTTTA CTACGGTAGC CCTGGCAGCC GCCATCTGTT TTGCGCTGAT CGGCGTGCTC
AGCTTACTCT TACCGCCGCT GGAGACGGCA AGTGATCGGA CACGGCTAGG GTTTGGTTTG
AGGTTGGCAT TGCGCGACCG TCCGTTTCTG ATCTTTACCG CCTTACTGAT GGGCTACTGG
TTTATGTGGG TACAATTGAC GATCAGCCTA CCACTGGCCG GCGAGCGATT GACCAATGCC
GCCGATGCGG TGCGGTGGAT CTATGGTATC AATGCGGGGA TGACCGTCCT CTTGCAAATC
CCGATCATGG GGCTGGTTGA ACGACGCCTC CGACCACCCA CCATCCTGAT CCTCGGTATC
GCGTTGATGG CCGGTGGCCT GGGAATGGTT GCCATCGCCG AGACGTTTAC ATTGCTCATC
GGTTGTATCG TTATCTTTAC CATCGGCACC TTGCTTGCCA CCCCATCCCA ACAGAGCGTC
ACTGCCGCAC TCGCCGACCC ACGCGCGCTT GGCTCATACT TCGGGGTTAA TGCCCTAGCA
CTCGCATTTG GTGGCGGATT AGGGAACCTA AGCGGTGGTC TGTTGATCGA TCTCGCTACC
GTTCTCCATC TCCCGGCATT ACCATGGATT GTTTTTGCAA CGATTGGTCT TATCAGCGCT
ACCGGCCTCG TCATCCTCGA TCGTCGGTTG CAACGACAAT CAAATATCGC CGTCAACGCT
CAACAGCAAC CATCGCCGTA A
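
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before any length or composition check. A minimal sketch of a GC-content calculation follows; `gc_percent` is a hypothetical helper, not part of any database tooling, and only the first row of the sequence is used here as sample input.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGACCCTCA CCCAGCACGC TCGCTATCGG GGCTTAAGCA TCCTCATCGT GATTAACTTC"
print(f"{gc_percent(first_row):.1f}% GC over {len(''.join(first_row.split()))} bases")
```

Passing the full sequence instead of one row gives the value to compare against the GC content field.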
 
Protein sequence
MTLTQHARYR GLSILIVINF MMYAGFFMVI PLVSVHYVQT MGFAAVTVGM ALALRQLVQQ 
GVSVGGGVLS DRFGGRNLIT AGVLIRALGF VSLAFANTPL LLFAAMLLSA LGGALFEAPS
RAGIAVLTTP DERARAFSIN GVGGGLGMVV GPFVGSLLLD FGFTTVALAA AICFALIGVL
SLLLPPLETA SDRTRLGFGL RLALRDRPFL IFTALLMGYW FMWVQLTISL PLAGERLTNA
ADAVRWIYGI NAGMTVLLQI PIMGLVERRL RPPTILILGI ALMAGGLGMV AIAETFTLLI
GCIVIFTIGT LLATPSQQSV TAALADPRAL GSYFGVNALA LAFGGGLGNL SGGLLIDLAT
VLHLPALPWI VFATIGLISA TGLVILDRRL QRQSNIAVNA QQQPSP
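
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it builds the codon table by hand from the standard amino-acid string in T, C, A, G base order, and `translate` is a hypothetical helper. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), so the standard assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGACCCTCA CCCAGCACGC TCGCTATCGG"))  # MTLTQHARYR
print(translate("CAACAGCAAC CATCGCCGTA A"))           # QQQPSP
```

Both fragments match the start and end of the protein sequence listed above.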