Gene Cagg_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3472 
Symbol 
ID7269698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4231902 
End bp4233152 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content55% 
IMG OID643568281 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002464748 
Protein GI219850315 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACCGC TCATTTCGCG CGCCGAGGTA CGCGCGCTTG TCCGCCCAAC CAATGTCACC 
GAGCGCAACG TGCGTAATGT ATTAATCGAT GGCATTGGAG TTGGGATCGT GACGGGGGTG
GGTGTCTTTC TTCCCGTCTT TTTGGCTCGT TTGGGCGCGT CGAGTCTGTT GGTCGGTTTG
ATTACCTCAT TACCGGCGCT TACCGGTGCC TTATTCGCCT TACCGATCGG TCGTTTTCTC
GAACGGCAGC GCAATATTGT AGCCTGGTAT TCGGGGATGC GCTTTTGGGT GCTGGCCTCG
TATGCTGTAT TCGGCCTCTT GCCGTTTGTG CTCCCGCTTT CGGTAGTGCC GTGGACGATC
ATCGTCATTT GGGCGTTGGT AACGATCCCA TCAACCTTTG TCAATGTCGC CTTCACCATT
GTTATGGGTG AGGTTGCCGG ACCACAGCGC CGTTATGCGC TGATGAGCAT GCGTTGGTCG
AGTCTGGGTT TGGCAACGGC AGTGACGGTA GCGGTGATCG GCGCACTGCT TGATCTGATC
CCGTTTCCGC TCAATTATCA GGTGGTCTTT ATCGGTTCGA TGGCGGGTGG GTTGCTCAGT
TTCTTGTTTT CACGGGCGAT TACCTTGCCT GAGCGAACGT TCGTAGCAAC GCAGCGACGT
CAAGAGTCGC TGTTCGTGGC GTTACGCCAG GCTCCGCCGG CATTTATGCG GTATGCGTTG
AGCGCCTTCG TGTTTCGCAG CGGTGTGGCC ATGGTGATAC CCCTGATACC ACTGTATTAT
GTGCGTGAAG CAGGTCTGAA CGATGCATGG ATTGGTTTGA TTAGCACGGT GGGGAATGGA
GTCTTGCTGG TGGCGTATGC GGTGTGGTCG GCAGGTGCGC CACGGTTGGG TAATCATGGA
GTGTTGTTGC TGAGTAGCCT TGGGATGACG CTGTATCCGT TTGGCGTTGC GCTAACCGAG
ACGCCGTGGT TGTTGGCTAT CCTGGCCGGC TTAGCAACGT TCTGTGTTGC CGGGAATGAC
CTGGTCAATT TTGATCTAGT ACTGAGTTCA ATCCCACCCG AACGCCAGGC AACGTATATT
GGGTTGTTTC AGACGTTACA GAACCTTGCG TTGTTTGTTA TGCCGCTGGT AGCAACGGTA
TTGGCCGACG TAGTGGGAAT TGTACCAATG CTCGTTATGG CCGGTGTGCT GCGATTTTTA
GGGGTAGCAC TCTTTGCCTG GCTGAAGGTA GGCAAACCTG CCAGTGCATA G
 
Protein sequence
MRPLISRAEV RALVRPTNVT ERNVRNVLID GIGVGIVTGV GVFLPVFLAR LGASSLLVGL 
ITSLPALTGA LFALPIGRFL ERQRNIVAWY SGMRFWVLAS YAVFGLLPFV LPLSVVPWTI
IVIWALVTIP STFVNVAFTI VMGEVAGPQR RYALMSMRWS SLGLATAVTV AVIGALLDLI
PFPLNYQVVF IGSMAGGLLS FLFSRAITLP ERTFVATQRR QESLFVALRQ APPAFMRYAL
SAFVFRSGVA MVIPLIPLYY VREAGLNDAW IGLISTVGNG VLLVAYAVWS AGAPRLGNHG
VLLLSSLGMT LYPFGVALTE TPWLLAILAG LATFCVAGND LVNFDLVLSS IPPERQATYI
GLFQTLQNLA LFVMPLVATV LADVVGIVPM LVMAGVLRFL GVALFAWLKV GKPASA