Gene Cagg_1176 details


Gene Information

Locus tag: Cagg_1176
Symbol:
ID: 7267925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1450930
End bp: 1452282
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 55%
IMG OID: 643566019
Product: major facilitator superfamily MFS_1
Protein accession: YP_002462521
Protein GI: 219848088
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
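
The gene and protein lengths in the record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported 1353 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1450930, 1452282)
print(length)                     # 1353
print(protein_length_aa(length))  # 450
```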


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0522954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCGG TGGCTGTGCC CGAAACTGAT ACCCGTCGTG CCATTATGAC CGTGCTTTTT 
GTCGGCGTCT TTATGTCGGC ACTCGATTCG GCCATCATTG GCCCGATCGT ACCAGCCTTG
CGAGCTGCAT TTGCGATTGA CAATACGCAA GTGGTGCTTG TCTCGATTAT CTTCACGCTC
TGCTCGCTCA GTAGCACAAC TCTCATGGCT AGCCTTAGTG ATCGGTATGG TCGCCGGCAT
GTCTATCTCC TCAACGTGTT TGGCTTTGCC ATCGGTTCAC TGGTGATCGC GCTGTCACAT
GATCTGATGA CCGTCTTGAT TGGTCGTGCG CTGCAAGGGA TATGTGCCGG TGGCATTACC
CCTACTGCTA GTGCCGTCAT CGGCGATGTG TTACCTCCGG TTGAGCGGGC AAAGGCGCTT
GGCCTGATCG GCGCAACGTC GGGCATGGCT TTTCTGATCG GTCCGGTTTT GGCGTCTCTC
ATTCTCGCGT TTGCCGACTG GCAGTGGATT TTCTTGCTCA ATTTACCGGT AGCGGCGGCA
GTGATCGTGC TGGGTTGGCG TGCATTGCCA CGCACGAATC CTCACCAACC AGAGCTGCAC
GATACGTTTG ATTGGCCGGG ATTGCTGCTC CTCATCACGA TCTTGGTTAG CCTGACCCTT
GGTATTAATC AATTGCTCGA CCGCTTCCTG GGTATGACGA TCTGGCCGTG GCTATTTGGG
CTGGTGGCTT TGCTCACACC GGTGCTTGCG CGGCGTGAAC AACGAACGCC GGCGCCATTG
CTACCACCAC GCCTGTTTAC CAATCGTCAG TTGCAGTTCG TCTACCTCTT GGCAACCGGG
TCGGGAATTG CAATGGCCAG CATTATCTTC ATTACCTCAG TCGCCGTGAA CTTCGGGGTA
CCGATTAGCC AAGCCGGTTT CTTCCTCTTA CCGCTCGTCT TTCTTGCTTC GGTTACATCA
ATCATCGGTG GGCGAATGCT CCCGAAGATC GGTGGTCGGG CCGCGATGCT GATCGGGTAT
GGGCAACTAA CGGTGGGTAA TCTGATGCTA GGCTGGCCGA GCGCACCGTT CTGGCTCTTC
GTGATCGCGA CGATTGTCGT CGGTAGCGGC TTGGGTATTG TCGTTGGCGG AACACTACGG
GCATTGGTAC TTGAAGAGGT CGCTCCCGGT GATCGCGGTG TCGCCCAGAG CGTGATCAAT
ATTTCGATTA GTATCGGCAC CCTGATCTCG GTAGCAGTGA TGGCCAGCAT TGCCGACACT
ATTAATCTGT CGGCCGCATA TCTGGCTTGT GCTGCGGTAA TGGCTTTAAT GACGGTGATT
AGTCTTGGTT TACGCCGCCA GTACCGGACG TAA
 
Protein sequence
MASVAVPETD TRRAIMTVLF VGVFMSALDS AIIGPIVPAL RAAFAIDNTQ VVLVSIIFTL 
CSLSSTTLMA SLSDRYGRRH VYLLNVFGFA IGSLVIALSH DLMTVLIGRA LQGICAGGIT
PTASAVIGDV LPPVERAKAL GLIGATSGMA FLIGPVLASL ILAFADWQWI FLLNLPVAAA
VIVLGWRALP RTNPHQPELH DTFDWPGLLL LITILVSLTL GINQLLDRFL GMTIWPWLFG
LVALLTPVLA RREQRTPAPL LPPRLFTNRQ LQFVYLLATG SGIAMASIIF ITSVAVNFGV
PISQAGFFLL PLVFLASVTS IIGGRMLPKI GGRAAMLIGY GQLTVGNLML GWPSAPFWLF
VIATIVVGSG LGIVVGGTLR ALVLEEVAPG DRGVAQSVIN ISISIGTLIS VAVMASIADT
INLSAAYLAC AAVMALMTVI SLGLRRQYRT
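
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated, and checked for GC fraction. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons), and that the sequence is given in coding orientation. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGGCTTCGG TGGCTGTGCC CGAAACTGAT ACCCGTCGTG CCATTATGAC CGTGCTTTTT"
last_line = "AGTCTTGGTT TACGCCGCCA GTACCGGACG TAA"

print(translate(first_line))  # MASVAVPETDTRRAIMTVLF
print(translate(last_line))   # SLGLRRQYRT
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the protein sequence after both are run through `clean`; `gc_fraction` on the full gene gives a value to compare with the reported GC content.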