Gene Cagg_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0420 
Symbol 
ID7266588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp520153 
End bp521535 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content57% 
IMG OID643565287 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002461801 
Protein GI219847368 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAACG AACGTCAGCG TAATCAGATC CTCGGCCTGT TGTTCTTCGG CGTATTAATG 
GCAGCGCTCG ATATTGCCAT TGTTGGGCCG GCATTGCCGG TGCTTCAGCG TGTATTTACG
GTTAGCGAGC GACTGCTGTC GTGGGTTTTC TCGATCTATG TACTCGGCAA TCTAGTCGGG
ACACCGATGA TCGCCGCGCT CTCGGATCGG TTTGGTCGGC GGGCGTTGTA TGTTCTGAGC
TTAAGTAGCT TCGCGCTTGG TTCACTCATC GTTGCCCTTG CACCATCGTT TTCGGTACTT
CTCGTCGGGC GGTTGCTACA AGGTCTTAGC GCCGGCGGTA TTTTCCCGGT AGCCAGCGCC
GTCATCGGCG ACACATTTCC GCCAGAACGA CGTGGATCGG CTCTCGGCTT GATCGGGGCG
GTCTTCGGGA TCGCGTTTCT GATCGGCCCG ATCATTGGTG GGCTGTTACT CTTGCTTGGT
TGGCAATGGT TATTCTTGAT CAATTTGCCA ATTGCGGCAA TCCTCATCGC CTTCAGTGTG
CGACTCTTAC CGGGACGCAC AGTAACAAGC AGTGCGCCCT TCGATCTCAC CGGCTTGCTG
GTGTTGGGTA TCATGCTGAG CAGTCTCGCA TACGGCCTCA CCGAACTTGA TCCAGATGCG
ATCCGTGCTG GGAATGTACC ATTCTTTGCG ATAGGTGCCT TAATTGTCGC CGCCTTGCTG
GTACCGGCTT TTATCACGAT CGAGAAACGA GCCACCGAGC CAATTTTGCA GCCATCTATC
TTTCGTTCAC GTCAAATCTG GCTGACAGCA GCCTTGGCCG TCGGCGCCGG TATCGCCGAG
TCGTCGATTG TCTTCGTGCC GGCGTTGCTG ACGGCGGCGC ACGGTGTAAG CAGCTCAACT
GCCAGCTTTA TGCTCTTACC GGTGGTATTA GCGATGGCGG TCGGATCACC GGTTTCGGGT
CGGATGCTCG ATCAATTTGG GTCGCGGATT GTGGTGACTA TCGGTGTGAT TCTGAGCGGT
GCAGGGCTGG TGTTGCTCGG TGCGCTACCG ATGAGCCTTG TCGCCTATTA TCTTTCCGCG
ATCGTGTTCG GGATCGGCCT GGCGATCCTG CTCGGTGCAT CGTTACGGTA CGTTCTGCTG
AATGAAGTTC CGGCCAACGA ACGCGCAGCA GCGCAAGGAT TACTTACCGT CACGATGGGC
GTTGGGCAGT TGCTCGGCGC GGTGTTGGTT GGCTTGATCG CCGCCACCGG TGGCGGTGGA
GCCGGTGGAT ATGGGGTGGC CTTTTTAGTC ATCGGTATCT TGATGCTCGC CCTAACCTTT
GCCGGCTTGG GGTTGAAGAA TCGGACGGCC GAGAAAGCCA CAGCGCTGGC CCATGCTCAT
TAG
 
Protein sequence
MVNERQRNQI LGLLFFGVLM AALDIAIVGP ALPVLQRVFT VSERLLSWVF SIYVLGNLVG 
TPMIAALSDR FGRRALYVLS LSSFALGSLI VALAPSFSVL LVGRLLQGLS AGGIFPVASA
VIGDTFPPER RGSALGLIGA VFGIAFLIGP IIGGLLLLLG WQWLFLINLP IAAILIAFSV
RLLPGRTVTS SAPFDLTGLL VLGIMLSSLA YGLTELDPDA IRAGNVPFFA IGALIVAALL
VPAFITIEKR ATEPILQPSI FRSRQIWLTA ALAVGAGIAE SSIVFVPALL TAAHGVSSST
ASFMLLPVVL AMAVGSPVSG RMLDQFGSRI VVTIGVILSG AGLVLLGALP MSLVAYYLSA
IVFGIGLAIL LGASLRYVLL NEVPANERAA AQGLLTVTMG VGQLLGAVLV GLIAATGGGG
AGGYGVAFLV IGILMLALTF AGLGLKNRTA EKATALAHAH