Gene Cagg_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1787 
Symbol 
ID7267699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2192734 
End bp2193996 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content58% 
IMG OID643566627 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002463122 
Protein GI219848689 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.406772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATAC CAACCACCTT CGACTATGAA GTTGCTACGC GCCGGATCAT CGGCGCTTTA 
TTCGTTACGC AGAGTTTAGC ATCAGCCGCA ATTATTGCCA ACATCGCCGT GAATGCGATT
GCCGGTGCGC AACTGAGTGG CAATGACGCA CTCGCCGGCT TGCCGGCAAC CCTGATGCTG
GCCGGGGCGG CGCTGTCAGC CTATCCCGCA GGGCGAGCGA TGCAGCGGTT TGGCCGCCGG
CCCGGTCTCC TTGTGGGGAT GGTGTTGGGG CTGATGGGCA TGCTGATTGA CGGAGTAGCG
GTACTTAGCC ACTCGTTTCT TCTCTTTTTA GGCGGCCTGT TTGTGGTTGG CATGGCGCGT
GGGATTATCG ATCAGAGTCG TTACGCCGCT GCCGATGTCG TCTCGCCGGA ACGACGGGCC
GGTGCGATCA GCACAGTGGT CTTTGCGAGC ACTATCGGCG CAGTGGGAGG GCCGTTGTTG
GTAGGGCCGT TGGGTCAGGT GGCGGCAGCC GGTGGCTTAC CTGAGTTGAC CGGACCAATG
TTTGGTGGGG TAGCCCTCTT CGCCATCGCC ACGTTGGTCA TGTTTGTCTT TATGCGACCC
GATCCGCGCA CGTTGGCGCT GCGCTTGAAT GTTCAGACGA CCACAGCCGA TGCCACAACG
GTGGTACCGG TGCGTTCAGT GGGTACGATT CTGCGGCTCC CGCTCGTTCG GGCCGGACTG
GTGAGTATGG TGCTTGGTCA GGTGGTGATG GTGTTGGTGA TGAGTGTCAC CTCGCTTCAT
ATGAGCCATC ACGCTCACGG TCTTGATAGC ATCTCGTTGG TGATCGGTAC CCATACCTTT
GGCATGTTTG GCCTATCAAT GTTCACCGGT CGGATCGCCG ACCGCCTGGG TCGGCCCCTG
ACGATTATAT TTGGCGCTCT GATGTTAATC GTCGGGACAT TGATTGCACC GGCATCGCTC
TTGACGCCAT GGCTGGCTTT GGGATTGTTT CTTGTCGGGT TGGGGTGGAA CTTTTGTTAT
ATTGCCGGCT CAGCACTGGT GGCAGACGCC ATTGTGCCGT CGGAGCGTGG TGCGGTGCAA
GGCGCGAGCG ATCTGCTCGT CAATCTAGGT TCGGCATTTG GTAGCCTGAG CAGTGGGTTT
ATTCTGGCCG GGTTAGGGTA TCTACTACTC TGCTTGATCG GAGCGGTTCT TAGTCTTATC
CCTCTGAGCG CGGCGTTGTG GTGGGGACGT TCGGTGCGCC AGACAGTGGC TGCGGCTGAT
TAA
 
Protein sequence
MSIPTTFDYE VATRRIIGAL FVTQSLASAA IIANIAVNAI AGAQLSGNDA LAGLPATLML 
AGAALSAYPA GRAMQRFGRR PGLLVGMVLG LMGMLIDGVA VLSHSFLLFL GGLFVVGMAR
GIIDQSRYAA ADVVSPERRA GAISTVVFAS TIGAVGGPLL VGPLGQVAAA GGLPELTGPM
FGGVALFAIA TLVMFVFMRP DPRTLALRLN VQTTTADATT VVPVRSVGTI LRLPLVRAGL
VSMVLGQVVM VLVMSVTSLH MSHHAHGLDS ISLVIGTHTF GMFGLSMFTG RIADRLGRPL
TIIFGALMLI VGTLIAPASL LTPWLALGLF LVGLGWNFCY IAGSALVADA IVPSERGAVQ
GASDLLVNLG SAFGSLSSGF ILAGLGYLLL CLIGAVLSLI PLSAALWWGR SVRQTVAAAD