Gene Cagg_0390 details

Gene Information

Locus tag: Cagg_0390
Symbol:
ID: 7268491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 482834
End bp: 484174
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 52%
IMG OID: 643565258
Product: major facilitator superfamily permease
Protein accession: YP_002461772
Protein GI: 219847339
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
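
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(482834, 484174)
print(length, protein_length_aa(length))  # prints "1341 446"
```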


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.778281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.55351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAT TGACGGCCGT AACCGACAAC CGGCGTGAAC AGATCGGCTG GTATTTCTAT 
GATTGGGCTA ATTCGGCATT TTCCACAACG GTTGTTACTG TCTTTTTAGG ACCATACCTC
ACCGCAGTAG CAAGGAACGC AGCGGATGCC AACGGCTTTA TCTATCCGTT CGGTATTCCG
GTCGCAGCCG GATCGTTTTT TCCCTACATG GTCTCGCTCT CGGTATTGTT ACAGGTCTTC
TTCTTACCCG TTCTCGGTGC GATTGCCGAC TATTCCAATG CCCGCAAGCA GATGCTAGCC
ATCTTTGCCT ATGTTGGTGC GTTTGCAACA ATGGGACTCT ACTTTCTCAA CGGTGATAAC
TATCTGCTTG GGGGTATGCT CTTTCTCATT GCGAACGTTT CCTTCGGTGC ATCGGTTGTT
TTTTACAACG CATTTCTCCC TGACATCGCC AGCCCTGACC GACGCGATGC CGTTTCGTCA
CAGGGATGGG CATTGGGCTA TTTGGGGGGG GGGTTACTGT TGGCAGCAAA TCTCGTCTTC
TTTCTCAATG CCGAGTCGCT TGGGGTAGAG AGCAGTATGG CCGTCCGGAT CAGTCTTACC
TCGGCGGGCA TGTGGTGGGC AATTTTTACC ATCATTCCGA TGTTGACCTT GCTTAATCGC
GGTGCAGTGC GTCGTTTACC TCCCGGTGAA CATTACCTGA CGGTTGGGTT TCGGCAACTG
GCCCACACCT TGCGTCAGAT GCGAAATTAT CCACAAACCT TACTCTTTTT AGCCGCGTAT
CTCCTCTACA ACGATGGGAT ACAGGCAGTA ATTGCCCTGG CAGCGCAGTT TGGTGCCGAA
GAGTTAGGTA TTGGCGAAAC AACGCGCATT GCTACTATTC TGATGGTGCA ATTTGTGGCG
TTTGTTGGGG CATTAGCTTT TGGTGCGTTG GCAAGCCGTT TAGGATCGAA ACGGGTCTTG
CTTGGCAGTC TCGTCATTTG GACGGTCGTG GTCGCCTATG CTTACGTGAT GCCTGCCAAT
GATGATCTGC AATTTATCGC GTTGGGAGCG GCGATTGCGT TGGTCTTGGG TGGGAGTCAA
GCAATCAGTC GGTCGCTCTT CTCGCTCATG ATTCCTGATG GGCAAGAGGC AGAGTACTTT
AGCCTGTATG AAGTTGGTGA GCGCGGCACA AGCTGGCTCG CTCCGCTGCT TTTTGGTCTG
GCCTATCAGT TCACGTCAAG TTACCGCGTA GCGATTGTGT CACTGATTAT TTTTTTCATC
GGTGGATTTA TTTTATTGTT GTTTGTGAAT GTGCGGCGGG CCGCCGAAGA GGCCGGCAAT
CAGGCCCCCG CGCACGTGTA G
 
Protein sequence
MTTLTAVTDN RREQIGWYFY DWANSAFSTT VVTVFLGPYL TAVARNAADA NGFIYPFGIP 
VAAGSFFPYM VSLSVLLQVF FLPVLGAIAD YSNARKQMLA IFAYVGAFAT MGLYFLNGDN
YLLGGMLFLI ANVSFGASVV FYNAFLPDIA SPDRRDAVSS QGWALGYLGG GLLLAANLVF
FLNAESLGVE SSMAVRISLT SAGMWWAIFT IIPMLTLLNR GAVRRLPPGE HYLTVGFRQL
AHTLRQMRNY PQTLLFLAAY LLYNDGIQAV IALAAQFGAE ELGIGETTRI ATILMVQFVA
FVGALAFGAL ASRLGSKRVL LGSLVIWTVV VAYAYVMPAN DDLQFIALGA AIALVLGGSQ
AISRSLFSLM IPDGQEAEYF SLYEVGERGT SWLAPLLFGL AYQFTSSYRV AIVSLIIFFI
GGFILLLFVN VRRAAEEAGN QAPAHV
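
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how the two relate: strip the spacing, then translate codon by codon. The codon assignments used here are those of the standard genetic code, which translation table 11 shares for elongation; alternative start codons allowed by table 11 are not handled, which is a simplifying assumption that does not matter for this gene because it starts with ATG. All function names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGACAACAT TGACGGCCGT AACCGACAAC"))  # prints "MTTLTAVTDN"
print(translate("CAGGCCCCCG CGCACGTGTA G"))           # prints "QAPAHV"
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.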