Gene Cagg_1320 details


Gene Information

Locus tag: Cagg_1320
Symbol:
ID: 7268611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1629758
End bp: 1630951
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 60%
IMG OID: 643566162
Product: major facilitator superfamily MFS_1
Protein accession: YP_002462663
Protein GI: 219848230
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
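
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields above.
length = gene_length_bp(1629758, 1630951)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.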


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.296335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000456317
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTACGGA ATCGGGTCTT TACAGCAGTG GTCAGCGGTC ACTTTATGGT CGATGTTCTC 
AATAGTGTTG GTGCCGTCCT ATTAGCCGTG CTAGCCGGGC CATTGGGTCT GAGCAATCAG
CAAATTGGCT TAGCGTTAAC CGTCTACACC CTGCTCGGCG CACTCTCACA ACCATTATTT
GGCTGGATCG CCGACCGCCT GCCCGGACGC ACGCTCCAAC TCGCCGCACT TGGTGTAGCG
TGGATGGCGC TCTGCTATAC CGGTGTTGCC CTTGCTCCCA ACTGGACAGT GCTCTTACCC
TGTTTTCTCC TTGCTGCGCT TGGTAGTGGC CTGTTCCATC CCATCGGTAC GGCCGGCGCG
GCGGCGGCTG TGCCCGACAA GCCGGCGAGT GCCACCGCGA TCTTCTTCTT TGGTGGGCAA
GCCGGTTTAG CGGTAGGCCC AGTGTTGGCC GGTTTTGTGC TGACTGCTGC CGGGACGCCG
GGGATTATCC CGATTGCCGT GACTGCGATT ATTCCGGCGA CCATCTTACT GCTCGCGCAA
CAAACAGTAC GTAGTCGACG GACGAGTCGT GCTCCTGCCC CTGCGACGCC ACAGGTCTGG
ACGGCACTAG CCATTGCGGT CTTGATCGCC TTTCTTGCAC TCGTCGTCGT GCGTTCGTCA
ATCCAAGCGG TGTACCAGTC GTTCTTACCG AAACTGTTCA GTGATCGTGG CTGGGAACCA
ACCGTCTATG GCGCATTGGC CGGCACATTT ATGGCCAGCG CTGCGGTTGG CAACGTCCTG
AACGGATGGA TGGCCGATCG CTTTGGCATG CGGGCTGCGA CCGTGTGGCC GCTCATTCTG
AGTGTTCCCG CTGGGTTAGC ATGCTTTTTA GTAACACCGC TATGGGTCAT CTTCATCGCC
TGTGGGCTGG CCGGGATGCT GGTCGGTGGG CAACACAGCA TTCTGGTCGT CCATGCCCAG
CGCATCTTAC CGGTCAAGCA GGGTCTCGCT TCCGGCCTCA TTCTCGGCTT CACGTTCGCC
ACCGGCGCAT TTGGGACGTG GCTGAGCGGT CTATTGGCAG ACATTGTTGG GCTTGAGACG
GTGATGATCG GCGTGACCCT GCTTGGGTTG CCGGCGGCGG CATTAGCGCT GACCCTGCCC
GGTCGTGTGC ATCCGGCGAT GGTACCGGCG GCAGTGCCGG CACAGGGTGA TTAA
 
Protein sequence
MLRNRVFTAV VSGHFMVDVL NSVGAVLLAV LAGPLGLSNQ QIGLALTVYT LLGALSQPLF 
GWIADRLPGR TLQLAALGVA WMALCYTGVA LAPNWTVLLP CFLLAALGSG LFHPIGTAGA
AAAVPDKPAS ATAIFFFGGQ AGLAVGPVLA GFVLTAAGTP GIIPIAVTAI IPATILLLAQ
QTVRSRRTSR APAPATPQVW TALAIAVLIA FLALVVVRSS IQAVYQSFLP KLFSDRGWEP
TVYGALAGTF MASAAVGNVL NGWMADRFGM RAATVWPLIL SVPAGLACFL VTPLWVIFIA
CGLAGMLVGG QHSILVVHAQ RILPVKQGLA SGLILGFTFA TGAFGTWLSG LLADIVGLET
VMIGVTLLGL PAAALALTLP GRVHPAMVPA AVPAQGD
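
As a rough consistency check between the two sequences above, the following sketch translates a nucleotide string and computes its GC percentage. It assumes the gene sequence is written 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code. The names `CODON_TABLE`, `clean`, `translate`, and `gc_content` are illustrative and not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
prefix = "ATGCTACGGA ATCGGGTCTT TACAGCAGTG"
print(translate(prefix))  # MLRNRVFTAV
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.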