Gene Cagg_2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2342 
Symbol 
ID7268692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2848198 
End bp2849481 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content56% 
IMG OID643567171 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002463656 
Protein GI219849223 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000352958 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00223229 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAACGCA ATTCTCCACT CTTGTTCATC TTCCTCACGA TCTTCATCGA CCTTCTCGGC 
ATCGGCATTG TGTTGCCGTT GCTGCCGGAA TATGTCAAAA TTATCGAACG CTCAAGCTGG
CCGTGGTTGG CCGATAACCG TGCTTTGGTG GTCGGCGCGC TCACTGCTTC GTATGCGTTG
ATGCAGTTTC TCTTCGCGCC TATCCTTGGT GCGTTAAGCG ATCGTTTCGG GCGCCGACCG
ATATTGTTGC TGAGTCTGTT TGGGGTCGGT CTGAGTTATC TCGTTTTTGC CGTCGCCGAA
AACCTGACGT TCCTCGGTGT CGAGACGGTT ATCGGGTTGC TGTTCCTTGC CCGTATTACG
GCCGGTATCA CCGGCGCCAG CATCAGCACA GCGCAGGCAT ACATTGCCGA TGTCACACCT
CCCAGTGAGC GCGCGCGTGG TCTGGGGATG ATCGGCGCTG CCTTTGGACT CGGTTTTATG
CTTGGTCCGG CTATCGGTGG CCTCCTTTCT AACATTAGCT TGCAGGCACC GGCGCTGTTC
GCTGCTGCAC TCAGCTTTGC TAACGTTATG TTTGGCTTCT TCCGCTTGCC CGAATCGTTG
CCACCAGAGA AGCGGATGCG GTCGGTGTCA CGCAATCTGA ATCCAGTTAC TCGTCTAACG
GCCGTCGCGC GCGATCCTCG AGTTCAACCT TTTATCTTCG GTAGTGTGTT ATTTAATCTT
GCCTTTGCCG GCCTGCAAAG CAATTTTCCG GTCTACAGCG ACGTGCGCTT CGGGTTTAGC
CCACAGCAGA ATGCGCTCGT TTTTGCCTTC ATCGGGTTGA TTGCGGTGTT GGTGCAGGGC
TTTCTTATCC GCAAATTGGT GGCACGCTTC GGCGAGGCTC GCCTGGCTTT GGCCGGTCTG
ACTCTGATGG CTCTTGGCTT TGCTGCGACC GGTCTCGCGC CTGCGAGTTG GATGCTCTTC
CCGGCAATCG GGATCGTGGC GCTGGGTAGT GGTATGCTTA CTCCATCGCT GACCAGCCTG
ATTTCGCAGT CGGTGTCGGC TACCGAGCAA GGCGCGATCC TCGGTGGAGT GCAGTCGTTT
AATAGCCTCA CGATGGTGCT AGGGCCGCTG TTGGCCGGTA CCCTGTTTGA CCTGATTGCA
TCAAATGCGC CATACCTGTT TGGGGCGGTC TTGCTCACCG GTGCGCTTAC CGTTCTGCTC
TCTACCCTGC GTCGGCGCTT TGTTACGATA CTGCAGCCCG ATACCGCAGT GGTTACCATT
GATACACCGG TTCGCGTTGA GTAG
 
Protein sequence
MKRNSPLLFI FLTIFIDLLG IGIVLPLLPE YVKIIERSSW PWLADNRALV VGALTASYAL 
MQFLFAPILG ALSDRFGRRP ILLLSLFGVG LSYLVFAVAE NLTFLGVETV IGLLFLARIT
AGITGASIST AQAYIADVTP PSERARGLGM IGAAFGLGFM LGPAIGGLLS NISLQAPALF
AAALSFANVM FGFFRLPESL PPEKRMRSVS RNLNPVTRLT AVARDPRVQP FIFGSVLFNL
AFAGLQSNFP VYSDVRFGFS PQQNALVFAF IGLIAVLVQG FLIRKLVARF GEARLALAGL
TLMALGFAAT GLAPASWMLF PAIGIVALGS GMLTPSLTSL ISQSVSATEQ GAILGGVQSF
NSLTMVLGPL LAGTLFDLIA SNAPYLFGAV LLTGALTVLL STLRRRFVTI LQPDTAVVTI
DTPVRVE