Gene Caci_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3297 
Symbol 
ID8334650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3637618 
End bp3639051 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content69% 
IMG OID644956442 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003114045 
Protein GI256392481 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.467563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG AGGCTGCCAC TGCCGCGACC CCCGCACGGT TCGCGGGTTC GTGGAGCGAG 
CTGTTCCGAG GCGGGACCGG CTTGTTGGTG GTCGGTCTGC TGCTGATGGA GTTCGCCACG
GCGATGCAGT ACTTCGCGGT GGCTGTGGTG ATGCCGTTGG TGGCGCACGA TCTGCACGCC
GAACGTTCCT ACGGCTGGCT GCTCGGCTCC TACGGGATGG CGATGATCGC CGCCGCGCCG
CTGACACCGG CGATCACGGC GCGGTTGGGG CGGTTGCGGA CGGCGGGGGT GGCCTCGGTC
GTGTTCGTGG TCGGCGGGGT GTTGGCGGCG GTGGCCGGTT CCGCGGTGCT GTTCGTTGTC
GCGCGGCTGT TCCAGGGGTT CGGGTCTGGC GTCCTGACCA CCTTCGGAGT GAGCGCCGTC
GCGCACTCGA TTCCCGAGCG GCTGCGCAAG CGCGTGTACT CGCTGATCTC GGCGATGTGG
CTGCTGCCGG CTTTCCTCGG GCCGGTGTAC GCCTCGACGG TGTCGGACTT GCTCGGCTGG
CGCTGGACGC TGACGCTGAT TCTGCCGTTG GTGGTCGTCG GGCGCGCGCT GGTGGTGCGG
CGGGCTGGGG CGATGCGGCA GGAGGCGGCG CAGGGGGTCG AGGGGGTCGA GAAGGACAGC
GCGGTGCGGA CACCGCCGTT CATTCGGGTC CTGATACTCA TCGGCGGATG CTTGCTGCTG
TTGGGCGGGA CCAGTGTGAA GAGTGGCGTC GGGCAGATCG TCGCAGTGTG CGGGCTCCTC
GTGGTCGCGG TGGCGGCGCG GAAGCTGCTG CCGTCCGGCG AACGTGGTCC GCGGTTGGCG
GTCCTGGCGA TGCTGACGCT GTCGCTGGGT TATTCAGGGC TGGACGCGAT GGCGACGGTC
ATCGCGCGCA GCGGGTTCGG CTCGTCGATC GCGGCGGCGT CGGCGGTGCT CACGTGCGAC
GCGGTCGCCT GGTCGACGGT GGCGTTCCTG CAGCCGAAGT TCCACGAGCG GTGGAATCTG
AGCACCGGCG CGGCAGGCGT GGTCGGCGCG CTGTTGGTCG CCGTGCCGGT GGCGGGGATG
CCGGTGATGC TGGCCGCGCA CCTGTCATCG GGTACTGCGA TGCCGCTGAT GTGGGTGGCG
TTCCTCATCT CCGGATCGGG CATGGGCTTC ATCTACACGA ACCTGCCGGT GACGGCGGTG
GACGTCCGGG ACAAGTCGAC GACCGACGCC TTCGCCGCCG GGCTCGTGCT CGCCGAATCC
ATGGGAGCGA GTCTGGGCTC GATGATCGGT GGCGGTCTGT ACGCCTACGG TCTCCAGCGC
GGACTGTCGG CGTGGCACTC CCTTTCCGTG GCGGTCGGGG CGCTGAGCGT CTCGCTGTTC
GCGACTGTTT TCATCGCCGT CGCGATTCAG CGGCACCTAC GCCTCCGCGG CTGA
 
Protein sequence
MSTEAATAAT PARFAGSWSE LFRGGTGLLV VGLLLMEFAT AMQYFAVAVV MPLVAHDLHA 
ERSYGWLLGS YGMAMIAAAP LTPAITARLG RLRTAGVASV VFVVGGVLAA VAGSAVLFVV
ARLFQGFGSG VLTTFGVSAV AHSIPERLRK RVYSLISAMW LLPAFLGPVY ASTVSDLLGW
RWTLTLILPL VVVGRALVVR RAGAMRQEAA QGVEGVEKDS AVRTPPFIRV LILIGGCLLL
LGGTSVKSGV GQIVAVCGLL VVAVAARKLL PSGERGPRLA VLAMLTLSLG YSGLDAMATV
IARSGFGSSI AAASAVLTCD AVAWSTVAFL QPKFHERWNL STGAAGVVGA LLVAVPVAGM
PVMLAAHLSS GTAMPLMWVA FLISGSGMGF IYTNLPVTAV DVRDKSTTDA FAAGLVLAES
MGASLGSMIG GGLYAYGLQR GLSAWHSLSV AVGALSVSLF ATVFIAVAIQ RHLRLRG