Gene Caci_3500 details

Gene Information

Locus tag: Caci_3500
Symbol:
ID: 8334853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3903292
End bp: 3904833
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 71%
IMG OID: 644956644
Product: major facilitator superfamily MFS_1
Protein accession: YP_003114247
Protein GI: 256392683
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
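
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3903292, 3904833)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

Both values agree with the Gene Length and Protein Length fields of this record.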


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.021178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCT CCGCCGCCGT CGCGGATCCG GCCTCCGCCT TCAGCGAACG CGCCGCCCGG 
CGCGCCGTCT GGCTGGTGCT GAGCGCGACG TTCGTCGTCT CCGCGGACAT CTCGATCGTC
GCCGTCGCCG CCCCGCCGAT CCAGCGCGGC CTGCACGCGA GCTCCGGGGA TATCGAGCTG
ACCGTCGCCG CCTACCAGAT CGCCTACGCC GCGCTGCTGA TCACCGGCGG CCGGCTCGGC
GACATCTTCG GCCGGCGCGC GCTGTTCACC TGGGCGTTCG CCGGCTTCGT CCTCACCTCC
GCAGCCTGCG GCCTGGCGAC CTCCCCGGGC CAGTTGGTGG CGTTCCGCGC GCTGCAAGGC
GTGACCGCCG CGATGCTCTC GCCCCAGGTG ATGGCGACCA TCCAGATCAT GCTGCCGCCC
GAGAAGCGCG CCGCGGCGTT CGGCGCGCAG GGCGCGATGC TCAGCCTCGC CACAGTCATC
GGGCCGGTGT TCGCCGGACT GCTGTACTCC GGGAACATCA TGGGCCTGTC ATGGCGGCCG
ATCTTCCTGG TGAACGTGCC CTTCGGGCTG GCGGCGATCT GGCTGGGCCG GCGCTACCTG
CCCTCGCTGC GCAATCCCGA GGCCAAAAGC CTCGACCTAC CCGGTACGTG TCTGGTCGTC
CTCGCGTTGG TCGCGCTCAT GACGCCGCTG TCACTGGGCG AGCAGTACGG ATGGCCGCTG
TGGTGCTGGC TGAGCCTGGC TGCCTCGCCG GTGCTGATCC TGGCGTTCCT GAAGCTGCAG
CAGGCTGAGG AGCGGCGCGG CGGGTCTCCG CTGCTGCCGA CCGACCTGTG GCGAGACCGG
GCGTTCCGTA CCGGCGTCGT GCTCTTCCTG CTGGCGTTCA GCGGGGTCGT GTCCTTCTTC
CTGTACTACT TCACCCTGAT CCAGACCGCG TACAACGTCT CCACGCTGTG GGCCGCGGTG
ACCACGATCC CGGTCGGGAT CGGCACGATC GCGCTGTCGG CTGCCTCGGG GCGGCTGGTC
CGCGCCTGGG GCGGGCGCCG GGTCGCCTCG GTCGGGGCGA TCGTGTGCTG CTTCGGCGCG
CTGTCGATGT TCATCCCGGT GGTCGCGGTC ACGGACTCCT CGCTGGCGCT GTGGTCCATC
CCGTCGCAGC TGGTGCTCGG CTCCGGAATC GGGATGCTGT TCGCTCCGCT GCTGTCGGTG
GTCCTCGCTG GAATCCGCAG CACGCACGCC GGCGCCGCCG CCGGACTGCT GGTGACGATG
CAGATCGCCG GCGGTGCGCT GGGGGTCAGC GCCATGGGAG TGCTCTTCAA CTCGCGGCTG
CCCGGAGGCT CCACGGACCA CGCGTCCCAC GGACAGCTCT CCTCGGCGAT GGTCCACGCC
ATGCTCTACA ACCCGGTCTC GTTCCTGGCG GCGCTGCTGG TGATCCTCGT TCTGCCGAGG
ACGGTGCGCA GTGCCGGAAG GGCTGCCGGG CCCAAGGGGA CGCCGGCCGG CGCGGCGGGT
GCTGCGGGAG CGCCGGGAAC TCCGGGAGCT GCTCATGCCT GA
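
The gene sequence is printed in blocks of 10 bases separated by spaces and line breaks, so it needs cleaning before any computation. The sketch below shows one way to strip the layout and compute a GC percentage that could be compared against the GC content field of this record. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGACCACCT CCGCCGCCGT CGCGGATCCG"
print(len(clean_sequence(first_blocks)))  # 30
```

Passing the full blocked sequence to `gc_percent` gives the value for the whole gene.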
 
Protein sequence
MTTSAAVADP ASAFSERAAR RAVWLVLSAT FVVSADISIV AVAAPPIQRG LHASSGDIEL 
TVAAYQIAYA ALLITGGRLG DIFGRRALFT WAFAGFVLTS AACGLATSPG QLVAFRALQG
VTAAMLSPQV MATIQIMLPP EKRAAAFGAQ GAMLSLATVI GPVFAGLLYS GNIMGLSWRP
IFLVNVPFGL AAIWLGRRYL PSLRNPEAKS LDLPGTCLVV LALVALMTPL SLGEQYGWPL
WCWLSLAASP VLILAFLKLQ QAEERRGGSP LLPTDLWRDR AFRTGVVLFL LAFSGVVSFF
LYYFTLIQTA YNVSTLWAAV TTIPVGIGTI ALSAASGRLV RAWGGRRVAS VGAIVCCFGA
LSMFIPVVAV TDSSLALWSI PSQLVLGSGI GMLFAPLLSV VLAGIRSTHA GAAAGLLVTM
QIAGGALGVS AMGVLFNSRL PGGSTDHASH GQLSSAMVHA MLYNPVSFLA ALLVILVLPR
TVRSAGRAAG PKGTPAGAAG AAGAPGTPGA AHA
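
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may serve as start codons; because this gene begins with ATG, a plain standard-code lookup is enough here. The following is a minimal sketch under that assumption, with hypothetical names, not the tool used to generate this record.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a blocked CDS, dropping layout whitespace and a trailing stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("ATGACCACCT CCGCCGCCGT CGCGGATCCG"))  # MTTSAAVADP
```

A gene that starts with an alternative start codon such as GTG or TTG would need its first residue set to M explicitly, which this sketch does not do.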