Gene Caci_6215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6215 
Symbol 
ID8337578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7155076 
End bp7156290 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID644959316 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003116910 
Protein GI256395346 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0532982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCA CACTGCGCAT CCGCGACTAC CGCCTGCTCC TGACCGGCCA GTTGTTGTCC 
AGCATCGGAA ACTGGCTCCT TCTGGTCGCA GCACCCTTTT TCGTCTTCCG CCTGACCGGA
TCCACGATGG CGACCGGATT GTCCATGGCG GCGGAGACCG TCCCGGCCGT GTTGCTGGGA
CCGGTCGCAG GAGTCTTCGT CGACCGCTGG GACCGGCGCT GGACGATGAT CGCGACCGAC
GTGCTGCGCG CTGGAGCGGT ACTTCTGCTG CTGCTCGTCC ACAGTCGGGA TCAGGTATGG
ATCGTGTACG CCGCGCTCGC GCTGGAGTCC GCGTTCGGAC AGTACTTCGG GCCCGCGCAG
GGTGCGCTCA TTCCGAATCT CGTGGGGCGC GGCCCGGCAC TCAGTGCGGC GAATTCTTTG
GGGCAGTTGG TCGGCGGGAC GATCCGCCTT GTCGGCGGAC CGCTGGGTGG CGTGCTGTTC
GCGGTTGCCG GGTTCCGTGC TGTGGTGGCG GTCGACGCTG CCAGCTATGT CGCGTCGGCG
TGCCTCATAG GCCTGATTCG GTTTCGGGCG GTACGGGATG CGGTTGATGT CGGACGCGCG
CCTGATGCCA CCATTGGCGT CGTCCGATTG CTGGCCGAAG ACCTGCGCGT CGGATTCGGC
CACTTGCGAC ACACACCGGC GGTCCCGACA CTGTTCGGCG TCGCAGTGGT CTTCTTCACC
GGCAACGCCG TGCTGACCGC ACTGCTCGTC CCCTACCTCA GCACCGTCCT GGACAGCGGC
GCGCAAAGCC TCGGCATCCT GTTCGGGGCA CTCGGCATCG GCTTCGTCCT CGGCGCCCCC
GCCAGCCGCC TGGTCGCAGG ACGACTGTCG GACCGGACCA CGATCGCGGC CAGCCTGGGT
CTGCTCGCCG CAGTCTTCGC AATAACGTTC AACACGCCAC ACCTGGCCGG GGACATCGCG
CTGTTCACCC TCATCGGCCC ACCGGCGGTC TGCTTCCTGG TCACCGCCGA CACCTCGATC
ACCCGCCGGA CGCCCGACCG CCTCCAGGGA CGCGTCAGTT CCGTCTATCT GGCCGCGCAA
GGACTGGCCA CCTTGGTCGG CATGATCGCC GGTTCACTGC TCGGCCAACG GCTCGGCGTC
GTCGCCACGA TGGACGGCGC CGCTGTATTG ATCGCAGTGT CGGCGGGAGG GGCTCTGCTC
TTGCCAGGCT CCTGA
 
Protein sequence
MLGTLRIRDY RLLLTGQLLS SIGNWLLLVA APFFVFRLTG STMATGLSMA AETVPAVLLG 
PVAGVFVDRW DRRWTMIATD VLRAGAVLLL LLVHSRDQVW IVYAALALES AFGQYFGPAQ
GALIPNLVGR GPALSAANSL GQLVGGTIRL VGGPLGGVLF AVAGFRAVVA VDAASYVASA
CLIGLIRFRA VRDAVDVGRA PDATIGVVRL LAEDLRVGFG HLRHTPAVPT LFGVAVVFFT
GNAVLTALLV PYLSTVLDSG AQSLGILFGA LGIGFVLGAP ASRLVAGRLS DRTTIAASLG
LLAAVFAITF NTPHLAGDIA LFTLIGPPAV CFLVTADTSI TRRTPDRLQG RVSSVYLAAQ
GLATLVGMIA GSLLGQRLGV VATMDGAAVL IAVSAGGALL LPGS