Gene Caci_4121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4121 
Symbol 
ID8335475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4662942 
End bp4664357 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content71% 
IMG OID644957224 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003114826 
Protein GI256393262 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.878146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACA AATCTCACCG CGCCAAAGCA CTGCTGGTCG CCGGGTGTTT CTTCATGGAG 
ATGCTCGACG GCACCATCGT CACCACCGCC GCGCCGCAGA TGGCCCGGTC CCTGCACGTC
AGCTCCTCGG CCATCGGGCT GGTCATCACC TCCTTCCTGC TCACCTTCGC CGCGCTGATA
CCGCTCAGCG GCTGGCTGAC CCGCAGGTGG GGCACGCGGC CGGTGTTCCT CGCCGCGATC
GCCGTGTTCA CCGCCGCCTC CTTGGGCTGC GCGCTCAGTA CGACGCTGCC GGTGCTGATC
GCCATGCGGG TCCTGCAGGG CTTCGGCGGC GCGATGATGG TGCCGGTCGG ACGGCTGATC
GTGTTGGCCG GCGCGGAGAA ATCCGACCTG CTGCGGCTCA TGGCGTACAT CGTGTGGCCG
GGGTTGATGG CTCCGGTCGT GGCGCCGCTG GCCGGCGGGC TCATCACGAC CTACGCCTCC
TGGCCGTGGC TGTTCGGGAT CAACATCCCG CTCGGCGTGG TGGCGTTCGC CATCGCCTGG
CGCGTCGTGG AGGCGGCGCC CACCGAGCGG CCGCCGCGGC TGGACCGGCT CGGCGTCGTG
CTCACGTGCC TCGGGCTCGG CGGCCTCACG TATGCCGCGC ACCTGTTCTC CGACACCGAC
ATCTCCTGGG CCACCGCCAT CGCCACCGCT GTGGTGTCGG TCGTCCTGCT CGCCGCGGCG
ACGCGCCACC TGTTGCGGAC CGAGGCGCCG CTGGTGAATC TCAGGGTCCT GCGGATCGCG
ACGCTGCGGA CGTCGGTCAG CGGGGGTTCG GTGTTCTGGC TCGTCGTCGG CGCCGGACCG
TTCCTGCTGC CGCTGCTGTT CCAGAACGTG TTCGGCTGGA GCGCCGTGAA GTCCGGGGCG
GTGGTCCTGT TCATCTTCGT GGGCAACATC GGCATCAAGC CGGCGACCAC ACCGATGCTC
AACCGCTTCG GCTTCCGTCC GGTGCTCGTC GCCTCCACGC TGGTGATGGC GGCGGCGATG
GCTGCCGCCG GGCTGCTCAC CGCCCACACG CCGATCGTCC TCACCTGCGC GCTGATCCTG
CTCAGCGGCA TCGCCCGCTC GGTCGCGCTG ACCGCGTTCA GCACCATCGC CTACAGCGAC
GTCGGCCCGG AGGAGATGCG CGACGCCAAC TCCATCGCCG CCACCGCCTT CCAGATGTCC
GCGGGACTGG CGATCGCCGT GAGCACCATC GCCCTGCGCG CCGGCGGACC CTTGGGACGG
CTGCTGCCGG GAGCGCCGAG CGCCGGGACC GCCTACACCG TCGCGTTCCT CATCCTCGCG
CTGTTCTCGC TGAGCGTGAC GGTGACCGCG TTGCGCATGC ATCCCGACGC CGGCGCACGC
GTGCGGCGCG TGCGGCCGGT CGCGGCGCGT CCGTGA
 
Protein sequence
MIDKSHRAKA LLVAGCFFME MLDGTIVTTA APQMARSLHV SSSAIGLVIT SFLLTFAALI 
PLSGWLTRRW GTRPVFLAAI AVFTAASLGC ALSTTLPVLI AMRVLQGFGG AMMVPVGRLI
VLAGAEKSDL LRLMAYIVWP GLMAPVVAPL AGGLITTYAS WPWLFGINIP LGVVAFAIAW
RVVEAAPTER PPRLDRLGVV LTCLGLGGLT YAAHLFSDTD ISWATAIATA VVSVVLLAAA
TRHLLRTEAP LVNLRVLRIA TLRTSVSGGS VFWLVVGAGP FLLPLLFQNV FGWSAVKSGA
VVLFIFVGNI GIKPATTPML NRFGFRPVLV ASTLVMAAAM AAAGLLTAHT PIVLTCALIL
LSGIARSVAL TAFSTIAYSD VGPEEMRDAN SIAATAFQMS AGLAIAVSTI ALRAGGPLGR
LLPGAPSAGT AYTVAFLILA LFSLSVTVTA LRMHPDAGAR VRRVRPVAAR P