Gene Caci_3674 details


Gene Information

Locus tag: Caci_3674
Symbol:
ID: 8335027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4113906
End bp: 4115264
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 70%
IMG OID: 644956814
Product: major facilitator superfamily MFS_1
Protein accession: YP_003114417
Protein GI: 256392853
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
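
The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. The function name cds_lengths is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the record: Start bp 4113906, End bp 4115264.
gene_bp, protein_aa = cds_lengths(4113906, 4115264)
print(gene_bp, protein_aa)  # 1359 452
```

The result matches the Gene Length (1359 bp) and Protein Length (452 aa) fields.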


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.878146
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.946449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGGGC CCTGGTACAG GCGTCTGCGG GAGCTGCCGC TCGCGCTGCG CGTGCTGTAC 
CTGGCCATGT TCGTCAACCG CGCGGGCATG TTCGTCTTCC CGCTGCTGGC CGTCTACCTG
GTCAAGGGCA AGCATCTGGG CGCCGACCAG GCCGGGGTGC TGTTATCGGT GGGCAGCACC
GGCCTGCTGG TCGGCAGCCT GCTGGCCGGG CCGCTGTGCC TGCGCTGGGG CTGCCGCTGG
ACCCTGGTCG GCGCGCTGCT GCTCAACGCC GCGGGCTACC TGGGGCTCGC GGTCGCCGAC
GGCGGGCCGT GGATCTACCC CGCGATCCTG TTCGCGGCGC TGGTCGGCAT GGGCGTCTTC
GGCCCGGCGA GCAACACCCT GATCGCCGAC CTCGCGCCGC CGGAGCAGCG CGCGTACGCC
TTCACCCTCA GCTACATGTT CAACAACCTG GGCATGGGTG TCGGACCGCT GCTGGGAGGC
GTGGCGGCAG CGGTGTCGTT CTCGCTGATG TTCGCCGTGA ACATCGCCGC CAGCCTCGTG
GTCGCCGGGT TGCTGCTGCT GTGGGTACCC GTCGACCGGC CCGCGTCTGC CGGTGCCGCC
GGTGTTGCCG GTGCCGGAAT CGGGACTCAG CGCCCGGAGA AGGTCGGCTA CCGCTCCCTG
GGCCACCGGC ATCTGTGGCT GCTGCTGGGC GCCTCGTTCT TCTACGTCGT GCCGCTGATC
GGCCTGGAAT ACTCCGTCCC GCTGGCCGTG ACCACCTCGC TGAACGCCTC GACGGCGTAC
ATCGGCGTCG TCTACACGAT CAACAGCGTC GTGATCGTCG GCTTCGGCCT CCAGGTCGAG
AAGTTCATCG TCCGCCACAG CACCAGGACC CTGCTGCTGG CCGCCGGCGC GCTGTGGTCG
GCGGGCCTGG TGCTGCCGGT CGTGGCGTTC TCCCTGGCGG CGCTGCTGCT GTGCACCGTC
ATCTGGACGC TCGGGGAGAT CATCGTCTCC GTCGTCGTCC CGGCCTACAT CGCCGACCAG
GCCGACGAGC GCCGGGTGCC GGGCTTCATG GCCGTCAACG GCTTCGTGCT GGGCCTGGCC
CGGCTCGTCG TGCCCGCCGG GCTCGGCGTG CTGTGGTCGG CCCGCGGCCA CGCCCCGGTC
TTCGGGGTGC TGCTGGCCGC TCCGATCTTG GGCATGGCGG CGTTCGCGCT GCTGCGCCTG
CCCCACCCAT CCCGACAACC AGAGGAAAGC GACCATGACC GCGACCAAGT TCCTGCTTCC
CGAGGAGCAG ATCCCGACCA CCTGGTACAA CGTGGTGCCG GACCTGCCGG ACGGGTTGCC
GCCGATGCTG CACCCGGGCA CCAAGGAGCC GATCACTGA
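
The GC content field reports the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown above, with spaces and line breaks between 10-base groups. The function name gc_content is hypothetical, and the rounding used by the database is not known.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and newlines used in the listing above)
    is ignored; case is ignored as well.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input, not taken from the record: 4 of 8 bases are G or C.
print(gc_content("GGCC ATAT"))  # 50.0
```

Passing the full gene sequence above to this function should give a value close to the GC content field.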
 
Protein sequence
MTGPWYRRLR ELPLALRVLY LAMFVNRAGM FVFPLLAVYL VKGKHLGADQ AGVLLSVGST 
GLLVGSLLAG PLCLRWGCRW TLVGALLLNA AGYLGLAVAD GGPWIYPAIL FAALVGMGVF
GPASNTLIAD LAPPEQRAYA FTLSYMFNNL GMGVGPLLGG VAAAVSFSLM FAVNIAASLV
VAGLLLLWVP VDRPASAGAA GVAGAGIGTQ RPEKVGYRSL GHRHLWLLLG ASFFYVVPLI
GLEYSVPLAV TTSLNASTAY IGVVYTINSV VIVGFGLQVE KFIVRHSTRT LLLAAGALWS
AGLVLPVVAF SLAALLLCTV IWTLGEIIVS VVVPAYIADQ ADERRVPGFM AVNGFVLGLA
RLVVPAGLGV LWSARGHAPV FGVLLAAPIL GMAAFALLRL PHPSRQPEES DHDRDQVPAS
RGADPDHLVQ RGAGPAGRVA ADAAPGHQGA DH
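
The protein sequence corresponds to the gene sequence read under translation table 11. A minimal sketch of that translation follows, using only the standard library. It assumes that table 11 uses the standard amino acid assignments and that the first codon is translated as M when it is one of the alternative start codons listed in the code (GTG here). The names translate_table11, CODON_TABLE and START_CODONS are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are assumed
# to match the standard genetic code, which table 11 is taken to share.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; a start codon is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE.get(codon, "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("GTGACAGGGC CCTGGTACAG GCGTCTGCGG"))  # MTGPWYRRLR
```

The output matches the first ten residues of the protein sequence, including the GTG start codon read as M.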