Gene Caci_5903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5903 
Symbol 
ID8337265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6810798 
End bp6812261 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content71% 
IMG OID644959007 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003116602 
Protein GI256395038 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.811375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC AGGCGCGGCG ATCGCGGGGA GTACTGCCCG CGCTCGTGTT GTCCGTACTG 
GCTTTCTCCC TCGTCCAGAC CTCCGTGGTC CCGATCCTGC CGACGCTGCA GAAGAACTTG
CACGTCGCCG GCAGCGGCAT CACCTGGCTG ATGACCGCGA ACCTGCTGTC GGCGGCGGTG
CTCACGCCGC TGCTGGCGCG CATCGGCGAC CTGCGCGGGC GCAAGCCGGT GCTGGTGGTG
GCGATCGCCG GTGTGCTGGT TGGCGGCGTC CTCGGCGGGA TCGGCGGCTC GTTCGGGCTG
CTGCTGATAG CCCGGGTCGT GGCCGGGACC GGGGGTGCGA TCCTGCCGCT GGCGGTGGCC
GTAGTGCGCG ATGAGCTGCC GCGCAAGAAG GTCACCGGCG GTGTGGCGAT GGTCTCCGCG
GCCCTCGGCG TCGGCTCGGG GCTGGGCCTG GTCGCCACCG GCGTGGTGAT GGAGCACTTC
AGCTACGAGT CGGTGTTCTG GATGGGCGCC GTGCTGGCCG CCGTCGCTCT CGCATTGGTG
GTCTGGCTGG TCCCGCACGA CCCGATCAAG GCCGAGGGCA AGGCCGACCC GCTGGGCGCG
CTGCTGCTCG CCGGTTGGCT CTCGGCGCTG CTGATCGCGG TCAGCCAGGG CAACGACTGG
GGCTGGGGCT CCGCCCGCAC GCTCGGACTG TTCGTGACGG CCGTGGTGGT CCTGGTGCTG
TGGGTCGTGG TGGAACGCCG CGTGGCCTCC CCGCTGGTGG ACATCGCGAT GCTCGCCAAG
CCCGCGGTCG CGGTCACCAA CACCGCCGGG GTCCTGGTCG GCTTCGCGAT GTATGGCTCT
TTCCTGTTGA TGAGCGACTT CACGCAGACC CCGAAGGCGG TCGGCTACGG TTTCGGCGCC
TCGGTTTTGG CCTCGGGCTG GATGCTGTTC CCCTCGGCGG TCGGCTCCTT CGCCGCAGCC
CCGGTCGGCG CGGCCCTGAT CAAGCGCGGC GGTCCGCGCC TGCCCCTGGT GCTCGGCGGC
GCGTTCGCCG CGGCGGGCCT GGGCCTGCTG GTCTTCGCGC ACAGCTCCAG CTGGCACGTC
GTGGTCGCCT CCGGCGTCAT GGGCGTCGGC GTGGGCATGG CGTACGCGGC GATGCCGGCG
TACATCAACG CCTCGGTCCC GGTGCAGCAG TCGGGCATCG CCAACGGCAT GAACGCGGTG
CTGCGGACCG TCGGCGGCGC CGTCGGCACG GCGGTCATCG GCGCGGTGCT GACCGGCAAC
ATGAAGCAGG TCGCCCCCGG CGTCCAGTTG CCGACCATCG ACGCCTACTC GCACGCCTTC
CTGATCGCCT CGGCGCTGGC ACTGGTCGCC GCGGTGGTGC CGTTCCTGGT CAAGGCGCCG
CAGATGACAG CGATGACGAC GCCGGACACC ATCGACGCCG GAGTCGACAG CGAGCCGAAG
GCGATGGCTG CTGCGAACGT TTGA
 
Protein sequence
MSDQARRSRG VLPALVLSVL AFSLVQTSVV PILPTLQKNL HVAGSGITWL MTANLLSAAV 
LTPLLARIGD LRGRKPVLVV AIAGVLVGGV LGGIGGSFGL LLIARVVAGT GGAILPLAVA
VVRDELPRKK VTGGVAMVSA ALGVGSGLGL VATGVVMEHF SYESVFWMGA VLAAVALALV
VWLVPHDPIK AEGKADPLGA LLLAGWLSAL LIAVSQGNDW GWGSARTLGL FVTAVVVLVL
WVVVERRVAS PLVDIAMLAK PAVAVTNTAG VLVGFAMYGS FLLMSDFTQT PKAVGYGFGA
SVLASGWMLF PSAVGSFAAA PVGAALIKRG GPRLPLVLGG AFAAAGLGLL VFAHSSSWHV
VVASGVMGVG VGMAYAAMPA YINASVPVQQ SGIANGMNAV LRTVGGAVGT AVIGAVLTGN
MKQVAPGVQL PTIDAYSHAF LIASALALVA AVVPFLVKAP QMTAMTTPDT IDAGVDSEPK
AMAAANV