Gene Caci_4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4809 
Symbol 
ID8336163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5478405 
End bp5479661 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content73% 
IMG OID644957909 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003115511 
Protein GI256393947 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.942167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAGTG TCTCGATACG ATCGGCCGAG GACTCCAGTG CCTCTATCGC TTCCTCGGGA 
GGAGGCGGTG ACGGCGGGCT TGGTCGCACG CTCGTCCTCG CGCTCGGCAC GTTCGCGGTG
GGCACCGATG CCTTCGTCCT GGCCGGTTTC CTTCCTGACG TCGCAGCCTC CTTGCACACC
TCCACCGCGA GCGCGGGGCA GGCGGTGACC GTCTTCGCCG CCGCCTACGC GGTCGCCTCG
CCGGTGGTCG CGACGCTCAC CGCGCGCTTC CCGCGCCGTC TGCTGCTGGT CGCGGCGCTG
ATCGTGCTGG CCGCTGCCAA TGCGGCGTCC GCGCTCGCGC CGAACCTGCC GCTGCTGCTC
GCCGCTCGGG TTCTCGCGGC GGCGGGAGCG GCGGGGTACA CGCCGACCGC CGGGGCGGTG
ACGGCTGCGC TCGTGCGGCC CGAGATGCGC GGGCGGGCGC TGGCGGTCGT GGTCGGCGGG
TTGACGGTGG CGACGGCGCT CGGCGTGCCG CTCGGGGATG CGGCCGGCGC GGTGATGGGC
TGGCGCGCGG CGCTCGGGCT CGTCGCAGCG CTCTGTCTGC TCACTGCGAT CGCCGCGGCC
GTCCTGATGC CGACGCTGCC CGGCTCGGCG CCGGTTCCGC TCGCGGCGCG GTTGGCTGCG
CTGCGGCGGC CCGGGGTCGC GAGCGTGCTG CCGTTGACCG TTCTGGGCAT GGCGGCTGCG
TACACCGTCT ACGCCTATGC CATTCCGGCG CTGCACGCGC TGGGCATCGC CGATGGCGCG
ACGGCATGGA TCCTTGCGGC GTACGGCGCG GGGGCGATCC TCGGCAACCT GGCTGCCGGT
ATCGCTGCGG ATCGTCTCGG GCCGACGCGG GTCCTCGTGG TCGGATACGC GCTGATGGCG
ATGACGTTGG CGACTTTCGC GGTGCTCGCG GTGGCCAAGG TGCACGCTCC GGCGCTCGTC
GCGGTGCTCG CGATCACATG GGGCGCCTCT ACGTGGTGTC AGACTCCGCC GCAGCAGCAT
CGGTTGTTCA GCGCCGCGCC GAGCGAAGCC CCGCTGCTTA TGGCGCTGAA CGCCTCGGCG
ATCTATGTCG GCATCGGTAT CGGGACTGCT GCGGGCGGGC TGCTGGTCGC CTCCGGTGCC
GCGTGGATGT TCACGATCGC TGCGATCGTG GCGTGCCTCG CGCTCGGATG GCTCGCCGCG
ACCGCAACCG CAACCGGTCA CAAGGCAACC CGCCGGTGGT CTACCCTCGC CAGGTGA
 
Protein sequence
MSSVSIRSAE DSSASIASSG GGGDGGLGRT LVLALGTFAV GTDAFVLAGF LPDVAASLHT 
STASAGQAVT VFAAAYAVAS PVVATLTARF PRRLLLVAAL IVLAAANAAS ALAPNLPLLL
AARVLAAAGA AGYTPTAGAV TAALVRPEMR GRALAVVVGG LTVATALGVP LGDAAGAVMG
WRAALGLVAA LCLLTAIAAA VLMPTLPGSA PVPLAARLAA LRRPGVASVL PLTVLGMAAA
YTVYAYAIPA LHALGIADGA TAWILAAYGA GAILGNLAAG IAADRLGPTR VLVVGYALMA
MTLATFAVLA VAKVHAPALV AVLAITWGAS TWCQTPPQQH RLFSAAPSEA PLLMALNASA
IYVGIGIGTA AGGLLVASGA AWMFTIAAIV ACLALGWLAA TATATGHKAT RRWSTLAR