Gene Caci_6497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6497 
Symbol 
ID8337861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7486821 
End bp7488713 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content72% 
IMG OID644959595 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003117188 
Protein GI256395624 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTGGA ACAACCCGGC CACGGCCGGT CCCCCCTACC GCTGGCGCTG GGCGGCCTTC 
GCCGTCGTCC TGCTGGCCTC GGTGATGGAC CTGCTCGACT CGCTGGTCAC CAACATCGCC
GGTCCCACCA TCCGGGCCGA CATCGGCGGC GGGCCGGCGC TGATCCAGTG GCTCGGTGCC
GGCTACACGC TGGCCATGGC CGCCGGCCTG ATCACCGGCG GCCGGCTCGG CGACATCTAC
GGCCGCAAAC CGGTCTTCAT CGTCGGCGTC ATCGGCTTCA CCGTGGCCTC GGTGCTGTGC
GCGAGCTCCT TCTCCCCGGG CATGCTGATC GGCTGCCGGG TGGCGCAGGG CCTGTTCGGC
GCGGTGATGC TGCCCCAGGG CCTGGGCGTG ATCAAGGCGG TTTTCCCGCC GCAGGAGCTG
GCCAAGGCCT TCGGCGCCTA CGGGCCGGCG ATGGGCTTCT CCACCGTGCT CGGCCCGGTG
ATCGCCGGCA CCCTGATCGA CGCCAACCTG TTCGGCAGCA GCTGGCGCAT GATCTTCCTG
ATCAACCTGC CGCTGGGCCT GGTGGCGCTC GTCGGCGCGA TCAAGTACCT GCCGTCGGAC
CGCGACGAGC GCGCCCCGAT CAAACTGGAC CTGCTCGGCA CCGTGCTGGC CTCGAGCGCG
GCGCTGCTCG TGGTCTACCC GGTGGTGCAG GGCCGCTCGC TGGGCTGGCC GGCCTGGACC
TTCGTGATGA TCGCCGCCTC GGTGGTGGTC TTCGGGATCT TCGGCTGGGT GGAGACCCGC
ATCCACAACC GCGGCGGCGA CCCGCTGGTG GTGCCCACGC TGTTCCGCAA ACGGGCCTTC
ACCGGCGGCC TGTTCACCGG CCTGGCCTTC TTCACCGCGA TCGTCGGCTT CTCGCTGGTG
TTCACCGTCT TCGTGCAGAT CGGTCTGGGC TACTCCCCGC TGAAGGCCGG TCTGACCACT
TTGCCGCAGG CCGTCGGCAG CGTCGTCGGG TTCATCGCGG CCGGCGCGGG CCTGGCCGCC
AAGCTCGGGC GGCGCATGCT GCACCTGGGC CTGGTGAGCA TGACCGCCGG GGTGATCGGC
ACGTTCCTGA CGATCCACTA CGCCGGGACC GGCCTGACCC CCTACGACCT GATCCCCTCG
CTGCTGTTCA CCGGCATCGG CCTGGGCATG TTCCTGGCGC CGTTCTTCGA CATCGTGCTG
GCCGGGGTGG AGGCCGGCGA GTACGGCTCG GCCTCCGGCA CGCTGAACGC GGTGCAGCAG
TTCGCCGGGG CGCTCGGCAT CGCGGTGGTC GGCACGGTGT TCTTCGGGGT GCTCGGCGGC
CACGTCGCCT CAGCCGTGGA CAGCCACGCG CCGGCCCTGC GCACGCAGCT GAGCGTGGCC
GGGGTCAGCT CCGCGGACCA GGACACGATC CTGGCGAACC TGCACACCTG CGAGCAGGAC
CGGGCCACGG CCAGCGACCC GGCCGCCGTC CCGGCCTCCT GCGCCGTCCT GAACACCGCG
GTCGGCAAGG CCTCCGCCGA GCACGGCCCG GAGGTCGGCC AGGCCGTCGG CACCACCGCG
GTCAAGGCCG CCAAGGCGGG CTTCAGCTCG GCGGTGCAGG AGACGATCTG GACGGTCGTC
GGGCTGCTCG CCGCGGCGTT CGTCCTCGGA TTCGCCCTGC CGATGAAGGC CGCGCAGCAG
GGTGACTGGG CCAACCAGGA CTGGGGCGGC CAAGACGACG GCGACGGCGG CGCGGGCTGG
GGCGGCCAGG GCGGCGGGGA CGGCGCGGAC GGCGAGAACG CCGGAGCCGG AGCTGGAACC
GGCACTGGAA CCGGCGCCGA CGGCAAAGCC GTCCCCGGCC AGTCGGCCAA CGAATGGCAG
CAGTATTCGG GCACGCAGAA CACGGCGGAC TGA
 
Protein sequence
MTWNNPATAG PPYRWRWAAF AVVLLASVMD LLDSLVTNIA GPTIRADIGG GPALIQWLGA 
GYTLAMAAGL ITGGRLGDIY GRKPVFIVGV IGFTVASVLC ASSFSPGMLI GCRVAQGLFG
AVMLPQGLGV IKAVFPPQEL AKAFGAYGPA MGFSTVLGPV IAGTLIDANL FGSSWRMIFL
INLPLGLVAL VGAIKYLPSD RDERAPIKLD LLGTVLASSA ALLVVYPVVQ GRSLGWPAWT
FVMIAASVVV FGIFGWVETR IHNRGGDPLV VPTLFRKRAF TGGLFTGLAF FTAIVGFSLV
FTVFVQIGLG YSPLKAGLTT LPQAVGSVVG FIAAGAGLAA KLGRRMLHLG LVSMTAGVIG
TFLTIHYAGT GLTPYDLIPS LLFTGIGLGM FLAPFFDIVL AGVEAGEYGS ASGTLNAVQQ
FAGALGIAVV GTVFFGVLGG HVASAVDSHA PALRTQLSVA GVSSADQDTI LANLHTCEQD
RATASDPAAV PASCAVLNTA VGKASAEHGP EVGQAVGTTA VKAAKAGFSS AVQETIWTVV
GLLAAAFVLG FALPMKAAQQ GDWANQDWGG QDDGDGGAGW GGQGGGDGAD GENAGAGAGT
GTGTGADGKA VPGQSANEWQ QYSGTQNTAD