Gene Caci_2549 details


Gene Information

Locus tag: Caci_2549
Symbol:
ID: 8333898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2883013
End bp: 2884500
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 66%
IMG OID: 644955702
Product: major facilitator superfamily MFS_1
Protein accession: YP_003113308
Protein GI: 256391744
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
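
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are hypothetical, chosen for this sketch.
start_bp = 2883013
end_bp = 2884500

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields.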


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCG TCACGGACGA GCCCAAGGGA GCCGTCCACC GCAGCCTGGT GCCCGCGCGC 
ATGGACCGGC TGCCGTGGGC GAGGTTCCAC TGGCTGGTGG TCGTCGGGCT CGGGGTGTCG
TGGATCCTGG ACGGTCTGGA GATCCAGATC GTCTCGCAGG CCGGCTACCA GGACTCCCTC
GGATTGACCA CCGCCCAAGT CGGCGCGGTC GGGTCGGTGT ACCTAGCCGG CGAGGTGGCC
GGGGCGCTGG TCTTCGGCCG CATCACCGAC CGGCTCGGCC GGCGCAAGCT CTTCATGGTC
ACGCTCGGCA TCTACCTGGT CGCCAGCGGT CTGGCGGGCT TCTCCTGGGA TCTGTGGTCG
CTGCTGGTCC TGCGGTTCAT CGCCGGGACC GGAATCGGCG GGGAGTACAC CGCGATCAAC
TCCGCGATCG ACGAGCTGAT CCCGTCGCAC TACCGGGGAC GCGTCGACAT CGCCGTCAAC
GGCACGTATT GGGGAGGCGC GGCGATCGGT GCCGCGGCGA ACCTGTATCT GCTCTCTGAC
CAGGTCCCGC AGAACATCGG GTGGCGGATC GGGTTCCTGA TCGGTCCCAC GATCGGCGTC
GCGATCATCG TTCTGCGGCG TCATATCCCC GAAAGTCCGC GCTGGCTGAT GACCCACGGC
CGTCAAGCCG AGGCTGAGCA GGTCGTCGAC GACATCGAGG ACCGGGTGCG AGCCGACGGC
GCGGAGCTGG AGGACGTGCC GGACAGCAAG GCGATCGAGA TCGTCCCCGA GAAGAGCATC
ACCTACCGGC AGATGGCACG GGTGTTCTTC GGGCAGTATC CGCGGCGGTC GATCCTGGGC
TTCTCGATGA TGGTGACCCA GGCCTTCCTC TACAACGCGA TCTTCTTCAC CTACGCCTTG
GTGCTCGAGC ACTTCTATGG CGTCTCCAAG GCGCACACGA GCTACTACTT CTTCCCGTTC
GCGCTGGGCA ACCTGGCCGG GCCGCTGCTG ATGGGGCATC TGTTCGACAC CATCGGACGG
CGCAAGATGA TCCTGCTGAC GTACGGCCTT TCCGGGCTCC TGCTGCTGGT GTCTGCCTTC
TTCTTCCACG CCGGCGTGCT GAACGCCACC ACGCAGACGG CGTTTTGGTG CGTGACCTTC
TTCTTCGCCT CAGCTGGCGC GTCCTCGGCC TACCTGACGG TGAGCGAGAT CTTCCCGCTG
GAGCTGCGGG CGCAGGCGAT CTCCTTCTTC TTCGCGATCT CCCAAGGCGC GGGCGGCGTT
GTCGCGCCGT TCCTGTTCGG TCACCTGATC GGCGGTCAGA ACAACCCGCA TCCGGACCGG
ACGCCGTTGT TCTGGGGCTA CGTCATCGGC GCGATCGTGA TGATGATCGG CGGGGCGGTC
GGCTGGTTCC TTGGAGTGAA TGCCGAGCGC CAGTCGCTGG AGGACGTCGC CCGGCCGATC
TCGGCTCGCG ACAATGGCGG CGGCGCCGTG TCCGCGGCTA CCACCTAG
 
Protein sequence
MSVVTDEPKG AVHRSLVPAR MDRLPWARFH WLVVVGLGVS WILDGLEIQI VSQAGYQDSL 
GLTTAQVGAV GSVYLAGEVA GALVFGRITD RLGRRKLFMV TLGIYLVASG LAGFSWDLWS
LLVLRFIAGT GIGGEYTAIN SAIDELIPSH YRGRVDIAVN GTYWGGAAIG AAANLYLLSD
QVPQNIGWRI GFLIGPTIGV AIIVLRRHIP ESPRWLMTHG RQAEAEQVVD DIEDRVRADG
AELEDVPDSK AIEIVPEKSI TYRQMARVFF GQYPRRSILG FSMMVTQAFL YNAIFFTYAL
VLEHFYGVSK AHTSYYFFPF ALGNLAGPLL MGHLFDTIGR RKMILLTYGL SGLLLLVSAF
FFHAGVLNAT TQTAFWCVTF FFASAGASSA YLTVSEIFPL ELRAQAISFF FAISQGAGGV
VAPFLFGHLI GGQNNPHPDR TPLFWGYVIG AIVMMIGGAV GWFLGVNAER QSLEDVARPI
SARDNGGGAV SAATT
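
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC fraction. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as text with spaces and line breaks between the 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The function names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon table built in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGCGTCG TCACGGACGA GCCCAAGGGA"))
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the Protein sequence block and the GC content field.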