Gene Caci_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3638 
Symbol 
ID8334991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4071196 
End bp4072425 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content71% 
IMG OID644956779 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003114382 
Protein GI256392818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0205021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00774423 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGACGG AGTCGGCCAC CGCCGAGGGC GTCAGCACCC TGATCGGGGC GGACCGTCTG 
GCCCGCGGTC GCCGTGCGCT GGCTGCGGCG ACCTTGGCGA ACGCCGTTGG GAACGGCATG
TACCTGGCCG TGATGGCCGT CTACTTCACC GGCTCGGCGG GGTTTTCCGC CGCTCGCGTC
GGGCTCGGGC TGACGGCCGC CGGGCTGGTG GGTTTGGCGG CGGGAGTCCC GGTCGGCCGC
TGGGCCGACC GCAACGGTCC GCGCGAGGTG TACCTGGGGC TCGGCCTGGT GTCGGCGGCG
ACGATGGCGG CGTACGCGGG ACTGCGCTCG TTCTGGCTGT TCTGCGCGGT GGCGGTGGTC
GACAACCTCG CGGCGTCGGG CACGCGGGCT GCGCGCGGGG CGTTGATCGC CCGGTTGGCC
GGTCGCGATC CGCACCTGTA CCGGGCTCGT CTCCGGTCGG TCGCGAACCT CGGGCTGGCG
GTCGGAGCCC TGATCGGCGC GGCGGGCCTG GCGATCGACA CGCGGGCCGG GTATACGGCG
ATCCTGCTTC TCAATTCCCT GACGTTCCTG ATGAACACGT TGCTGTGCCT GAGGATCCCG
CCGCTGCGGT CGATACCCGC ACCGCCCGGG CGCACCGCCT GGCCGGTGGT GCGGGACCTG
CGCTTCCTGG CGTTCTCCTG TCTCGCGGCG GTGTTGGCCA TCCACGACGA GGTGCTGTTG
TTCGCGGTGC CGTTGTGGAT CGCACGGGTC GGCCACGCGC CGCGCTGGAT CGTGGCTGTC
CTGTTGTTCG TCAATACTCT GATGGTTGCC TCGCTCCAGG TGCGCATCGG GCGCGCGGTG
GACACGATCG CGGGAGGCGT CCGGGCCTCC GTTCGAGCCG GCTGGATCCT TGCGGCGGCC
GCGCTGCTGT TCGGCGTCAT CGGAACCGTG CCCGGCTGGA CCGCGATCGC CATCCTGCTC
CTCGCCGCGG CCGCTCACTC CGTCGGCGAG ATCGTGCAGC AGGCAGGGTA TTCAGAACTC
TCATTCGGTC TGGCCCCGGA TCACGCGCAG GGGCAATACC AGGGCATGTC GGCGACGTTC
AGCGGCGCGG CGATCGCGCT GGCTCCCGGA TTACTGGCTT GGCTGTGCCT CGGCGTCGGT
ACGAGAGGCT GGCTGGTGCT CGCGGGCGCG TTCGCGCTAG CCGGAGCGCT GACCCCGATC
GCGGTCGGTG CTCCCGACGC AGCAGGCTGA
 
Protein sequence
MATESATAEG VSTLIGADRL ARGRRALAAA TLANAVGNGM YLAVMAVYFT GSAGFSAARV 
GLGLTAAGLV GLAAGVPVGR WADRNGPREV YLGLGLVSAA TMAAYAGLRS FWLFCAVAVV
DNLAASGTRA ARGALIARLA GRDPHLYRAR LRSVANLGLA VGALIGAAGL AIDTRAGYTA
ILLLNSLTFL MNTLLCLRIP PLRSIPAPPG RTAWPVVRDL RFLAFSCLAA VLAIHDEVLL
FAVPLWIARV GHAPRWIVAV LLFVNTLMVA SLQVRIGRAV DTIAGGVRAS VRAGWILAAA
ALLFGVIGTV PGWTAIAILL LAAAAHSVGE IVQQAGYSEL SFGLAPDHAQ GQYQGMSATF
SGAAIALAPG LLAWLCLGVG TRGWLVLAGA FALAGALTPI AVGAPDAAG