Gene Caci_8042 details


Gene Information

Locus tag: Caci_8042
Symbol:
ID: 8339420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 9334468
End bp: 9335778
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 644961127
Product: major facilitator superfamily MFS_1
Protein accession: YP_003118706
Protein GI: 256397142
COG category:
COG ID:
TIGRFAM ID:
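
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length(9334468, 9335778)
print(length, expected_protein_length(length))
```

Under these assumptions the values come out to 1311 bp and 436 aa, which agrees with the Gene Length and Protein Length fields.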


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGC GGGGGGCAAA CGCGGAGGCG GACAGTATCG CGGCTCGTCA TGACACTCCG 
AAGCACACGC CCAGCTGGCT CCGCCGCGCG CTGCCGGACA CCGGACCGCA GCGCGCGCTG
GTGATGTCGA GTTTCGTGAA CCGCATCGGC ACCGGGATGT TCCTGGCGAC CTCCGCTTTG
TACTTCACGG TGATCGTGGG CATACCGGCG CGACAAGTCG GCACGGGGTT GAGCATCGCC
GGTCTGGCCG CACTGCTCGG CTCGGTTCCG GCGGGGACGC TCGCCGACCG CGTCGGTCCG
CGCACCGTCC AGCTGGTCAC GCTCGCGGTG CAGACCGTCA CGATGGCGCT GTTCGTAGTC
GTGCACTCGT GGTGGGCGTT CACCGTGGTG GCGGCTTTGG ATTACGTCGC GGACGCGGCG
AACAACGCGG CGCGCGGGGC GTTGATAGGC CGCATCGGGG GCGAGCGGCC GGGACTGTTC
CGCGCGAAGC TGCGGACGTT CGTGAGCGTC GGGGTGGTCG CCGGAACGCT GCTCGCGGCG
GTCGCGATCC AGATCGGGAC GCGCGGCGCG TATGTCACGG TGATTCTGGT GAACGCGGTG
TCGTATGTGG TCTGCGCGCT GTTGCTGCTG CGGGTCCCGA ACTTCGGGGC GTTGCCGAAG
CCTGCCGGAA CGCGGCGGTT CGCGGCGTTG GCGGACCGGC CGTATGCGGC GTTCGCGGCT
CTCAATGGTC TGATCAACCT GCAAGCGGTC GTGGTGACGC TGGTGATTCC GCTGTGGATC
GCGTCGCGGA CACAGATCCC GCATTGGGCT GCCGCTGCGG TGTTCGGGCT GAACTTCTTG
GTGGGCACGG CGCTGATGCA GCCGGTGGGT CGGCGTATAA AGACGACGGA GCAAGGCGGA
AAAGCAATGC GCGTCGCCGG GCTCGCGATC GCCGTCGGCT GCGCGGTGTT GGCTGGAAGC
AACTCGGGAC CGCGATGGTC CGAGACGCTG GTGTTGTTCG TGGGCGCAGC GGTGTTGTGC
GCCGCCGGGG TGTGGGTGAC CGCCGCCGGT TTCTCGCTGA GTTTCGAGCT GGCGCCCGCT
CACGCGCAGG GGCAATACCA AGGCGTCACG CTGCTCGGGC TTGACGCCGC GGGCGCTGTC
GGACCGGCGT TGCTGACCGC GCTGGTGCTG GGACTCGGCG CGCCGGGGTG GGTGGTGCTC
GGTCTGGGCT TCGCCGCCGC CGGGCTGATG GGACCGGCGG TGACGCGATG GGCTGAGCGG
ACTCGGCCGA CGGTTGTCAG TGTCGGCGAT GCCGCGCCGG AACCGGCTTA G
 
Protein sequence
MITRGANAEA DSIAARHDTP KHTPSWLRRA LPDTGPQRAL VMSSFVNRIG TGMFLATSAL 
YFTVIVGIPA RQVGTGLSIA GLAALLGSVP AGTLADRVGP RTVQLVTLAV QTVTMALFVV
VHSWWAFTVV AALDYVADAA NNAARGALIG RIGGERPGLF RAKLRTFVSV GVVAGTLLAA
VAIQIGTRGA YVTVILVNAV SYVVCALLLL RVPNFGALPK PAGTRRFAAL ADRPYAAFAA
LNGLINLQAV VVTLVIPLWI ASRTQIPHWA AAAVFGLNFL VGTALMQPVG RRIKTTEQGG
KAMRVAGLAI AVGCAVLAGS NSGPRWSETL VLFVGAAVLC AAGVWVTAAG FSLSFELAPA
HAQGQYQGVT LLGLDAAGAV GPALLTALVL GLGAPGWVVL GLGFAAAGLM GPAVTRWAER
TRPTVVSVGD AAPEPA
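
The sequences above are listed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch with hypothetical helper names: it strips the block spacing, computes the G+C fraction, and runs a basic sanity check on a coding sequence. The start-codon check accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# Usage with the first two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGATCACGC GGGGGGCAAA")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` gives a way to compare against the GC content and Gene Length fields of the record.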