Gene Caci_8266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8266 
Symbol 
ID8339645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9577117 
End bp9578532 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content72% 
IMG OID644961352 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003118930 
Protein GI256397366 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.193695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.361913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA CGACCATCCG CTCCGAGGCA GCCACACCGG CCTCAGACGC CCTGGGAATC 
GCAGCGCTCC CGGACCAGCC GCGCCCGGCG CAGCCGTCGG CCTCACGCCC GAGCCGCCAG
AGCCGCCCGA GCCGCTCGCG CGCCCCGCTG TCGGTGTTCG CCGGCTTCCC CCGCGCGATC
TGGGTGGTCT TCGCCGGCAC CGTCGTGAAC CGCGTCGGCT TCCTGGTCGG ACCGTTCCTC
GTGTTCTTCC TCGGCTCCCG CGGCATCCCC TCGTCACAGA CCCCTTACGT CCTCGGCGCA
CTCGGCGCCG GCAACCTCGT CGGCCCGGCC GTCGGCGGCT GGCTCGCCGA CCGCCACAGC
CGCAAGCTGA CCATGCTCGC CGGCCTGCTC GGCACCGCCG CCGCCCAAGG CGCGCTCTTC
GCCGCCCCGA ACGTCGCGAC CATGGCCCTG GCCGCGATAG CACTGAGCGC CACGGCGACC
ATGGTGTCGC CGGCGGCATC GGCGATCCTC ACCGACGACG TCGGACCCGC CCGCCGCCGC
GAAGCCTTCG CCCTGATCGG CTGGGCGGTG AACATCGGCA CGGCCGTCGC CGGAGTCCTC
GGCGGCTACC TCGCGGCCCA CGGCTACTGG ATGCTGTTCG CGATCGACGC CGGAACCTCG
CTGGGATACG CGGTGATCGT CGCGATGCTG CTCCCGGCGG ACCGCACGCG CCACGACGCT
TCCCAGACCC CTGAGTCCCT GACGTCTGCT TCCCAGACTC TCGCCGCTCA GAGCTCAGAT
TCCCCGACCT CTGCCGCCAC CCCCACCAGC TCCGGCTACG GCATCGTCTT CCGCGACCGC
CTGACCCGCA GACTCCTCAT CCTGTTCGCC GTCCAACTCT TCATCTACTC CCTGACCGAG
AGCGCCCTCC CCCTCGCCAT CCGCACCGAC GGCCTGTCCC CCACCGTCAT GGGCCTGGCC
GCCGCCGTCA ACGCAGGACT GGTCGTCGCC CTCCAACCCC TGGCCACAAC CCTCGTGTCC
CGCTTCCCCC GCACCCAGGT CTTCCTCACC GGCGGCATCC TGACCACCAC CGGCATAGCC
CTCACCGGCC TCGCCCACAC CCCCACCGCC TACGCCGCAA CCGTCACCAT CTGGTCCCTC
GGCGAAGTCA TCATCGGCGG CATCCCCGCC AGCCTCATAG CCAACCTCGC CCCCGCCACC
GCCCGCGGCC GCTACCAAGG CGCCTTCAGC TGGGCCTGGG GCGTCTCCCG CTTCCTAGCC
CTAGCCGCCG GCACAACCGC CTTCACCCTC ATCAGCCCAG CATTCCTCTG GTGGACCGCC
CTCCTCGCCG GCACCGCCGC CAACATCGGC ATCATGCTGC TCAGCCCGGC GATCGACCGG
CGGACGTCAG CGATCGACGA GCCGGCCACA CGGTGA
 
Protein sequence
MTTTTIRSEA ATPASDALGI AALPDQPRPA QPSASRPSRQ SRPSRSRAPL SVFAGFPRAI 
WVVFAGTVVN RVGFLVGPFL VFFLGSRGIP SSQTPYVLGA LGAGNLVGPA VGGWLADRHS
RKLTMLAGLL GTAAAQGALF AAPNVATMAL AAIALSATAT MVSPAASAIL TDDVGPARRR
EAFALIGWAV NIGTAVAGVL GGYLAAHGYW MLFAIDAGTS LGYAVIVAML LPADRTRHDA
SQTPESLTSA SQTLAAQSSD SPTSAATPTS SGYGIVFRDR LTRRLLILFA VQLFIYSLTE
SALPLAIRTD GLSPTVMGLA AAVNAGLVVA LQPLATTLVS RFPRTQVFLT GGILTTTGIA
LTGLAHTPTA YAATVTIWSL GEVIIGGIPA SLIANLAPAT ARGRYQGAFS WAWGVSRFLA
LAAGTTAFTL ISPAFLWWTA LLAGTAANIG IMLLSPAIDR RTSAIDEPAT R