Gene Caci_7148 details


Gene Information

Locus tag: Caci_7148
Symbol:
ID: 8338516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8314242
End bp: 8316002
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 66%
IMG OID: 644960229
Product: extracellular solute-binding protein family 5
Protein accession: YP_003117818
Protein GI: 256396254
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
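
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues encoded by a CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above.
START_BP = 8314242
END_BP = 8316002

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1761 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields in the record.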


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.118228
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCAT CCAGGCGCCG GTCACGGTCG GCCATCGCCG TCACCGTGCT GGCAACACTG 
ACGATCACCG CCGGGTGCTC GTCGTCGTCC GCCAAGAAAT CCGGCTCCGG TCAGCAGGAG
GCCGCGCAGA ACGTGGCCAA GCAAGCCGTC GCGGTCGGCA CGGCTGCCGA CTCCCGGGGA
CCGGATCCGG CAGTGTCCGG CGCCAAGTCC GGCGGGACTG TCACAGTGCT CGAACACTCG
GACTTCAGCC ACCTGGACCC GGCCCGCGTC TGGTCCTCCA CCAACCAGAC CGCCGATCTG
CTGCTGACTC GCCAGCTCAC CAGCTACCAG CAGGTCGGCA ACACCACCAA GCTGGTCGGA
GACCTGGCCA CCGACACCGG GAGCAGCACC GACGGCAAGA CCTGGACCTA CCACCTCAAG
GCCGGCCTGA AGTACGAGGA CGGCAGCACG ATCACCGCTC AGGACGTGAA GTACGGCATC
GAGCGCACCT TCCAGAAGGA GCTGTCCGGC GGTCCGCAGT ACCTCCAGAT GTGGCTGACC
GGCAAGACCG ACTACTCCTC CACCTACTCA GGGCCCTGGG GCGGCCAGGA CCTGCCGCAG
ATCCAGACGC CCGACGCCAC GACGATCGTC TTCCACCTCG CGTCGGTGCA CGCCGACTTC
CCGTTCGCGT TGGCGATGCA GGCTTACAGC CCGCTTCCCA AGGACAAGGA CGCCAAGTCC
GCCCTGGACC AGCACCCCTT CTCCTCCGGG CCCTACAAGG TCGACAGCCA CGACATCGAC
AAGGGCATGG TCCTGTCCCG CAACACCCAT TGGGACGCGA ACACCGACCC GGTCCGCCAC
GCCTACCCCG ACCAGTGGAA GTTCGAGTTC GGGGCGCAGG ACGTCGACAT CAACCAGCGC
CTGGCGGCCG CCAACGGCGC CGACAAGGAC GCGATGACCT TCAAGGTCAC CATCGGCTCG
GACCTGGCAT CGCAGGTGAA CTCGAGCCCT GATCTGAAGG CTCGTCTGGT CAACCAGGTC
ACGCCCTTCT CAGAGTTCTA CAACATCAAC ACCCGGCGCG TGACGGACGT CAAGGTCCGC
GAAGCCCTGC TGGAGGCGTT CCCGCGCGCC CAGACGCGGC AGCTGCTCGG GGGACCGATC
TACGGCGACT TCACCACCAC GATCCTGTCG CCGGTGACCA ACGGCTACCA GGACTACGAC
CTGTACGGCG CCCCCGACAC CGGCGACCCA GCCAAGGCCA AGGCGCTGCT GGCGACGACA
TCGACGCCGC ACCCGACCAT CGTGTACGCC TACCAGGACG ACACCGCCTG GCAGCAGGGC
GCCGTCGCCA TCCAGCAGGC GCTGACCAAG GCCGGCTTCA CCGTCGTCAC CAAGGCGATC
AGCGACAAGA ACTACTACGA CGAGACGCAG AAGACTGACA ACCAGTTTGA CGTCTACTGG
GGCGGCTGGG GTCCGGACTG GCCCAGCGCC TCCTCGGTCA TCCCGCCGTT GTTCGACGGC
CGGCAGATCA CCGACGGCGG CAGCGACAAC TCGCTGCTGA ACGACCCGAC GGTGAACGCC
GAGATCGACC GGATCCAGTC CATGACCGAC CTGAGCCAGC AGAACACCGC CTGGGCCGCG
CTGGACAAGA AGATCATGCA GGAGGTCCCC ATCATCCCCT GGGTCGATCC CCGGCAGGTC
TCGCTCTATG GGCCCGGACT CGGTGGCGTC CACACCGGCT TCATCGGCAC CTGCTATCCG
CTCGACGTCT ACGTCAAGTA G
 
Protein sequence
MSASRRRSRS AIAVTVLATL TITAGCSSSS AKKSGSGQQE AAQNVAKQAV AVGTAADSRG 
PDPAVSGAKS GGTVTVLEHS DFSHLDPARV WSSTNQTADL LLTRQLTSYQ QVGNTTKLVG
DLATDTGSST DGKTWTYHLK AGLKYEDGST ITAQDVKYGI ERTFQKELSG GPQYLQMWLT
GKTDYSSTYS GPWGGQDLPQ IQTPDATTIV FHLASVHADF PFALAMQAYS PLPKDKDAKS
ALDQHPFSSG PYKVDSHDID KGMVLSRNTH WDANTDPVRH AYPDQWKFEF GAQDVDINQR
LAAANGADKD AMTFKVTIGS DLASQVNSSP DLKARLVNQV TPFSEFYNIN TRRVTDVKVR
EALLEAFPRA QTRQLLGGPI YGDFTTTILS PVTNGYQDYD LYGAPDTGDP AKAKALLATT
STPHPTIVYA YQDDTAWQQG AVAIQQALTK AGFTVVTKAI SDKNYYDETQ KTDNQFDVYW
GGWGPDWPSA SSVIPPLFDG RQITDGGSDN SLLNDPTVNA EIDRIQSMTD LSQQNTAWAA
LDKKIMQEVP IIPWVDPRQV SLYGPGLGGV HTGFIGTCYP LDVYVK
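
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper `translate` is a hypothetical name, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
# Each preceding line holds 60 bases, so the last line is also in frame.
first_line = "ATGTCCGCAT CCAGGCGCCG GTCACGGTCG GCCATCGCCG TCACCGTGCT GGCAACACTG"
last_line = "CTCGACGTCT ACGTCAAGTA G"

print(translate(first_line))  # MSASRRRSRSAIAVTVLATL
print(translate(last_line))   # LDVYVK
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above.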