Gene Caci_4518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4518 
Symbol 
ID8335872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5144950 
End bp5146797 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content68% 
IMG OID644957620 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003115222 
Protein GI256393658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.872953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGAC CCAAACCACT GGTCGCGGCC CTAGCCGTCG CGACGATAGC GGCCACCGCG 
CTGTCGGCGT GCTCCAGCTC CTCGGCGAAG AAAACCAACG GTGGCACCGG TGGCGGTACC
GGCGGGGTCT TCACCAGCAT CGACGCCAAC AACAAGATCA CCGCCGGCGC GCCGATGAAC
CCGTACAACG CCGCGCCCAA CATGTTCCTG GGCTACAACA TCATGGAGCT GGGCTTCACC
AAGAACGACC CCGCGGACCC CAACGCCCTG CTCCCGGGTC TGGCCGCCAG CTGGACCGCC
TCGGACACCG GGCTCACCAT CCAGCTGCAG CCCGGCGCCA AGTGGTCCGA CGGCACCCCG
GTCACCGCCG CGGACATCAA GACCTCGCTG GCCATCGCCT ACACGCAGGG CACGGCAGGT
CCCGTGGCCG GCGCCGGCGG CACCGTCGTG GCCGGCAGCA ACTTCGAGGT CTCCGACGTC
AAGGACCTCG GCGGCGGCAA GATCGAGATC GACCAGCAGC CCGGCGTGAA GAACCTGTAC
TTCCAGCGCC TGGTGCTCAC CTCGACCATC GTCAACGACA AGGTCTACGG CAGCCAGCTC
CCAGCGGACA TCTGGACCCA GATCGCCGCC GTGCAGGGCA CCGACGCCGC CGCGGCGTCC
GCGGCGTCCA CCAAGCTGGC CGCCGAGGGC AAGACGATCG CCGCCTTCGC CCCGGCCAAG
GACATCTCGG CCGGCCCGTT CGTGGAGACC CGGGTCAACC CCGGCGAGGC GCTGCTGGAC
CGCAACCCCT ACTTCTACGC CGCGAGCAAG ATCTCGCCGA AGCAGGTCAT CCTGCGCAGC
TACTCCGGCA ACCAGCAGAT CTGGGGCTAC ATGAACGGCG GCGAGCTGGA CTACGCCCCG
TACACCTCGA TGCCCACCAA CATCCTGAAC CAGGTCCTCA AGGCCGGCTA CACCCGCATC
GACGCCCCCA GCTACGTCAG CGCCTCGATC GCGTTCAACG AGAAGCAGGC GCCGTACAAC
CTGACCCCGG TGCGCCAGGC GCTGGCCTAC GTCATCGACC GCGACGCCGT CACCAAGGTC
GGCGAGCCGG TCGGCGGCAT CGCCGCCCCG ACCACCACCG GCCTGGTCGG CTCGCAGTCC
GACACGATCT TGTCCGCCGA CCAGAAGGCG GCGCTGAACC CCTACAAGCC GGACCCGGCC
AAGGCCGCGT CCCTGCTGCA GGGCGCAGGC TTCACCAAGG ACGCCTCCGG CCAGTGGCAC
CTGCCCGACG GCACGCCGTG GAAGATCACG CTGCAGACCG TGAACGGCTT CTCCGACTGG
ATCGCGGCCT CCACGATCGT GGCCAACGAG CTGACCCAGT TCGGCATCCC GACCACCGCG
GCGATCACCG CCGACTTCGC CACGTACCAG AAGGAGATGG GCGCCGGTAA GTACGCGGTC
GGCTGGTGGC TGGTCGCCCT GGGCCCGCAG ACGGACAAGG CCTACGCCCG CATCTACGGC
TCCGCCGACG GCTTCAGTGT CGCCAACGGC CAGGCCACGC ACAACGACAG CGCGGCCGGC
AACTGGGAGC ACACCCCGGC GACCTACACC GTCAACGGCC AGAGCATCAA CCCCGGCCAG
CTCGCCGCGC AGCTGTCGGT GACCCCGGTC TCCGCCCAAG GGCCGATCAT CGCCCAGCTG
GCAGCGGCCA CCAACCAAGA AGTGCCGATG ATCCAGATCT GGAACTACAC CCACGTGATG
TTCACGCTGG ACAAGCGGTT CACGAACTAC CCGAAGACCG GGCAGGACGA TCTGCTGGCC
AACCCGCCCG GCGTGTGGAT GATGCAGGGG TACGTGCAGG GCAAGTAG
 
Protein sequence
MSRPKPLVAA LAVATIAATA LSACSSSSAK KTNGGTGGGT GGVFTSIDAN NKITAGAPMN 
PYNAAPNMFL GYNIMELGFT KNDPADPNAL LPGLAASWTA SDTGLTIQLQ PGAKWSDGTP
VTAADIKTSL AIAYTQGTAG PVAGAGGTVV AGSNFEVSDV KDLGGGKIEI DQQPGVKNLY
FQRLVLTSTI VNDKVYGSQL PADIWTQIAA VQGTDAAAAS AASTKLAAEG KTIAAFAPAK
DISAGPFVET RVNPGEALLD RNPYFYAASK ISPKQVILRS YSGNQQIWGY MNGGELDYAP
YTSMPTNILN QVLKAGYTRI DAPSYVSASI AFNEKQAPYN LTPVRQALAY VIDRDAVTKV
GEPVGGIAAP TTTGLVGSQS DTILSADQKA ALNPYKPDPA KAASLLQGAG FTKDASGQWH
LPDGTPWKIT LQTVNGFSDW IAASTIVANE LTQFGIPTTA AITADFATYQ KEMGAGKYAV
GWWLVALGPQ TDKAYARIYG SADGFSVANG QATHNDSAAG NWEHTPATYT VNGQSINPGQ
LAAQLSVTPV SAQGPIIAQL AAATNQEVPM IQIWNYTHVM FTLDKRFTNY PKTGQDDLLA
NPPGVWMMQG YVQGK