Gene Information |
Locus tag | Caci_4557 |
Symbol | |
ID | 8335911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5185803 |
End bp | 5187515 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957658 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003115260 |
Protein GI | 256393696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
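The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates (the record does not state this) and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5185803, 5187515)
print(length)                     # 1713, matching "Gene Length"
print(protein_length_aa(length))  # 570, matching "Protein Length"
```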
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGAA TAGCCGTCCT TTCGGCTGCC CTGGCGCTGG TCGCCGCGGG GGCGGCCGCG TGCTCGTCGT CAGCGGGCCA CACCTCGACC GGCACCGCCG GTTCGGCCGC CTTCGACCCC AAGACCTGCC AGGGCGGCAC CCTGGAAGTC CTGAATCAGG ACAGCATCAG CAAGATGGAC CCGGCGCGGA TCTACACCTC CGGCGGCGGC AACATCCCCT CGCTACTGTT CCGCACGCTC ACCACGCGCA ACCGGCAGCC CGGGCAGGAC GGCGCCAAGC CCGCGCCGGA CCTGGCCACC GACCTCGGCA CGCCGAGCGA CGGCGCGAAG ACGTGGACCT ACCACCTTAA GAGCGACATC TTCTTCGAGG ACGGCACGCC GATCACCGCG CAGGACGTCA AGTACGGCAT CGAGCGCTCC TTCGCCCCCG AACTGCCCGG CGGCGCCCCG TACCTGCGCG ACTGGCTGGT CGACGCCTCG AACTATCAGG GCCCGTACAA GGACCCGAAC GGTATCGCGG CGATCGAGAC CCCGGACGCC AAGACGATCA TCTTCCACCT GCGCAAGCCC GAGGGCGACT TCCCGCTGCT GGCCACCGCC ACGCAGTTCG CCCCGGTGCC CAAGGCCAAG GACACCGGCG TCAACTACGA CAAGCACCCG ATCTCCTCCG GGCCCTACAT GGTCGCCAGC TACGACAAGG GCAAGACGCT GGACCTGGTC CGCAACCCGC ACTGGTCCGC CGCCAGCGAC CCGCTGCGCT ACGCGTGCCC GGACAAGATC GACGTCACCT CTGGTCTGAA CCCCGCGGTC ATCAACCAGC GCATCGCCAC CGGCAGCGGC CAGGACGCGA ACGCCGTCAC CACCGACGCC ACCATCGGCC CGGACCAGCT GGCGCAGCTG AACAGCAACC CCTCGCTGGC CAAGCGGGTG GCGCGCGGCG AGTTCCCGGC GACGACCTAC CTGGCGTTCA ACACCAAGGT GAAGCCGTTC GACGACATCC GCGTCCGCGA GGCGGTCTCC TACGCGATCA ACCGCACCAC CGCGGTCAAC GCCGCCGGCG GCACCGCCGT GGCCGGCGCC TCGACCACCT TCCTGCCGCC GCAGAAGGCG CTGGGCTACC AGCCCTACGA CGACTTCCCG GCCGGCGCGA CCGGGGACGC GGCGAAGGCC AAGGATCTGC TGGCGCAGGC GGGGTATCCC AACGGCCTGA CGATCACGCT GTTCCACCAG TCCGACGACG CCAACAACCT CGGGCCGAAG GAGGCCACCG CGATCCAGGA CGCGCTGAAG GCGGCCGGCA TCACGGTGAA GCTGAACCCG GTGGACGACG ACAGCTACCA GGACGTCACC GGCAAGCCCA GCAGCGAGCC CGGCGTCTCG CTGCAGTACT GGGGCGCCGA CTGGCCCTCC GGCGCGCCGT TCCTGATCCC GATCTTCGAC GGCCGCGAGA TCATCGACAG CGGCGGCAAC TTCAACATGG CGCAGCTGAA CGACCCGGGC GTGAACGCCG AGATCGACGC GATCAACGCG ATCACCGACC CGGCGCAGGC GCAGGCCCGC TGGGGCGCGC TGGACGCCAA GCTCGGGCAG CAGGCGCTGA CCGTGCCGCT GTTCTACGAG AAGGACGTCT ACCTGTTCGG CAAGAACGTC AAGGACGCGG TGCCGGACGG CTGGCGCGGC CAGTACGACC TGGCCCGCGT GTCGGTCAAG TAA
Protein sequence | MRRIAVLSAA LALVAAGAAA CSSSAGHTST GTAGSAAFDP KTCQGGTLEV LNQDSISKMD PARIYTSGGG NIPSLLFRTL TTRNRQPGQD GAKPAPDLAT DLGTPSDGAK TWTYHLKSDI FFEDGTPITA QDVKYGIERS FAPELPGGAP YLRDWLVDAS NYQGPYKDPN GIAAIETPDA KTIIFHLRKP EGDFPLLATA TQFAPVPKAK DTGVNYDKHP ISSGPYMVAS YDKGKTLDLV RNPHWSAASD PLRYACPDKI DVTSGLNPAV INQRIATGSG QDANAVTTDA TIGPDQLAQL NSNPSLAKRV ARGEFPATTY LAFNTKVKPF DDIRVREAVS YAINRTTAVN AAGGTAVAGA STTFLPPQKA LGYQPYDDFP AGATGDAAKA KDLLAQAGYP NGLTITLFHQ SDDANNLGPK EATAIQDALK AAGITVKLNP VDDDSYQDVT GKPSSEPGVS LQYWGADWPS GAPFLIPIFD GREIIDSGGN FNMAQLNDPG VNAEIDAINA ITDPAQAQAR WGALDAKLGQ QALTVPLFYE KDVYLFGKNV KDAVPDGWRG QYDLARVSVK
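Both sequences above are displayed in space-separated blocks of ten characters, so the blocks need to be joined before any length or composition check. The following is a minimal sketch of that step plus a GC-fraction calculation, shown on the first two blocks of the gene sequence only. The helper names `join_blocks` and `gc_fraction` are hypothetical.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two display blocks of the gene sequence in this record.
prefix = join_blocks("ATGCGGAGAA TAGCCGTCCT")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.55
```

Applying the same two helpers to the full gene sequence would give a value to compare with the reported GC content; the 0.55 above describes only the 20-base prefix.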