Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1286 |
Symbol | |
ID | 8332621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1459076 |
End bp | 1460860 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644954433 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003112052 |
Protein GI | 256390488 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGG AAACCGCTGT GATACCTAAG TTCTCGATCC GTGCGGGAGC CGCCGCGGCG GCGCTCGCCC TGGTCCTGTC CGCCTGCGGC TCGAGCAGCG GCGCGGGCGG GGGTTCGGCC ATGGGTCCGG GCTTCGACCT GGCGAGCAAG CAGGTGCTGA ACGCCTCGAC CAAGACCGGC GGCACCATCA ACCTGGTCTC CTCGCAGTCC TTCGACTCGA TCGACCCGGG GGTCACCTAC TCCGCGCAGA CCTGGAACCT GTTCCGGATG TTCGCCCGGC CGATGATGGC CTACGAGCAC ACGCCCGGCG GCAACCAGAT CGTCGGCGAC CTGGCCACCG GCCCGGGCGT GCAGTCCAAC GGCGGCAAGA GCTGGACCTA CACCCTGCGC GACAACGCCA CCTTCGAGGA CGGCACGCCG ATCACCTCCG GCGACGTGCG CTGGGCCATC GAGCGCTCCA ACTGGTCCTC GCTGGTCGGC AACGGCCCGA CCTACTTCCA CAACATCCTC ACGCCGCCGA ACGACCCGCG GTTCACGGAC CTGGACGTGT ACAAGTCCGG GGACAAGATG TTCGACAACA TCATCGTCAC CACGGACCCG AAGAAGATCA CCTTCAACCT CCCGCAGGCC TTCGGCGAGT TCGACTACGT GATGACCATG CTGCAGACCG CTCCGGTCGA GCGCACCGTG GACGAGAAGG ACTCCGGCGA CACCTACGGC AAGCGGCCGG TCTCCACCGG CGCCTACAAG ATCGCCAGCT ACTCCCCGGG CAAGGAGCTC AAGCTGGTCC GCAACGCCGC CTACAACCAG CAGTCCGACC CCAACCACAT GCACACCTCG CTGGCCGACG CCGTCGACGT GCAGCTCGGC GTGGACTCCG GGGAGCGCGA CCAGGAACTG CTGGACGGCC AGGCCGACGC CGACCTGGGC TCGGCGCTGA CGGTGGCCAA CCACGCCAAG GTGCTCCAGG ACCCGACGCT GAAGTCGCAG ACCGACGACG CCCCGGACTA CTCGATCGCC TACTCCTCGA TCAACACCGA GCTGATCCCG GACGTCCCGT GCCGGCAGGC GATCGAGTAC GCGGTGGACA AGAACACGGT GCTGAACCAG CTCGGCGGGC AGTGGGGCGG CACGGTCGCC ACGAACCTGC TCACCGCCGG CATCCCCGGC GCGGTCCAGT TCCCGACGTA CACCTACGAT CCGGCCAAGG CCAAGACCCT GCTGGCGACG TGCAAGACGG CCGACCCCGC GCTGTTCGAC AGCAGCGGGG CGCTGACCTT CAAGATCGCC GCGCAGACCA ACGCCCCGGA CCTGCAGAAC GCCGCGACCG CGATCCAGGC CTCGCTGGCC GGCGTGGGCA TCAGCACCCA GGTGACGCTG TTCCCGTTCG GCCAGTACAG CCAGTACTGC GGCAACCAGG CCTACGCCAA GGCGCACCGG CTCGGCATGT GCCTGGCCAA CTGGGGACCG GACTGGCTCA CCGGCTACGG CATGCTCGAC CAGCTGGTGA CCTCCAACGG CATCGCGGCC ACCGGCAGCC AGAACTACGC CTTCCTCAAC GACACGACGG TCAACTCCCT GGAGAAGGAA GCGCTGTCCA GCTTCGAGCC GAGCACCCAG CAGCAGGACT GGGTCAAGAT CGACCACCGC GTGATGGACC TGGCGGCGTA CGTGCCGCTG ATGGACCGGC ACATCATGCG CTTCCGGTCC GCCAAGCTCT CCAACGTGAT GATCGACCAG GCCGGCAGCG GCGGCTACGA CCTGTCGGTC CTGGGTCTGA AGTGA
|
Protein sequence | MKKETAVIPK FSIRAGAAAA ALALVLSACG SSSGAGGGSA MGPGFDLASK QVLNASTKTG GTINLVSSQS FDSIDPGVTY SAQTWNLFRM FARPMMAYEH TPGGNQIVGD LATGPGVQSN GGKSWTYTLR DNATFEDGTP ITSGDVRWAI ERSNWSSLVG NGPTYFHNIL TPPNDPRFTD LDVYKSGDKM FDNIIVTTDP KKITFNLPQA FGEFDYVMTM LQTAPVERTV DEKDSGDTYG KRPVSTGAYK IASYSPGKEL KLVRNAAYNQ QSDPNHMHTS LADAVDVQLG VDSGERDQEL LDGQADADLG SALTVANHAK VLQDPTLKSQ TDDAPDYSIA YSSINTELIP DVPCRQAIEY AVDKNTVLNQ LGGQWGGTVA TNLLTAGIPG AVQFPTYTYD PAKAKTLLAT CKTADPALFD SSGALTFKIA AQTNAPDLQN AATAIQASLA GVGISTQVTL FPFGQYSQYC GNQAYAKAHR LGMCLANWGP DWLTGYGMLD QLVTSNGIAA TGSQNYAFLN DTTVNSLEKE ALSSFEPSTQ QQDWVKIDHR VMDLAAYVPL MDRHIMRFRS AKLSNVMIDQ AGSGGYDLSV LGLK
|
| |