Gene Information |
Locus tag | Caci_5370 |
Symbol | |
ID | 8336724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6191694 |
End bp | 6193583 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958468 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003116070 |
Protein GI | 256394506 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
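The coordinate and length fields above are related by simple arithmetic, which makes them easy to sanity-check. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Caci_5370.
length_bp = gene_length(6191694, 6193583)
print(length_bp)                  # 1890
print(protein_length(length_bp))  # 629
```

Both results agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields listed in the record.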
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.358498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAACC CCAACCCACG ACGACGTGCT TTCGCCCTGG CGGCGGCCCT GGCCGGGGCG GTGGCGCTGA GCGCGCCGGC CGCGGCCGCA TCCGGGCCGG CGCAGGCGCG GGCGCGCAGC TCTTTTAGCC AGACCTCTGA CGCTCAGGCG GCGCCGGCTT CGGCGGCTGC CAGCGGCAAG ACCTTGACCG TGGCGACCAC CGGCAGCATC GACTCCCTGT CGCCGTTCCT GGCGCAGCGG GCGCTGCCCA CCCAGATCCA CCGCCTGATC TACGACTTCC TGACGAACTA CGACGCCTCC GACGACCACG CGATCGGCGC CCTGGCCACC TCCTGGACCA CCTCGACGGA CAAGCTGACC TGGACCTTCA CCTTCCGCGA CGGAATGAAG TGGTCCGACG GCCAGCCGGT CACCGCCGCC GACGCGGCCT TCACCTACAA CCTGATGATG ACCAACGACG ACGCGGCCAC CGCGAACGGC AACTTCGTCA CCAACTTCGC CAAGGTCACC GCGACCGGCA ACCAGCTGGT CATCACCTTG AAGCAGCCGC AGTCCACGAT GCTCGCGCTG GACATCCCGA TCGTGCCGCA GCACGTCTGG GCCTCGCACG TCGCCGACAT CGCCACGTTC AACAACGACG CCCAGTTCCC GGTCGTGGGC GACGGGCCGT TCATCCTCAC CGGCTACCAG AAGGACCAGT ACCTCACCCT GGACGCCAAC CCGAACTACT GGCGCGGCAA GCCCGGCTTC GACCACCTGG TGTTCAAGTT CTTCAAGGAC GCCGACGCCG AGGTGGAGGC GCTGAAGAAG GGCGAGGTCG ACTTCGTCAG CGGCCTGACC CCGGCGCAGT ACGACGCGCT GAAGGGCCAG TCGGGCATCG CCACCAACAA CGCGCAGGGC AAGCGGTTCT ACGCCCTGGC GATGAACCCC GGCGCGACCA CCACCACCGG GCAGGCGTTC GGCGACGGCA GCCCGGCGCT GCAGAACCAG CAGTTCCGCC AGGCGCTGAT GTACGCGATC GACACCAAGA CGCTGGTCGC CAAGACCCTC GGCGGCTACG GCACGGTCGG CAGCGGCTAC ATCGCCCCGA TCTTCGCCGC CTACCACTGG GCTCCGGACC CGGCCACCGC CTACACCTAC GACCCGGCCA AGGCGAACCA GATGCTGGAC GCCGCCGGGT TCAAGAAGGG CTCGGACGGC ATGCGCACGC TGCCCGACGG CAAGCCGCTG AAGCTGCGCC TGATGGGCGA GACCAACCGG GCCGACGACA CCCAGAACGT CGCCTACGTC GCCGACTGGC TCAAGGCCGT CGGGATCGCC ACCACCACCA CGGTCGTGGA CCAGGGCAAG CTCGCCGACA CCGAGACCGC CGGCACGTTC GACCTGGCCT TCGACAGCTG GGGGGAGAAC CCGGACCCGG ACGCCGTGCT GTCGATCCAG AAGTGCGACG GCCGGCCCGC CGCGCAGGGC AAGAACTTCA ACGGCGACGA CTTCATCTGC GACCAGGACT ACGACGCCCT GTACCAGAAG CAGATCACCG AGTACGACCC GGCCGCGCGC GCCGCCGACG TCAAGCAGAT GGAGCAGAAG CTCTACACCG ACGCCTACAT CAACGTCCTG TATTACGGGA ACGTGCTGGA GGCCTACCGC TCCGACGTCA TCGGCTCCAT GGACAAGCAG CCGCAGCCCA ACGGCCTGTA CTGGGGTCAG GACGGCTACT GGTCCCTGTG GTCGGCCAAG CCCGTGGCCG CCTCCTCCTC GTCGTCCTCG TCGAGCTCGA ACACCGGTCT GATAGTCGGC 
ATCGTGATCG CGATCGTGGT GGTCGGCGGC GGCGGTGCCC TGCTCCTGAC CCGCCGGCGC CGCGGCACCA CCGCCGACGA ACGCGAGTAG
|
Protein sequence | MPNPNPRRRA FALAAALAGA VALSAPAAAA SGPAQARARS SFSQTSDAQA APASAAASGK TLTVATTGSI DSLSPFLAQR ALPTQIHRLI YDFLTNYDAS DDHAIGALAT SWTTSTDKLT WTFTFRDGMK WSDGQPVTAA DAAFTYNLMM TNDDAATANG NFVTNFAKVT ATGNQLVITL KQPQSTMLAL DIPIVPQHVW ASHVADIATF NNDAQFPVVG DGPFILTGYQ KDQYLTLDAN PNYWRGKPGF DHLVFKFFKD ADAEVEALKK GEVDFVSGLT PAQYDALKGQ SGIATNNAQG KRFYALAMNP GATTTTGQAF GDGSPALQNQ QFRQALMYAI DTKTLVAKTL GGYGTVGSGY IAPIFAAYHW APDPATAYTY DPAKANQMLD AAGFKKGSDG MRTLPDGKPL KLRLMGETNR ADDTQNVAYV ADWLKAVGIA TTTTVVDQGK LADTETAGTF DLAFDSWGEN PDPDAVLSIQ KCDGRPAAQG KNFNGDDFIC DQDYDALYQK QITEYDPAAR AADVKQMEQK LYTDAYINVL YYGNVLEAYR SDVIGSMDKQ PQPNGLYWGQ DGYWSLWSAK PVAASSSSSS SSSNTGLIVG IVIAIVVVGG GGALLLTRRR RGTTADERE
|
| |
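The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that in plain Python, under two stated assumptions: the sequence is given 5' to 3' on the coding strand (the record lists strand +), and the amino acid assignments of translation table 11 match those of the standard code, so the two tables differ only in which codons may act as start codons. The helper `translate` and the constants are hypothetical names for illustration. Whitespace is stripped first because the record prints the sequence in space-separated blocks of ten.

```python
from itertools import product

# NCBI codon ordering: first base varies slowest, third base fastest, over "TCAG".
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGCCCAACC CCAACCCACG ACGACGTGCT"))  # MPNPNPRRRA
# Last two blocks of the gene sequence, which are in frame and end in TAG.
print(translate("ACCGCCGACGA ACGCGAGTAG"[1:]))
```

The first call yields `MPNPNPRRRA`, matching the first block of the listed protein sequence. Passing the full gene sequence would be expected to return all 629 residues, since the final codon `TAG` is a stop.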