Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4682 |
Symbol | |
ID | 8336036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5332083 |
End bp | 5333780 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957782 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115384 |
Protein GI | 256393820 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00316141 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGTT CGAACGGCTT CTCCCGCCGC CAGGTGTTCA AGACCTCCGC GGCGATCGGC GGAGCGATAG CCGCCGCTCC GCTGCTGTCC GCCTGCGGTT CCGGCAAGGC GGCGACCAAG GCCAGCGGGG TCGCGCCCAA GTCCGCGGTG CAGGCGGTGC TGCCGACGTA CAAGCCGAGC AGCGCGGTCA CCGCCGACAT CCCCTCGGTC ACCGGCGCGA ACGGCGCGGC CAGCGACCCG GCGTTCCTGT CCTACCCGGC CAGCCCGCCG AAGTCGGTGA CCGGCGCGGT CGGCAACGGC GGCTCGTACT CGGCGGTCTC GCCGATCTGG GGCTCGGTCC CGGCGCCGGG GAACAGCTAC TACACGGCGG TGAACCAGGC TCTGGGCGCG ACGCTCACCG ACAGCCCGTC CGACGGCACC ACCTTCGCCA CGATGCTGGC GACCCGCTTC GCCTCCGGCA ACATCCCGGA CTGGCTGGAC GTGCCCGGCT GGAACGTCTC CTCGATTCAG AACTTCGCCG AGGGCGTCGA CAAGTTCTTC AAGGACCTGA CGCCCTACCT CGGCGGCGAC AAGGTGCTGG ACTACCCGAA CCTCGCCGCG ATCCCGACCG GCGGCTGGCA GGCCGCGGTG TGGAACGGCA AGCTGTACGG CATCCCGCTG TGGACCTCGG CGGCCAGCAT CCCCGGCGCG ATGTTCTACC GCGCGGACAT CTTCAAGGCC GCCGGCATCG ACGCCGCCTC GGTCACCACC GCCGACACCC TCAAGGCGGT CGGCAAGCAG GTCACCGTCC CGGCGAAGGG CCAGTACGCC TTCGAGGACC TCAGCTCCTT CCTCTACCAG CTGTTCAACG TCCCGGCGAA CAACGGGCGG ACCGGCTGGA AGCGCGACAG CACCGGCAAG CTGGTCAACG GCTACGAGGT GCCGGAGTTC CTGGAGATGC TGAACTTCGC CAACGGCCTG GCCAAGGGCG GCCTGATCCA CCCCGACGCG CTGGCCGGCG ACTCCTCGAA GGCCAAGAAC CGCTTCTGGG CCGGCAAGAC CGTGATCACC GCCGACGGCA CCGGGGCGTG GAACAAGGGC GACGCGCAAA GCGGCGTCGC GGCCAACCCC TCCTATGAGC GCCAGGCCTT CAAGATCTTC GCCTACGACG GCGGCAAGGC GACGATGCCC CTGTATCCGG GCGCCGGGAT GTTCTCCTAC CTGAACAAGA AGCTCTCCGA CGCGCAGGTC AAGGAGCTGC TGCGGATCGC CAACTACCTC GCCGCGCCGT TCGGCAGCGC CGAGTACCTG GTGTCGCGGT ACGGCAAGGA AGGCGTGGAC TACACGATGA CCAGCGGCGC GCCGATCCTC ACCGACCAGG GCAACAAGGA CGTCACCGAC ACCCTGGACC AGCTGGCCAA CTGCCAGTCG GTGACGTTCA ACGCCGGCTA CAACCAGATC ACCAAGGACT ACGCCGCCTG GCAGGGCGAC ATGGTGCAGC ACGCGTACAA GCCGCTGTTC TACGCGATGA ACATCAGCGA GCCGGCGCAG ACCGCGAAGG CGAGCACGGC GCTGGAGGCG GTCATCACCG ACGTGCGCAT GGGCCGCAAG AGCGTGGCGG ACTTCCAGTC GGCGCTGAGC ACTTGGCAGA ACGCCGGCGG CAACCAGCTG CGGGACTTCT ACGACGGCAT CGCCAAGCAG TACGGCACGG GGAACTGA
|
Protein sequence | MTSSNGFSRR QVFKTSAAIG GAIAAAPLLS ACGSGKAATK ASGVAPKSAV QAVLPTYKPS SAVTADIPSV TGANGAASDP AFLSYPASPP KSVTGAVGNG GSYSAVSPIW GSVPAPGNSY YTAVNQALGA TLTDSPSDGT TFATMLATRF ASGNIPDWLD VPGWNVSSIQ NFAEGVDKFF KDLTPYLGGD KVLDYPNLAA IPTGGWQAAV WNGKLYGIPL WTSAASIPGA MFYRADIFKA AGIDAASVTT ADTLKAVGKQ VTVPAKGQYA FEDLSSFLYQ LFNVPANNGR TGWKRDSTGK LVNGYEVPEF LEMLNFANGL AKGGLIHPDA LAGDSSKAKN RFWAGKTVIT ADGTGAWNKG DAQSGVAANP SYERQAFKIF AYDGGKATMP LYPGAGMFSY LNKKLSDAQV KELLRIANYL AAPFGSAEYL VSRYGKEGVD YTMTSGAPIL TDQGNKDVTD TLDQLANCQS VTFNAGYNQI TKDYAAWQGD MVQHAYKPLF YAMNISEPAQ TAKASTALEA VITDVRMGRK SVADFQSALS TWQNAGGNQL RDFYDGIAKQ YGTGN
|
| |