Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5099 |
Symbol | |
ID | 8336453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5856207 |
End bp | 5857541 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644958198 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003115800 |
Protein GI | 256394236 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.291523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0224239 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCACCA ACCTCATGAG GGGCACCGCC GCCCTCACCC TGGCCGTCAC CGCCATGGCG ATGACCGCAT GCAGCAGTAG CAGCTCCTCC AGCTCCGCGC CCAAAGGCGG TGCCGCCAGC AGCGGCTCGC ACGGCAAGGT GACCCTGTCC TTCGTCAACT GGGACGGCGG CATGCAGTCC GCCGTCGACC AGTGGAACAA GGCGAACCCC GACATCCAGG TGCAGCTGAC CAAGCCCTCG GGCACCGGCT ACACGCTCTA CAACAAGCTG ATCACCAACA ACGCCGCCGG CACCAACCCC GACGTCACCG AGGTCGAGTA CCAGGCGCTC CCGGCGCTGA TCGCCAACAA GGTGATCGTG CCGATCGACC AGTACGTCGG CGACATCTCC GCCGACTTCG ACAAGTCCTC GCTCGCGCAG GTCCAGTTCG AGGGCAAGAC CTACGGCGTC CCGCAGAACG TCTGCCCGAT GGTCTTCTTC TACCGCAAGG ACATCTTCGA CTCCCTCGGC CTGAAGGCGC CGACGACCTG GGACGAGTAC GCCGCCGACG CCGCGACCAT CCACGCCAAG AACCCCAAGC AGTACATCGG CAACTTCTCG GCCGTGGACT CCGGCTGGTT CGCCGGGCTC GCGCAGCAGG CCGGCGCCAA CTGGTGGACG ACGACCGGGA CCACCTGGAA CGTCGCCATC GACGACGCGC CGACCCAGAA GGTCGCGAAC TACTGGAGCG GCCTGATCGA CAAGGGTCTG GTCTCCCCGG AGCCGAACTG GTCCCCGCAG TGGAACACCG ACATGAACAA CGGCACGATC ATCGGCTGGG TCAGCGCGCA GTGGGCGCCG AACCAGTTCC CCTCGATCGC CAAGGACACC GCTGGCAAGT GGGTCGCCGC GGCGCTTCCG GCCTGGACCG CTGGGGACTC CACGGTCGGC ATCTGGGGCG GGGAGACCGA GGCGGTGACC TCGAACTCCA AGCACCCGGC CGAGGCCGCG AAGTTCGTGA AGTGGCTCAA CGCCTCCTCC GACGGTGTCA AGACACTGAT CCAGCAGGTG GACGTCTTCC CGGCCTCGCT GGCCAACCAG AGCCAGGACT CGCTGAAGAC CCCGCCGCCG TTCATGTCCG ACCAGGCGGA CTACAACACG CTGATCGCCT CCGCGGCGAA GAACGCTCGC ACCTTCCAGG TCTGGGGACC GAACGCGAAC GTCACCTTCG ACGCCTACTC CAACGACTTC GCCGCCGCGC TGCAGAACAA GACGCCGCTG ACCGCGGCGC TGACGCAGAT GCAGCAGGCG ACCGTCGCCG ACCTGAAGAA GCGCGGCTTC TCCGTCACCG GCTGA
|
Protein sequence | MRTNLMRGTA ALTLAVTAMA MTACSSSSSS SSAPKGGAAS SGSHGKVTLS FVNWDGGMQS AVDQWNKANP DIQVQLTKPS GTGYTLYNKL ITNNAAGTNP DVTEVEYQAL PALIANKVIV PIDQYVGDIS ADFDKSSLAQ VQFEGKTYGV PQNVCPMVFF YRKDIFDSLG LKAPTTWDEY AADAATIHAK NPKQYIGNFS AVDSGWFAGL AQQAGANWWT TTGTTWNVAI DDAPTQKVAN YWSGLIDKGL VSPEPNWSPQ WNTDMNNGTI IGWVSAQWAP NQFPSIAKDT AGKWVAAALP AWTAGDSTVG IWGGETEAVT SNSKHPAEAA KFVKWLNASS DGVKTLIQQV DVFPASLANQ SQDSLKTPPP FMSDQADYNT LIASAAKNAR TFQVWGPNAN VTFDAYSNDF AAALQNKTPL TAALTQMQQA TVADLKKRGF SVTG
|
| |