Gene Information |
Locus tag | Caci_1722 |
Symbol | |
ID | 8333065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1952625 |
End bp | 1954055 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644954872 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112484 |
Protein GI | 256390920 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
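The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1952625
END_BP = 1954055
LISTED_GENE_LENGTH = 1431    # bp
LISTED_PROTEIN_LENGTH = 476  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1431 476
```

Both computed values match the Gene Length and Protein Length rows of the record.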
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.902381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGCAC ATTCCGCACG GTCCGGAAAC CTTCGAACCC TGATAGCGGT CGGCGCGGCC GTCGCCGCCC TCGTCGCCGG CTGCTCCAGT TCCTCCTCCG GTTCCAAGCC GGCCGCCGTG AACTCCGCCG ACCAGCCGAA GAACCCCACG CTGGTGATCA CCGCCAACGA CATCGCGGGC GGGAAGAACA GCAACGAGGC GAACTGGATC CAGAAGACGC TGATCCCGGA CTTCGTGAAA GCCGAGGCGG CCAAGGGGAT CACCGCCCAC GTCACCTTCC GGCCCAACGG CGTGGACGAC AACGCCTACA AGTCCAAGCT CGCCCTGGAC CTGCAGTCCG GCACCGGCGA CGACGTCTTC TCCCTGGACG GCATCTGGGT CGGCGAGTTC GCCGACGCCG GCTATGTCAA GCCGCTGAAT CAGGTGGCCG GAGCCCAGGT CGACAGCTGG GACGGCTGGT CGCAGATCAC CCAAGCCGTC CAAGCCCTCG GCGAGTACCA GGGCAAGCGT TTCGGCGTCC CGAACGGCAC CGACGCCCGC GTCATCTTCT TCAACAAGAA GCTGTTCGCG CAGGCCGGGC TGCCCGCCGA CTGGCAGCCG ACGAGTTGGC AGGACCTCTA CGACGCGGCC GCCAAACTCA AGACCCTGCC AGGGGTCACC CCGGTCCAGT GGGACGGCGG CGTCCCGATG GGCGAGGCCA CGACGATGCA GGGCTTCCTG CCGCTGCTGT CCGGCGCGGG CGGCTCGTTG TGGGCGAACG GCAAGTGGAT GAAGGCCGGC ACGGCGTTCA CCTCGGCGCT CGGCTTCTAC CAGAAGATCT ACGGCGGCGG TTACGGCGAT CCGGTGTTGC AGGAGGACGC CAAGGGCCGC GACAAGTCCT TCACCGAGTT CGCGGCGAAC AAGATCGGCA TCTACGCCGA GTCCGACTAC ATGTGGCGCT CGGTCCTGAA TCCCACCGGC GGCACCGCGC CGATGGCCGA CCGCGACACC GATGTCGGCT ACGCGCTGAT CCCGTCGCAG ACCCCGAGCT CGGGTGTGAA GGGCCAGGGC TTCGTGTCCT ACTCGGGCGG TTCGGACTGG TCCATCAATC CCAAGACCAA GTATCCGCAG GCGGCGTGGG ACTTCCTGGC GTTCCTGAAC TCCAAGACCG AGACCGAGTC CCGGATCAGC GGCGCGCCGC TGCTCACCGC CCGGACCGAC GTGAACCAGC AGGTGCTCGG CAACGACCCG ATGCTGAAGT TCGCCACCGA CAAGGTGCTG CCGATCACCG CGTTCCGGCC GTCGCAGGCG GCGTACAACG ACGTGTCGAG CCTGGTCCAG AAGGCGGTCG CGGACGTGGT CGGCGGCAAG AGCCCGGAGC AGGCCGCGGC GGCCTACGAG AAGGCGTTGG AGGGGCTCGT TGGCGCGGAC AGCATCGCCG CGGGCAGCTG A
|
Protein sequence | MGAHSARSGN LRTLIAVGAA VAALVAGCSS SSSGSKPAAV NSADQPKNPT LVITANDIAG GKNSNEANWI QKTLIPDFVK AEAAKGITAH VTFRPNGVDD NAYKSKLALD LQSGTGDDVF SLDGIWVGEF ADAGYVKPLN QVAGAQVDSW DGWSQITQAV QALGEYQGKR FGVPNGTDAR VIFFNKKLFA QAGLPADWQP TSWQDLYDAA AKLKTLPGVT PVQWDGGVPM GEATTMQGFL PLLSGAGGSL WANGKWMKAG TAFTSALGFY QKIYGGGYGD PVLQEDAKGR DKSFTEFAAN KIGIYAESDY MWRSVLNPTG GTAPMADRDT DVGYALIPSQ TPSSGVKGQG FVSYSGGSDW SINPKTKYPQ AAWDFLAFLN SKTETESRIS GAPLLTARTD VNQQVLGNDP MLKFATDKVL PITAFRPSQA AYNDVSSLVQ KAVADVVGGK SPEQAAAAYE KALEGLVGAD SIAAGS
|
| |
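The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a blocked string might be normalized and how a GC percentage like the one in the GC content row could be recomputed. The helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example_blocks = "ATGGGAGCAC ATTCCGCACG"
example_seq = clean_sequence(example_blocks)
print(len(example_seq), gc_percent(example_seq))
```

Applying the same two functions to the full Gene sequence field gives a length and GC percentage that can be compared with the Gene Length and GC content rows.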