Gene Information |
Locus tag | Caci_3821 |
Symbol | |
ID | 8335174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4321811 |
End bp | 4323472 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644956960 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003114563 |
Protein GI | 256392999 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
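
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4321811
end_bp = 4323472

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Codons (including stop): {codon_count}, leftover bases: {remainder}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1662 bp) and Protein Length (553 aa) fields.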
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.302994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0761262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCAGA TATCCCGGCG CGGCTTCCTG AGCGTGTCCG CCGGCGTGGC CGGTCTGTCC CTCGCTGCCT GCGGGGGCGG CGGCGACGGC GGTTCGAAGC CGAGCAGCAA GCTGACCGCG AACCGCACCG GCGCCATGGC GAAGTACGGC GTCGGGGACC AGTTCAAGGC CACGGTCCCG CTGTCGTTCT CGGCCATGCT GCTGAGCAAC GCGAACTACC CCTACAAGGC CGACTGGGAA TTCTGGTCGG AGCTGACCAA GCGCACCAAC GTGACGCTGC AGCCGACGGT GATCCCGGCC AGCGACTACA ACCAGAAGCG AAGCGTCATG GTCAGCGCGG GCAACGCCCC GACGCTCATT CCGAAGACGT ACCACCCGGA CGAGGAGGCG TACATCTCCG GCGGCGCGAT CCTGCCGGTC AGCGACTACC TGGACCTGAT GCCGAACTTC CAGGACAAGG TCGCCAAGTG GAACCTGGCC GGCGACCTGG ATCAGCTGCG CGAGGCCGAC GGCAAGTTCT ACCTGCTGCC CGGACTGCAC CAGGACGTGT GGAAGGACTA CTCGCTGGCC ATCCGAACCG ACATCCTCAA GCAGCTGAAC CTGCAAGTCC CGCAGACCTG GGACGACCTG ACCACAGTGC TGCGCACGAT GAAGCTGACC TACCCGGACC GGTACCCGTT CTCCGACCGC TGGAGCACGG GGAGCACGAC GCCGCAGCCG GGCGCCAACA ACCTGCTGGC CATCCTCGGC GAGGCCCACG GCGTCTGGGC CGGCTGGAGC TACCAGCACG CGAACTGGAA CGCCGACGCG GGCAGGTTCG AGTACACCGG CGCCACGGAC CAGTACAAGG CGATGATCCA GTATCTCAAC ACCCTGGTGA GCGAGAAGCT GCTGGACCCG GAGAGCTTCA CCCAGAGCGA CGATCAGGCC CGGCAGAAGT TCGCCGACGG CCAGTCCTTC GTGATCAGCG CCAACGCCCA GGAGCTGGTC AACCACTACC GCAAGGACAT CGCCAAGATC TCCGGCGCCA CGGTGGCCAA GATCCCGGTG CCGATCGGCC CGATCGGCGC GGCCAAGACC GGCTACCGCA CCGAGAACGG CATGATGATC TCCAACAAGG CCAAGGACGG CAAGGACTTC GTCGCGCTGA TGCAGTTCAT CGACTGGCTC TGGTACTCCG ACGAGGGCCA GATGTTCGCC AAGTGGGGCG TGCCGGGCAC CACCTACACC GGCAGCGTCG ACGACGGCAC GTTCAAGCTG GCCCCGGACG TCACCTGGGC CGGGGTCAAC CCTTCGGGCA CCAAGAACCT CCAGGTCGAC TACGGGTTCT TCAACGGAGT GTTCGCCTAC GGCGGCAGCA CCAAGCTGCT CGACTCTCAG TTCCCCCCGG AGGAATTGGA GTTCCAGAAG GTGATGGACG CGCGCAAGAC GCTGCCATTG GCCCCGCCCG CACCGCTGAG CTCCGACGAC CGTGAGCAGG CGACGCTGTG GACGACGTCG CTGAAGGACT ACGTCGACCA GGAGACGCTC AAGTTCATCC TCGGCAAGCG TCCACTCTCG GAGTGGACGG CCTACGTCTC CGAGCTCAAG GGCAAGAACA GCGACCAGTA CATCAAGCTC GTGAACCAGG CCTACCAGGA CTTCAAGAAG AACCACGGCT GA
|
Protein sequence | MNQISRRGFL SVSAGVAGLS LAACGGGGDG GSKPSSKLTA NRTGAMAKYG VGDQFKATVP LSFSAMLLSN ANYPYKADWE FWSELTKRTN VTLQPTVIPA SDYNQKRSVM VSAGNAPTLI PKTYHPDEEA YISGGAILPV SDYLDLMPNF QDKVAKWNLA GDLDQLREAD GKFYLLPGLH QDVWKDYSLA IRTDILKQLN LQVPQTWDDL TTVLRTMKLT YPDRYPFSDR WSTGSTTPQP GANNLLAILG EAHGVWAGWS YQHANWNADA GRFEYTGATD QYKAMIQYLN TLVSEKLLDP ESFTQSDDQA RQKFADGQSF VISANAQELV NHYRKDIAKI SGATVAKIPV PIGPIGAAKT GYRTENGMMI SNKAKDGKDF VALMQFIDWL WYSDEGQMFA KWGVPGTTYT GSVDDGTFKL APDVTWAGVN PSGTKNLQVD YGFFNGVFAY GGSTKLLDSQ FPPEELEFQK VMDARKTLPL APPAPLSSDD REQATLWTTS LKDYVDQETL KFILGKRPLS EWTAYVSELK GKNSDQYIKL VNQAYQDFKK NHG
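
The gene sequence begins with GTG while the protein begins with M. This fits translation table 11 (bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The `translate` helper, the `START_CODONS` set and the compact codon-table construction are illustrative assumptions; the amino acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table built in TCAG order. Table 11 uses the same amino acid
# assignments as the standard code; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: alternative initiation codons listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGAACCAGA TATCCCGGCG CGGCTTCCTG"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the listed protein sequence (MNQISRRGFL). Passing the full gene sequence should work the same way, since the helper ignores the spaces between blocks.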