Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6904 |
Symbol | |
ID | 8338270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7972883 |
End bp | 7974097 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644959991 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003117582 |
Protein GI | 256396018 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000382644 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00751503 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACACT CGCGCTGGGC GGCGACGCTC GTCGCTGCCG GTATCGTCGC GGCGGGCCTC AGCGGCTGTT CCTCAAGCTC AGGGGGCGGC AGTTCGGACT CTCTGACGGT CCAGGACTAC TACGCGGAAC CGCAGGCCGG CCAGATGAAG GCGATCTACG ACTCGTGCGC CTCGCAGCTG GGCGTCAAGA TCAACATCGT GCACGTGGCA TCGAACGGTC TGATCGCCAA GGTGCTGCAG CAGGTCTCCT CGAAGACGAT GCCCGACGTT CTGATGCTCG ACAACCCGAA CGTGCAGCAG ATAGCGGCCT CCGGAGCTCT CGCGCCACTG TCGCAATTCG GGATCACCGG AGACGGATTC GCCAAGGGAG CCGTGTCTGC GGGCTCGTAC AACGGCAAGC TGTACGCAGT GCCACCCGTG CTGAACTCGA TCTCGCTGTT CTACAACAAG GACATCCTGT CGCAGGCGGG TATCACGCCG CCGAAGACCT GGGACGAGCT GGCCGCGGAC GCCAAGCAGC TGACGAAGCC CGGCCGTTAT GGCTTCGCGT TCAGCGCTGC CAACACCGGC GAGGGCACGT GGACGTTCCT GCCGTTCATG TGGAGCAACG GCGGGGACGA GACGAACATC GCCACCCCCC AGACGGCGCA GGCCCTGCAG TACCTCACCG GTCTCGTCAG CAGCGGTTCC GCATCCAAGA GCGTGGTCAA CTGGACACAG GCGGATGTGA ACGACCAGTT CATCGCCGGC AAGGCGGCCA TGATGATCAA TGGTCCCTGG CAGATTCCGG CGCTGGACAA GGCCGGTGTG CACTGGGCCA GCGTGAGCAT CCCGACGCGC GAGGCCGGCC AGACCGTGGT TTCGCCGCTG GGTGGGGAGA CGTTCAGCGT TCCCAACACC GGGCACTCCG CGTCGATGAA AAAGGCCGCA CAGTTCGTGA GCTGCCTGAC CAACGACCAG AACGAGGCGA CGAAAGCGGC CAATGAGGAC GCGGTCCCCT CGCGAACGGA TGCCGCAGCC AAGTTCGCCT CATCCAATCC GGAGCTGGCG TCCTTCGTGA GCATCGTGGC CGACGGCCGC TCTCGCACGG CGCAGTTGGG CGCGAAGTGG CCCGCGACGG AAACGGCGAT CTACACAGCG GTGCAGGCGG CCATCACGGG CGAGGCGTCG CCCCAGGCCG CACTTCAGCA GGCGCAGTCG CAGATCAGCA AGTAG
|
Protein sequence | MKHSRWAATL VAAGIVAAGL SGCSSSSGGG SSDSLTVQDY YAEPQAGQMK AIYDSCASQL GVKINIVHVA SNGLIAKVLQ QVSSKTMPDV LMLDNPNVQQ IAASGALAPL SQFGITGDGF AKGAVSAGSY NGKLYAVPPV LNSISLFYNK DILSQAGITP PKTWDELAAD AKQLTKPGRY GFAFSAANTG EGTWTFLPFM WSNGGDETNI ATPQTAQALQ YLTGLVSSGS ASKSVVNWTQ ADVNDQFIAG KAAMMINGPW QIPALDKAGV HWASVSIPTR EAGQTVVSPL GGETFSVPNT GHSASMKKAA QFVSCLTNDQ NEATKAANED AVPSRTDAAA KFASSNPELA SFVSIVADGR SRTAQLGAKW PATETAIYTA VQAAITGEAS PQAALQQAQS QISK
|
| |