Gene Caci_2418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2418 
Symbol 
ID8333767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2735850 
End bp2737193 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content69% 
IMG OID644955571 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003113177 
Protein GI256391613 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGC GCATCACCGC GGCCGCCGGC CTGGCCGCCC TCGCCTTGAC CGCCACGGCC 
TGTGCCGGCG GCGGGACCTC CACGCCGAAG TCCGACGGCG GCTCCTCCGC CAAGAGCACC
GGCTCCGGCA CCGAGCTCGC GGTGCCCGCC GGCCCGGTCA CCATCACCTT CGAGGAGGCG
ATGACGATCG GCACGCTGAA GCCGGCGATG GACAAGCTGG TCTCGGACTT CCAGGCCAAG
TACCCGAACA TCACGGTCAA GCTCCAGGGC GAGCCGGACT ACGCCACGAT GTACACCAAG
GAGAAGGCCG AGGTGCAGGC CGGGAACGCC CCGACCATCG GCCAGGCCTA CGAGAGCTGG
GCCTCCTACT TCCAGTCCTC CGGCGTGCTG GCGCCGATCA GCGACCTGGC CGGGACCGAC
ACCCCGCCGG CGATGTCCAC GTTCTACAAG GGCATCCAGG CCGACATGAA GCTGCCGGAC
GGCAAGACCT GGATGTGGCC GTTCAACAAG AGCGTGCTGA TCCTGTTCTA CAACGCCGAC
CTGATGGCCA AGGACGGTCA GAGCGAGCCC AAGACCTGGG ACGACTACGC CAGCGTCATG
AAGGCGGTCT CCAAGGACGG CGTCACCGGC TCCACGGTCG ACCCGGGCTC GGCCAAGGCC
GCGCAGTACG GCACGCAGTG GTTCGAGATC CTGGCCAAGG CCAACGGCGC GACCCTGTAC
GACGCCGACG GCACCCCGCA CCTGAACGAC CCCGGCGTGG TCAAGGCCCT GCAGTACATG
AAGGACCTCA AGGACGCCAA CGCGCTGGCG ACCGGGACCA ACTACCCCGG CGAGACCGCG
CTGGGCGCCC AGAAGGGCAT GTTCGACATC TCCTCGGCGG CCGGCTACGG CTTCGAGAAC
AAGACCGTCG GCGGCAAGTT CAAGCTCGGC ATCAGCGCGC TGCCCTCGGG CCCGGCCGGC
GCCGTGAACC AGCTGACCGG GACCAACATA GTGGTCTTCA AATCCGCCAG CGCCGACCAG
AAGGCGGCGG CCTGGGCGTT CCTGAAGTTC ATCACCAGCC CCGCCGAGCA GGCGCAGTGG
GCGGCGACCT CCGGCTACCT GCCGGTGACC AGCCAGGCCC TCTCCGACCC GGTGCTGCAG
GCCTTCGTGG CCAAGAACCC GTATGAGACC GCCGCGGTCT CCGAGCTCGA CACCGCCTTC
ACCCTGCCCG GCTTCTCCTG GATCTTCCAG TGCCAGGGCT ATGAGGCCAC CGCGATCCAG
GAGGCGCTGG AGAACGGCAA GCAGCCCTCT GACGCGCTGA ACACCGCGCA GTCCGCCTGC
GCCGCCGCGA AGGCACAGGG GTGA
 
Protein sequence
MRKRITAAAG LAALALTATA CAGGGTSTPK SDGGSSAKST GSGTELAVPA GPVTITFEEA 
MTIGTLKPAM DKLVSDFQAK YPNITVKLQG EPDYATMYTK EKAEVQAGNA PTIGQAYESW
ASYFQSSGVL APISDLAGTD TPPAMSTFYK GIQADMKLPD GKTWMWPFNK SVLILFYNAD
LMAKDGQSEP KTWDDYASVM KAVSKDGVTG STVDPGSAKA AQYGTQWFEI LAKANGATLY
DADGTPHLND PGVVKALQYM KDLKDANALA TGTNYPGETA LGAQKGMFDI SSAAGYGFEN
KTVGGKFKLG ISALPSGPAG AVNQLTGTNI VVFKSASADQ KAAAWAFLKF ITSPAEQAQW
AATSGYLPVT SQALSDPVLQ AFVAKNPYET AAVSELDTAF TLPGFSWIFQ CQGYEATAIQ
EALENGKQPS DALNTAQSAC AAAKAQG