Gene Caci_2031 details


Gene Information

Locus tag: Caci_2031
Symbol:
ID: 8333375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2300510
End bp: 2301838
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 644955181
Product: extracellular solute-binding protein family 1
Protein accession: YP_003112792
Protein GI: 256391228
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
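
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2300510, 2301838)
print(length, protein_length(length))  # prints: 1329 442
```

Both values agree with the Gene Length and Protein Length fields of this record.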


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.298737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAATGA GGACAACACG GCGTTCGGCA CTGCGGCTGG GCATCGGGGC CCTCGCGGTC 
GCGGTGACCG GCGGATGCGC GACCGGCGGC GGGAAGAAGA CTCCGGCGCC GGCTGTGAAG
ACCGGCGCCG CGGCTCAGGT GGGCGGGACC ATCACGGTGT GGTCCTGGGA CGTGGCGGCC
AAGGCGCTCA AGCGGCTGGC GCCCGCGTTC GAGCAGCAGC ACCCGGGCGT GAAGGTGAAC
GTCGTCGACA TCGGCTACGA CAACGCCTAC GACAAGATCA CCGTCGGCCT GAAGTCCGGC
TCCGGACTCC CCGACGTCCT GCAGGTCGAG GGCCCGAAGA TGCAGAGCTA CATCGGCACC
TTCCCCAGCG GCTTCTACGA CCTCAGCACC CTGGCGGCAC CGCTGAAAGC GCAGTTCAAC
GCCGCCGCCT GGGCCACCGG CACGGACGCG AACGGCAAGG TCTACGCGCT GCCCTGGGAC
ATCGGGCCCT GCGGCGTGTT CTACCGGACC GACATCTTCC AGCAGGCCGG CGTCGACCCC
GCGTCGATCC AGACCTGGGA CGACTACATC GCCGCCGGCG TGCGGATCAA GGCCAAGACC
GGCAAGAAGC TGCTGGTGGT GGACCCCACC GGCGACAGCA CGTTCCCGAT GATGCTCCAC
CAAGAAGGAC AGGGCTACTT CGTCGGCGAC AAGATCGCCG TCGACACCCC GGCGGCGGTG
AAGGCCATGA CCGTCATGAA GGAACTCAAC GACAAAGGCC TGGTCGACTA CGAAAAGGGC
TGGGACGCCC TGGTCGCCGC CACCAAGGAC GGCACCGTCG CCACCACCCC GACCGCGGTC
TGGTGGTCCG GCACCCTCAC CGACGAGATG CCGGAGCTGA AGGGCAAGTT CGCCGCGATC
CCGCTGCCCG CCTTCACTTC CGGCGGCATC CGTACCTCCA ACAACGGCGG CTCCCTGCTC
ACCATCTCAG CGCAGAGCAA GAACTCGGCG ACCGCCTGGG CGTTCATCCA GTTCGTCCTG
GCCGACGCCG ACAACCAGGT CTCGATGCTG AAGAACGAAG GCATCTTCCC CGCCTTCGAG
CCGGCCCTGT CAGACCCCTA CATCACCGGC CCGCAGGACT ACTACGGCGG CCAAACCACC
TTCAAGATCT TCGCCGACCT GGCCAAGAAC ATCCCGGCAG TGCAGTACAC CGCCGACTTC
TCCAAAGCCT CCGACCTGAT CAACACCGCC ACCGGCGCGG TGATGCAAGG CGGCAAGGAC
CCGAAGTCCG CCCTGGACTC GGCAGCGCAA CAGATCGCCT CCGCGACCAA TCGGCAGATC
GCGCACTAG
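
A GC content figure such as the one reported for this gene can be recomputed from the sequence block. The sketch below is one minimal way to do it, assuming GC content means the percentage of G and C among the A, C, G and T characters, with the spaces and line breaks of the 10-base layout ignored. The function name is illustrative, and how the database rounds the value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so a block in the
    10-base-per-group layout used above can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence, as a small usage example.
print(gc_content("GTGTCAATGA GGACAACACG"))  # prints: 50.0
```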
 
Protein sequence
MSMRTTRRSA LRLGIGALAV AVTGGCATGG GKKTPAPAVK TGAAAQVGGT ITVWSWDVAA 
KALKRLAPAF EQQHPGVKVN VVDIGYDNAY DKITVGLKSG SGLPDVLQVE GPKMQSYIGT
FPSGFYDLST LAAPLKAQFN AAAWATGTDA NGKVYALPWD IGPCGVFYRT DIFQQAGVDP
ASIQTWDDYI AAGVRIKAKT GKKLLVVDPT GDSTFPMMLH QEGQGYFVGD KIAVDTPAAV
KAMTVMKELN DKGLVDYEKG WDALVAATKD GTVATTPTAV WWSGTLTDEM PELKGKFAAI
PLPAFTSGGI RTSNNGGSLL TISAQSKNSA TAWAFIQFVL ADADNQVSML KNEGIFPAFE
PALSDPYITG PQDYYGGQTT FKIFADLAKN IPAVQYTADF SKASDLINTA TGAVMQGGKD
PKSALDSAAQ QIASATNRQI AH
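
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is a minimal version, assuming the amino-acid assignments of NCBI table 11 (the same as the standard code) and assuming that the initiation codon is read as methionine, which is why the leading GTG gives M rather than V. The names `AMINO_ACIDS`, `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS; the first codon becomes M, translation stops at a stop codon."""
    dna = "".join(b for b in seq.upper() if b in BASES)  # drop spaces and newlines
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)  # assumption: start codon read as Met
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("GTGTCAATGA GGACAACACG GCGTTCGGCA"))  # prints: MSMRTTRRSA
print(translate("GCGCACTAG"))  # prints: MH
```

The first call reproduces the first ten residues of the protein sequence. The second call prints MH rather than AH only because the fragment is treated as if it began at the start codon; in the full gene these codons give the final residues AH followed by the TAG stop.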