Gene Caci_4955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4955 
Symbol 
ID8336309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5657201 
End bp5658577 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content69% 
IMG OID644958054 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115656 
Protein GI256394092 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.117227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTGA AGTTCCGCGC CATCGGCGCG CCAGGCCCGG GCCGGCCGCG TCGGCGTGCC 
GGGCTGCTCG CGCTCGGAGC CCTGCTCGCC CTCGGCGCCG CCGCCTGCGG CACGAGCAGT
GGAAAGGCCA ACACGCCGAG CACCGGTGGC AGCTTCAAGA CCGCGGCGCA GACCGGCGGG
ACCCTGACGG TCTGGGTCGA CTCGGACCGG CTGGCCGCCG CGAAGCTGTA CCAGAAGGCC
CATCCCGAGG TGAAGATGGA CATCGTCACC TACGACGGGG ACGCCAACGG CTCCAACTAC
CTGCAGACCA AGGTGTCCCT GTTCAACCGG ACCAGCTCGG GCTGGCCGGA CGTGGTCTTC
AGTTCGCAGA ACAACGAGAC CAGCTGGGCG GTGCAGGCCG GGTTCGCCGC ACCGCTGAAC
AAGGGTCTGA TACCGCAGGC CACCCTGGAC GGCTGGGCCA CCGACGCCAA CGCCCCGTGC
ACCGTGGACG GCACCGTCTA CTGCCTGCGC AACGACCTGT CGCAGACCGT GCTCTGGTAC
AACGACAAGC TGATGAAGCA GTGGGGCTAT CAGGTCCCCA CGACCTGGGA GCAGTACCAG
GCCCTCGGTG AGAAGGTCGC CGCCGAGCAC CCCGGCTACC TGGTCGGCGC CGCCGGCGAC
ACGTTCGCTC CGGAGATCTA CCTGTGGGCG GGCAAGTGCG GCGCCAACCA GATCACCGGG
CCCAAGGCGG TCACGGTCGA CGCCACCAGC GCGGCCTGCA CCAAGATGGC CACGCTGATG
GACACCCTGA TCAAGGACAA GACCCTGTCG ACGTCGAGCG TGTTCAGCTC CGACTTCGAC
AAGAACGAGG CTGACAAGAT CCTGATGATG CCCGGCCCGT CGTGGTACGG CGGAGCGCTG
TTCCAGGGCA CGTTCAAGAC CCCGGCCGGG CAGCTCGGCG TGGCGCCGAT GCCGCAGTGG
TCCGGCGACT CCAGCCCGTC GGTGGGCAAC GTGGGCGGCG GCACCTGGCT GGTCTCGGCG
CACAGCAAGA ACCTGAAGGC CTCGACCGAC TTCGTGACCT GGGTCACCAC CTCGGATGAC
TACCAGGGCA AGCTGGCGCC GGGCTTTCCG GCGTACACGG CGGCGGCCAA GACGTGGCTG
GCGGCGCAGC AGTCCTCCGG GTACTACGCC GACGACATCA CCGCGCCGCT CACCGCCGCG
GCGAACCAGG TCTGGGCGGG CTGGGGCTAC GGGCAGTTCA GCCAGGAGGC GGTCTGGGCC
GCGACCGTCA CCCCCGGCGT CAACGCCGGC AAGACCATCG TCTCGCTGCT GCCGGCCTGG
CAGGACTCGA TCGTGAACCA CGCCAAGGCC GACGGATACC AGGTGGCGAC GAAGTGA
 
Protein sequence
MTVKFRAIGA PGPGRPRRRA GLLALGALLA LGAAACGTSS GKANTPSTGG SFKTAAQTGG 
TLTVWVDSDR LAAAKLYQKA HPEVKMDIVT YDGDANGSNY LQTKVSLFNR TSSGWPDVVF
SSQNNETSWA VQAGFAAPLN KGLIPQATLD GWATDANAPC TVDGTVYCLR NDLSQTVLWY
NDKLMKQWGY QVPTTWEQYQ ALGEKVAAEH PGYLVGAAGD TFAPEIYLWA GKCGANQITG
PKAVTVDATS AACTKMATLM DTLIKDKTLS TSSVFSSDFD KNEADKILMM PGPSWYGGAL
FQGTFKTPAG QLGVAPMPQW SGDSSPSVGN VGGGTWLVSA HSKNLKASTD FVTWVTTSDD
YQGKLAPGFP AYTAAAKTWL AAQQSSGYYA DDITAPLTAA ANQVWAGWGY GQFSQEAVWA
ATVTPGVNAG KTIVSLLPAW QDSIVNHAKA DGYQVATK