Gene Caci_5027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5027 
Symbol 
ID8336381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5758340 
End bp5759644 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID644958126 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115728 
Protein GI256394164 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0894368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.625584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACA CGCTTCCCTC CGCTAGATCG ACCCGGTTGC TCGCGGCATG CCTGGCCGCG 
GGGATCTCGC TCGCCGGGTG TTCGGCGGCG ACCTCCAGCA GCTCCAAGAG CAGCGGCTCC
GGCTCGGCGT CGCTGTCCTA CCTGACGTTC GAGACTCCGA GCCTGACCGC CTCGTTCTGG
GACACCTCGA TAGCCAGCGG TGAGAAGGCG GTCCCCGGCG TCACGATCAA GAAGCTGGTC
TCGCCGAGCA CCGACCGCGA CGCCTACGCC AAGCAGCTGC AGGCCTCCGG GCAGTTCCCG
GACCTGCTGC AGTCGATCAC GCCGTCCACC TTCGTACAGG CCGGGCTGCT CAAGCCCTAC
GACCAGAGCT GGGTGAACGC CAACTTCCTG CTCCCGATGG GCAACGCCTA CAAGGGCAAG
GTCTACATCC CGCCGACCAA CTCCCAGATC ATCCCGATGG TCTTCTACAA CAAGACGATG
TTCGCCAAGG CGGGCATCGC CAGTGCGCCG AAGACGTGGG CGGACTTCAT GGCCGACTGC
GCCAAGCTCA AGGCGGCCGG CATGACGCCG ATCGAGCTCG GCGGCGGCGA CCCGTTCGCG
GCCTCGATGC CGCTGACCGG GATCCTGTCG GCGGACGTGC TCGGCAAGGA CCCGAACTGG
CTGCAGGAAC GCTACGCGGG CAAGGTCAAG TTCACCGACG CGAACGTCGA GACCGCGGTC
GCCAAGTACC GGACGATGGC CAAGAACGGG TACTTCGAGG ACGGCGCGCT GGGCGTGAAG
TACGCCGACT CCATCACGAA CTTCACCTCC GGCAAGGCCG CGATGTACAT GATGGGCAGC
TGGTTCCTCG GCTCGGTGCC CAAGGACCAC GCCGACGACT TCGGCTCCTT CATGACCCCC
ACCGACGACG GCTCCCTGGT GGTGCCGTTC TCGGTCGGGG GCTCGATGGC GATCAGCGCG
AAGACCCCGG CGCCGGACAA GGCGACCGCG TTCGCCGAGG CCTGGTCCAC CGACCCGGCG
AACCTGAAGA CCCTGATCGA GGGCGACGGT GCGTTCCCGA TGCTCAAGGG CAAGACCCTG
GCCGACTACA ACGTGACCGT GACGCAGGTG TTCAAGGACT CCTATGCCTA CGTCACGGAC
AAGAACACGA AGGTCTCCTC GATCGGGTGG GCGACCAACG ACGACTCCAT GCCCTCGGGT
CTGAACGACG CCTACTACGC GGCCAGCCAG GCTCTGTTCA CCTCCGACGA CGTCGCGGGT
CAGATGGCCA AGCTCGACTC CGCGTGGAAC GCGGCGACCA AGTGA
 
Protein sequence
MRHTLPSARS TRLLAACLAA GISLAGCSAA TSSSSKSSGS GSASLSYLTF ETPSLTASFW 
DTSIASGEKA VPGVTIKKLV SPSTDRDAYA KQLQASGQFP DLLQSITPST FVQAGLLKPY
DQSWVNANFL LPMGNAYKGK VYIPPTNSQI IPMVFYNKTM FAKAGIASAP KTWADFMADC
AKLKAAGMTP IELGGGDPFA ASMPLTGILS ADVLGKDPNW LQERYAGKVK FTDANVETAV
AKYRTMAKNG YFEDGALGVK YADSITNFTS GKAAMYMMGS WFLGSVPKDH ADDFGSFMTP
TDDGSLVVPF SVGGSMAISA KTPAPDKATA FAEAWSTDPA NLKTLIEGDG AFPMLKGKTL
ADYNVTVTQV FKDSYAYVTD KNTKVSSIGW ATNDDSMPSG LNDAYYAASQ ALFTSDDVAG
QMAKLDSAWN AATK