Gene Caci_5099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5099 
Symbol 
ID8336453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5856207 
End bp5857541 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content67% 
IMG OID644958198 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115800 
Protein GI256394236 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.291523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0224239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCACCA ACCTCATGAG GGGCACCGCC GCCCTCACCC TGGCCGTCAC CGCCATGGCG 
ATGACCGCAT GCAGCAGTAG CAGCTCCTCC AGCTCCGCGC CCAAAGGCGG TGCCGCCAGC
AGCGGCTCGC ACGGCAAGGT GACCCTGTCC TTCGTCAACT GGGACGGCGG CATGCAGTCC
GCCGTCGACC AGTGGAACAA GGCGAACCCC GACATCCAGG TGCAGCTGAC CAAGCCCTCG
GGCACCGGCT ACACGCTCTA CAACAAGCTG ATCACCAACA ACGCCGCCGG CACCAACCCC
GACGTCACCG AGGTCGAGTA CCAGGCGCTC CCGGCGCTGA TCGCCAACAA GGTGATCGTG
CCGATCGACC AGTACGTCGG CGACATCTCC GCCGACTTCG ACAAGTCCTC GCTCGCGCAG
GTCCAGTTCG AGGGCAAGAC CTACGGCGTC CCGCAGAACG TCTGCCCGAT GGTCTTCTTC
TACCGCAAGG ACATCTTCGA CTCCCTCGGC CTGAAGGCGC CGACGACCTG GGACGAGTAC
GCCGCCGACG CCGCGACCAT CCACGCCAAG AACCCCAAGC AGTACATCGG CAACTTCTCG
GCCGTGGACT CCGGCTGGTT CGCCGGGCTC GCGCAGCAGG CCGGCGCCAA CTGGTGGACG
ACGACCGGGA CCACCTGGAA CGTCGCCATC GACGACGCGC CGACCCAGAA GGTCGCGAAC
TACTGGAGCG GCCTGATCGA CAAGGGTCTG GTCTCCCCGG AGCCGAACTG GTCCCCGCAG
TGGAACACCG ACATGAACAA CGGCACGATC ATCGGCTGGG TCAGCGCGCA GTGGGCGCCG
AACCAGTTCC CCTCGATCGC CAAGGACACC GCTGGCAAGT GGGTCGCCGC GGCGCTTCCG
GCCTGGACCG CTGGGGACTC CACGGTCGGC ATCTGGGGCG GGGAGACCGA GGCGGTGACC
TCGAACTCCA AGCACCCGGC CGAGGCCGCG AAGTTCGTGA AGTGGCTCAA CGCCTCCTCC
GACGGTGTCA AGACACTGAT CCAGCAGGTG GACGTCTTCC CGGCCTCGCT GGCCAACCAG
AGCCAGGACT CGCTGAAGAC CCCGCCGCCG TTCATGTCCG ACCAGGCGGA CTACAACACG
CTGATCGCCT CCGCGGCGAA GAACGCTCGC ACCTTCCAGG TCTGGGGACC GAACGCGAAC
GTCACCTTCG ACGCCTACTC CAACGACTTC GCCGCCGCGC TGCAGAACAA GACGCCGCTG
ACCGCGGCGC TGACGCAGAT GCAGCAGGCG ACCGTCGCCG ACCTGAAGAA GCGCGGCTTC
TCCGTCACCG GCTGA
 
Protein sequence
MRTNLMRGTA ALTLAVTAMA MTACSSSSSS SSAPKGGAAS SGSHGKVTLS FVNWDGGMQS 
AVDQWNKANP DIQVQLTKPS GTGYTLYNKL ITNNAAGTNP DVTEVEYQAL PALIANKVIV
PIDQYVGDIS ADFDKSSLAQ VQFEGKTYGV PQNVCPMVFF YRKDIFDSLG LKAPTTWDEY
AADAATIHAK NPKQYIGNFS AVDSGWFAGL AQQAGANWWT TTGTTWNVAI DDAPTQKVAN
YWSGLIDKGL VSPEPNWSPQ WNTDMNNGTI IGWVSAQWAP NQFPSIAKDT AGKWVAAALP
AWTAGDSTVG IWGGETEAVT SNSKHPAEAA KFVKWLNASS DGVKTLIQQV DVFPASLANQ
SQDSLKTPPP FMSDQADYNT LIASAAKNAR TFQVWGPNAN VTFDAYSNDF AAALQNKTPL
TAALTQMQQA TVADLKKRGF SVTG