Gene Caci_5949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5949 
Symbol 
ID8337311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6868172 
End bp6869332 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID644959053 
Productsolute-binding protein 
Protein accessionYP_003116648 
Protein GI256395084 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGG CGATACTCGC AATCACCGCA CTCGGCGCCG CGGTTGCGTT GAGCGCAGCT 
GGATGCAGTA GCTCTAAGAG CAGCAGCAGC GGTTCCACCA CGGGCGGTGG CAGCTCGACG
ACCAGCGCCG CTGCGGGCTC CAGCAGCAGC AGTAGCAGCA GTGGCACGCC CACGTACAAG
AACAACAAGG TCGGCATCCT GCTGCCGGAC ACCAACTCCT CGCCGCGGTG GGTCAACTCC
GACCCCGACG AGCTGAAGAC GCAGTGCGCG CAGTACGGCC TGACCTGTGA CATCCAGAAC
TCCAACGGTT CTGCCACGAC GATGACCTCG CAGGCGCAGT CGATGCTGAA CGAGGGCGTC
GGCGTGCTGA TGCTCACCAA CCTGGACTCC GGCTCGGCCA AGGCGATCGA GGCGCAGGCG
CAGGCCAAGG GCGTCGTCAC CATCGACTAC GACCGGCTCA CGCTCGGCGG CACCGCGCAG
TACTACGTCT CCTTCGACAA CGTGGCCGTC GGCAAGGCGC AGGGCACCGC CCTGACCAAG
TGCACCCAGG TCGCCGGCAA GACCGCGGTG AAGTACGTCG AGGAGGACGG CGCCGCGACC
GACAACAACG CGACGCTGTT CAAGCAGGGC TACGACAGCG TGCTGAAGGC GCAGACCGGC
TGGACCCAGG CCGGCGACCA GTCCGGCAAC TGGGACAACC CGGCCGGCAC CGCGCAGTCG
GTGTTCCAGA AGCTGCTGCA GGGCGCTCCG GACCTGAACG CGGTCATGGT CGCCAACGAC
GAGATGGCCA ACGCGGCCAT CACCGTCCTG AAGCAGCAGG GCCTCAACGG CAAGGTGGCT
GTCTCCGGCC AGGACGCGAC CGCGACCGGT CTGCAGAACA TCCTCAACGG CGACCAGTGC
TTCACGATCT ACAAGCCGGT CAAGGGCGAG GCCGACGTGG CCGTCAAGCT GGCCAGCCAG
GTCCTGTCCG GCCAGAAGCC GACCGCGCCG GCCGTGGTCC ACGACCCGAC CGGCAACCGT
GATGTCCCGT CCTACCTGGC GACCCCGGTC GTGGTGGACA AGTCCAACAT CACCCTGCCG
TTCACCGACG GCTACCAGAA GGCCGCCGAC GTCTGCACCG GCGACTTCGC CGCCAAGTGC
ACGGCGGCCG GCATCAAGTA G
 
Protein sequence
MRKAILAITA LGAAVALSAA GCSSSKSSSS GSTTGGGSST TSAAAGSSSS SSSSGTPTYK 
NNKVGILLPD TNSSPRWVNS DPDELKTQCA QYGLTCDIQN SNGSATTMTS QAQSMLNEGV
GVLMLTNLDS GSAKAIEAQA QAKGVVTIDY DRLTLGGTAQ YYVSFDNVAV GKAQGTALTK
CTQVAGKTAV KYVEEDGAAT DNNATLFKQG YDSVLKAQTG WTQAGDQSGN WDNPAGTAQS
VFQKLLQGAP DLNAVMVAND EMANAAITVL KQQGLNGKVA VSGQDATATG LQNILNGDQC
FTIYKPVKGE ADVAVKLASQ VLSGQKPTAP AVVHDPTGNR DVPSYLATPV VVDKSNITLP
FTDGYQKAAD VCTGDFAAKC TAAGIK