Gene Caci_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0477 
Symbol 
ID8331804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp542651 
End bp543949 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content66% 
IMG OID644953643 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003111270 
Protein GI256389706 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.838392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGA GAATGCGCAG CGCGATAGCC GTCTTCCTCG GCTTGGGCAC GATGCTCGCC 
GCCACCGGCT GTGCCGGCAG CAGCGGCGGC GGGGGCGGCG GTTCGTCCGA CGGCAAGGTG
ACGGTGACCG TCTGGGAGAA CGCGACGAAC GGCCCGGACG GCCTGCAGTA CTTCCAGAGC
GCCGCGAAGC AGTACCAGGC CCTGCACCCG AACGTCACGA TCTCGTTCCA GACGATCCAG
AACGAGGCAC TCGACGGCAA GCTCCAGACC GCCCTCAACT CGAACAGCGC GCCGGACGTC
TTCTTCCAGG TCGGCGGCGG CAAGATGCGG GCCCAGGTCG CCGCCGGCGA ACTCCAGCCG
CTGAACCTCA CCGACGCGGA CAAGACCGAC GTCGGCGCGG CGGCCCTGTC CGGCAGCACG
CTCGACGGCA AGGTCTACAT GATGCCGGTC GACACGCAGC CCGAGGGCAT CTACTACAGC
AAGAACCTGT TCCAGCAAGC CGGCATCACC ACGACGCCCA CGACGATCGA CGAGCTCGAA
GCCGACGTCG CCAAGCTCAA GGCAATCAAC GTCGCACCGA TCGCAGTCGG AGCCAAGGAC
GCCTGGCCCG CCGCGCACTG GTACTACAAC TTCGCCCTCC GCGAGTGCAG CCAATCCGTC
ATGGCGAGCA CCGCCAAGTC GCTCAAGTTC ACCGACCCAT GCTGGACCAC AGCCGGAAAC
GCCCTGGCCA CATTCCTCAA GACCAACCCC TTCCCAGCCG GCTTCCTGAC AACCGCAGCC
CAGCAAGGCG CCGGCTCCTC AGCGGGCCTA CTCGCCAATC ACAAGGCAGC CATGGAGCTC
ATGGGCTCCT GGGACCCCGG CGTAATCGCC AGCTTGACCC CGGACCAGAA GCCGCTCCCC
GACCTGGGCT GGTTCCCGTT CCCCGCAGTA GCCGGCGGCC AAGGCGACCC CTCCGCAATC
ATGGGCGGCA ACTCCGCCTA CTCGCTGTCC AAGAAGGCAC CAAAGGAGGC CTTCGGCTTC
CTGGAGTTCA TGCTGACCAA GGACCAGCAG GAGGCATACT CCAAGGCCTT CCAATCAATC
CCGGTGAACC CGGCGTCCCA GGACGTCGTC ACCACCTCCT ACAACATCTC AGCACTGCAA
GCCTTCAACA AGGCCGCCTA CTCAATGCAG TACCTCGACA CCCAGTTCGG CCTAAACGTC
GGCAACGCCC TAAACACCGC CGTCGTCAAC CTCATGGCCG GCCGAGGCAG CGCCGCCGAA
ATCGTCACGC AGGCCAACGC CGCCGCGGCG AAGGGCTGA
 
Protein sequence
MTTRMRSAIA VFLGLGTMLA ATGCAGSSGG GGGGSSDGKV TVTVWENATN GPDGLQYFQS 
AAKQYQALHP NVTISFQTIQ NEALDGKLQT ALNSNSAPDV FFQVGGGKMR AQVAAGELQP
LNLTDADKTD VGAAALSGST LDGKVYMMPV DTQPEGIYYS KNLFQQAGIT TTPTTIDELE
ADVAKLKAIN VAPIAVGAKD AWPAAHWYYN FALRECSQSV MASTAKSLKF TDPCWTTAGN
ALATFLKTNP FPAGFLTTAA QQGAGSSAGL LANHKAAMEL MGSWDPGVIA SLTPDQKPLP
DLGWFPFPAV AGGQGDPSAI MGGNSAYSLS KKAPKEAFGF LEFMLTKDQQ EAYSKAFQSI
PVNPASQDVV TTSYNISALQ AFNKAAYSMQ YLDTQFGLNV GNALNTAVVN LMAGRGSAAE
IVTQANAAAA KG