Gene Caci_4419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4419 
Symbol 
ID8335773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5019145 
End bp5020515 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID644957522 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115124 
Protein GI256393560 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATAT CCCGTCGACG CGGCGTCCTG CTGGGCGCGA CCGCGCTGGC CGTGGCGCTG 
GCCGCGACCG CGTGCTCCAG CTCGGCGAGC AGCGGTGGCA GCGGCAAGAC CTCCTCCGGC
TCCTCGCAGG CCGCCACCGT CACCGACGCC GACCTGCAGG CGGCGCTGAC CGCCGGCGGG
AACCTGACGG TGTGGGCCTG GGAGCCGACC TTGAAGAAGG TCGTCGCCGA CTTCCAGACC
AAGTACCCGA ACGTGCACGT CAACCTGGTC AACGCCGGGA CCGGCAACGA CGAGTACAAG
GCGCTGCAGA ACGCGGTCCA GGCCGGCAAG GGCGTCCCGG ACGTCGCGCA CATCGAGTAC
TACGCGCTGC CGCAGTTCGA GCTGACCAAG TCGGTGGCGA ACCTGGACGA GTTCGGCGCC
GCCGCGCTGA ACGGCACGTT CACCCCCGGG CCGTGGAGCT CGGTCCAGGC CGCCGGCGGC
GTCTACGGCC TGCCGATGGA CTCCGGACCG ATGGCGCTGT TCTACAACCA GACGGTCTTC
ACCAAGTTCG GCATCACCAC GCCCCCGGCC ACGTGGGACG AGTACATCGC CGACGCCAAG
AAGATCCACA CCGCCGACCC CAGCGTCTAC ATGACCAACG ACACCGGCGA CGCCGGCTTC
ACCACCAGCA TGATCTGGCA GGCCGGCGGC AAGCCCTACT CGGTCAGCGG CACCACCCTC
GGTGTGAACT TCGCCGGCGA CGCCGGCACG CAGAAGTTCG CGACCGCCTG GCAGCAGCTG
CTGGACGGCC ACGACCTGGC GCCGATCAGC TCCTGGAGCG ACGCCTGGTA CCAGGGCATG
GCCTCGGGCA AGATCGCCTC GCTGACCATC GGCGCCTGGA TGCCGGCCTC CCTGGAGTCC
GGCGTGAAGT CCGGCTCCGG CCAGTGGCGC GTCGCCCCGA TGCCGCAGTG GACCGCCGGG
GGCAAGGTCA CCTCCGAGAA CGGCGGCAGC TCCCTGGCCG TGATGAAGGC GAGCACCAAC
CAGAAGCTGG CCTACGCGTT CCTGAAGTAC GCCACCGTGG ACGAGGGCGC GCAGACCCGC
GTGGACAACG GCGCCTTCCC GGCCACGGTG AAGCAGCTGA ACTCCCCGGA CTTCCTGAAC
AAGACCGACG CCTACTTCGG CGACCAGAAG ATCAACCAGG TGCTCGCACA GAGCGCCGCC
GAGGTCGCCC CGGGCTGGTC CTACCTGCCC TTCCAGGTCT ACGCCAACAG CGTCTTCAAC
GACACCGCCG GCAAGGCCTA CATCGGGTCC TCCTCCCTGG CCGACGGGCT GAAGGCCTGG
CAGGACGCCT CGATCAAGTA CGCCAAGGAC CAGGGCTTCA CCGTCAAGTA G
 
Protein sequence
MTISRRRGVL LGATALAVAL AATACSSSAS SGGSGKTSSG SSQAATVTDA DLQAALTAGG 
NLTVWAWEPT LKKVVADFQT KYPNVHVNLV NAGTGNDEYK ALQNAVQAGK GVPDVAHIEY
YALPQFELTK SVANLDEFGA AALNGTFTPG PWSSVQAAGG VYGLPMDSGP MALFYNQTVF
TKFGITTPPA TWDEYIADAK KIHTADPSVY MTNDTGDAGF TTSMIWQAGG KPYSVSGTTL
GVNFAGDAGT QKFATAWQQL LDGHDLAPIS SWSDAWYQGM ASGKIASLTI GAWMPASLES
GVKSGSGQWR VAPMPQWTAG GKVTSENGGS SLAVMKASTN QKLAYAFLKY ATVDEGAQTR
VDNGAFPATV KQLNSPDFLN KTDAYFGDQK INQVLAQSAA EVAPGWSYLP FQVYANSVFN
DTAGKAYIGS SSLADGLKAW QDASIKYAKD QGFTVK