Gene Caci_4984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4984 
Symbol 
ID8336338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5701558 
End bp5702856 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content66% 
IMG OID644958083 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003115685 
Protein GI256394121 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000108283 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000897425 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGACAA GACGTGTTCG GCTGGCCTGC CTCACCCTTT CCGCCATCTC CCTGCTCACC 
GTCGCGCTGA CGGGATGCTC CTCCTCGAGC GGATCGGGAT CCGGCGCCAA GAACGTCGCG
CTGCGGATGA CCGTCTGGAG CAACGACAAA TCGCAGACAG CGCTGTTCAA CAGCATCGCG
GACAGCTACC TCAAGACCCA TCCCGACATC AAGTCGATCA CGTTCGACTA CCTCCCGATC
GGCAGCTACA CGACCGCGCT GACGACCCAG ATCGCCGGCA GCAGCCCGCC GGACATGGCC
TGGATCCTGG AGCGGGACGC GCCGGACTTC GTCTCCTCCG GAGCGCTCAC CGACGTCTCG
GCGGCCCTGC AGAACTCCCC CGGCTACCAG TACGGCGACC TGACCCCGGC AGCGACCAAG
CTGTGGACGC AGAATGGGAA GCTCTACGCC TACCCCTTCT CCACGTCCCC GTTCGGGATG
TTCTACAACA AGGACCTGCT GACCCAAGCC GGTGTGACCC AGACGCCGGA CCAGCTCGTC
GCCGCCGGCC AGTGGACGTG GCAGAACGCC GAGAAGATGG CCGCCCAGGT CGCGGCGCAC
ACCGACAAGC AGGGTCTGGT GATCCGGGAC TGGGACTACA AGACCTGGAT CGAGCTGGCG
AGCATCTGGC GCGGCTGGGG CGCCGACGCC TGGTCGGCCG ACGGCAAGAC CTGTGACTTC
GACGCCCCGC AGATGCAGCA GGCGATGACC TTCCTGCACA ACGCGATCTT CACCGACAAG
GCGCTGCCCG CACCGGGCCA GACCGCTGAC TTCTTCGCCG GTGAGTCCGC CATGACCGTC
ACTCAGATCA GCCGGGCTTC CCTGCTGGCC AAGCACCCGT TCAACTGGGG GATCGTGCCG
CTGCCGTCTG GTCCGACCGG GTCGGCGCAG GTCATCGGAC AGGCCGGCAT CGGTGTGATG
ACCAAGGGTT CGCACAAGCA GCAGGCGGCG GACTTCCTGG CCTACTTCAC CGACCCGGCC
AACTCCGCCA AGCTCGCCCA GTACTTCCCG CCGGCTCGTC AGAGCCAGCT CAACACCACG
ACCCTAGCCG CCGCGAATCC CCTGTTCACC CCGCAGCAAC TTCAGGATGT GGTCATCAAC
GGCATCAAGA CCGGCTCGGT GCTGCCGTCC CATGAGAACA GCGCCAAGCT CGCCACCCTC
GTGCAGAACG CCTTGGACCC GCTGTGGACG CCCGGAGCCA ACGTCGACTC AGTGCTCGCC
GGGGTGTGCA AGGCGATCGA CCCGGCTCTG AGCCAGTGA
 
Protein sequence
MKTRRVRLAC LTLSAISLLT VALTGCSSSS GSGSGAKNVA LRMTVWSNDK SQTALFNSIA 
DSYLKTHPDI KSITFDYLPI GSYTTALTTQ IAGSSPPDMA WILERDAPDF VSSGALTDVS
AALQNSPGYQ YGDLTPAATK LWTQNGKLYA YPFSTSPFGM FYNKDLLTQA GVTQTPDQLV
AAGQWTWQNA EKMAAQVAAH TDKQGLVIRD WDYKTWIELA SIWRGWGADA WSADGKTCDF
DAPQMQQAMT FLHNAIFTDK ALPAPGQTAD FFAGESAMTV TQISRASLLA KHPFNWGIVP
LPSGPTGSAQ VIGQAGIGVM TKGSHKQQAA DFLAYFTDPA NSAKLAQYFP PARQSQLNTT
TLAAANPLFT PQQLQDVVIN GIKTGSVLPS HENSAKLATL VQNALDPLWT PGANVDSVLA
GVCKAIDPAL SQ