Gene Caci_5593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5593 
Symbol 
ID8336953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6446912 
End bp6448615 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content66% 
IMG OID644958697 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003116293 
Protein GI256394729 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.38037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGCCG CGCTGGCGCT GGGCCTGGCC GCGTGCGGCG GTTCCTCGTC CGGCAAGTCC 
GGTACCGGCA GCACTGGCAG TTCATCGAAG AACATCAACA CCCAGCCCGG CAACGACATC
AACCCGCAGC CGCGCGAGAA GATCGCCGAC GGCGGCACGC TGCGCTGGCC GGAGGCCGGG
ATCTCCGACC AGCTGAACTA CAACGAGGTC GACGGCACCG ACGGCTCGGT GTCGGACATC
ATGGCCGCGG TGTTGGAGGA GCCGTTCTAC GCCGATGCCA AGGGCATCCC GCGCAACAAC
ACGAACCTGG TCGCCTCCTA CACGGTCACA CAGTCCCCGC AGCAGGTGGT GACGCTGGAG
ATCAACCCGA AGGCGGTGTG GCAGGACGGC ACGCCGGTCT CCGAGGCCGA CTTCGAGGCG
CAGTGGAAGG CGCTGAACGG CACCAACCAG GCGTTCCAGG TGTCCACCAC GGTCGGTTAC
AGCCTGATCC AGAGCATCAA GCCGGGCAAG AGCGACAAGG AGGTGGTGAT CACCTTCAGC
AAGCCCTACA GCGAGTGGCA GGGCCTGTTC TCCCCGATGT ACCCGGCGGC GACCAACAGC
GACCCGAAGA AGTTCGTGGC CGACTTCAAG GACGCGATCC CCACCAGCGA CGGTCCGTTC
AAGCTGGGCT CGATCGACCA GACCGCCAAG ACGCTGACCC TGGTCCGCAA CGACAAGTGG
TGGGGTGACC CGCCGAAGCT GGACAAGATC ATCTTCATGT CCATCGACCT CGACGCGCAG
ACCGACGCCC TGGCCAACAA CGAGGTCGAC CTGCAGTACG GGATCGGCTC GCACGTCTCG
TACTACGCGC GGGTGAAGAA CCTGCCGAAC GTCACCGTGC ACAAGGCCGC AGGCCCGGTG
TGGGCGAACC TGACCTTCAA CGGCGCCAGC GGCAGCCCGC TGTCCGACGT CCTGGTCCGC
AACGCCCTGA CCATCGGCAT CAACCGCAAG CAGATCGACA CCGCGCTCGT CGGCCCGCTG
GGGGTGAGCA CCGACCCGCT GAACAACCAC GTGCTCCTCA CCAACCAGCA GGGCTACCAG
GACAACTCCG GCGACCTGGG CAAGCAGGAC GCGGCGCGCG CCAAGCAGCT GCTGGACCAG
GCGGGCTGGA CCAGCACCGA CGGCGGCAAG ACCCGCACGA AGAACGGCAA GCCGCTGAAC
CTGCGCTTCG TCATCAACGC CACCAGCGAC TCCAACAAGC AGCTGGCGGA GATCGTGCAG
AACCAGCTGG CGGCGATCGG CGTGCAGATC ACGATCGTCC CGGTGCCCGG CGACGACTAC
TACACGAAGT ACGTCAACGT CGGCGACTTC GACATCGCGC AGGTCGTCTT CGGCGGCAAC
GCCTATCCGC TGAGCACCGC GCAGCCGGAG TTCGCCAACC CGACCACCGG CTCGGACGGC
ACGCTGAACA TCCAGCAGAA CTACGGCCGC ATCGGCGACC CGGCGATCGA CACGCTGTTC
ACCGAGGCGC TGAGCTCGCT GGACCGCACC AAGGCGGAGT CCTACGCGAA CCAGGCGGAC
GCGGCGATCT GGAAGCTGGA CACGGTCGTG CCGCTGTTCC AGCGCCCGCA GATCGTGGCG
ACGAACACGA AGCTGGCGAA CTACGGCGCG CCGGGCGTGC AGGACACGAT CTACGAGAAC
CTGGGCTTCG TGAGCGGGAG TTGA
 
Protein sequence
MAAALALGLA ACGGSSSGKS GTGSTGSSSK NINTQPGNDI NPQPREKIAD GGTLRWPEAG 
ISDQLNYNEV DGTDGSVSDI MAAVLEEPFY ADAKGIPRNN TNLVASYTVT QSPQQVVTLE
INPKAVWQDG TPVSEADFEA QWKALNGTNQ AFQVSTTVGY SLIQSIKPGK SDKEVVITFS
KPYSEWQGLF SPMYPAATNS DPKKFVADFK DAIPTSDGPF KLGSIDQTAK TLTLVRNDKW
WGDPPKLDKI IFMSIDLDAQ TDALANNEVD LQYGIGSHVS YYARVKNLPN VTVHKAAGPV
WANLTFNGAS GSPLSDVLVR NALTIGINRK QIDTALVGPL GVSTDPLNNH VLLTNQQGYQ
DNSGDLGKQD AARAKQLLDQ AGWTSTDGGK TRTKNGKPLN LRFVINATSD SNKQLAEIVQ
NQLAAIGVQI TIVPVPGDDY YTKYVNVGDF DIAQVVFGGN AYPLSTAQPE FANPTTGSDG
TLNIQQNYGR IGDPAIDTLF TEALSSLDRT KAESYANQAD AAIWKLDTVV PLFQRPQIVA
TNTKLANYGA PGVQDTIYEN LGFVSGS