Gene Caci_4557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4557 
Symbol 
ID8335911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5185803 
End bp5187515 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content70% 
IMG OID644957658 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003115260 
Protein GI256393696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGAA TAGCCGTCCT TTCGGCTGCC CTGGCGCTGG TCGCCGCGGG GGCGGCCGCG 
TGCTCGTCGT CAGCGGGCCA CACCTCGACC GGCACCGCCG GTTCGGCCGC CTTCGACCCC
AAGACCTGCC AGGGCGGCAC CCTGGAAGTC CTGAATCAGG ACAGCATCAG CAAGATGGAC
CCGGCGCGGA TCTACACCTC CGGCGGCGGC AACATCCCCT CGCTACTGTT CCGCACGCTC
ACCACGCGCA ACCGGCAGCC CGGGCAGGAC GGCGCCAAGC CCGCGCCGGA CCTGGCCACC
GACCTCGGCA CGCCGAGCGA CGGCGCGAAG ACGTGGACCT ACCACCTTAA GAGCGACATC
TTCTTCGAGG ACGGCACGCC GATCACCGCG CAGGACGTCA AGTACGGCAT CGAGCGCTCC
TTCGCCCCCG AACTGCCCGG CGGCGCCCCG TACCTGCGCG ACTGGCTGGT CGACGCCTCG
AACTATCAGG GCCCGTACAA GGACCCGAAC GGTATCGCGG CGATCGAGAC CCCGGACGCC
AAGACGATCA TCTTCCACCT GCGCAAGCCC GAGGGCGACT TCCCGCTGCT GGCCACCGCC
ACGCAGTTCG CCCCGGTGCC CAAGGCCAAG GACACCGGCG TCAACTACGA CAAGCACCCG
ATCTCCTCCG GGCCCTACAT GGTCGCCAGC TACGACAAGG GCAAGACGCT GGACCTGGTC
CGCAACCCGC ACTGGTCCGC CGCCAGCGAC CCGCTGCGCT ACGCGTGCCC GGACAAGATC
GACGTCACCT CTGGTCTGAA CCCCGCGGTC ATCAACCAGC GCATCGCCAC CGGCAGCGGC
CAGGACGCGA ACGCCGTCAC CACCGACGCC ACCATCGGCC CGGACCAGCT GGCGCAGCTG
AACAGCAACC CCTCGCTGGC CAAGCGGGTG GCGCGCGGCG AGTTCCCGGC GACGACCTAC
CTGGCGTTCA ACACCAAGGT GAAGCCGTTC GACGACATCC GCGTCCGCGA GGCGGTCTCC
TACGCGATCA ACCGCACCAC CGCGGTCAAC GCCGCCGGCG GCACCGCCGT GGCCGGCGCC
TCGACCACCT TCCTGCCGCC GCAGAAGGCG CTGGGCTACC AGCCCTACGA CGACTTCCCG
GCCGGCGCGA CCGGGGACGC GGCGAAGGCC AAGGATCTGC TGGCGCAGGC GGGGTATCCC
AACGGCCTGA CGATCACGCT GTTCCACCAG TCCGACGACG CCAACAACCT CGGGCCGAAG
GAGGCCACCG CGATCCAGGA CGCGCTGAAG GCGGCCGGCA TCACGGTGAA GCTGAACCCG
GTGGACGACG ACAGCTACCA GGACGTCACC GGCAAGCCCA GCAGCGAGCC CGGCGTCTCG
CTGCAGTACT GGGGCGCCGA CTGGCCCTCC GGCGCGCCGT TCCTGATCCC GATCTTCGAC
GGCCGCGAGA TCATCGACAG CGGCGGCAAC TTCAACATGG CGCAGCTGAA CGACCCGGGC
GTGAACGCCG AGATCGACGC GATCAACGCG ATCACCGACC CGGCGCAGGC GCAGGCCCGC
TGGGGCGCGC TGGACGCCAA GCTCGGGCAG CAGGCGCTGA CCGTGCCGCT GTTCTACGAG
AAGGACGTCT ACCTGTTCGG CAAGAACGTC AAGGACGCGG TGCCGGACGG CTGGCGCGGC
CAGTACGACC TGGCCCGCGT GTCGGTCAAG TAA
 
Protein sequence
MRRIAVLSAA LALVAAGAAA CSSSAGHTST GTAGSAAFDP KTCQGGTLEV LNQDSISKMD 
PARIYTSGGG NIPSLLFRTL TTRNRQPGQD GAKPAPDLAT DLGTPSDGAK TWTYHLKSDI
FFEDGTPITA QDVKYGIERS FAPELPGGAP YLRDWLVDAS NYQGPYKDPN GIAAIETPDA
KTIIFHLRKP EGDFPLLATA TQFAPVPKAK DTGVNYDKHP ISSGPYMVAS YDKGKTLDLV
RNPHWSAASD PLRYACPDKI DVTSGLNPAV INQRIATGSG QDANAVTTDA TIGPDQLAQL
NSNPSLAKRV ARGEFPATTY LAFNTKVKPF DDIRVREAVS YAINRTTAVN AAGGTAVAGA
STTFLPPQKA LGYQPYDDFP AGATGDAAKA KDLLAQAGYP NGLTITLFHQ SDDANNLGPK
EATAIQDALK AAGITVKLNP VDDDSYQDVT GKPSSEPGVS LQYWGADWPS GAPFLIPIFD
GREIIDSGGN FNMAQLNDPG VNAEIDAINA ITDPAQAQAR WGALDAKLGQ QALTVPLFYE
KDVYLFGKNV KDAVPDGWRG QYDLARVSVK