Gene Caci_8321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8321 
Symbol 
ID8339700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9645092 
End bp9647023 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content66% 
IMG OID644961407 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003118985 
Protein GI256397421 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.860873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCA GAAATACCCG AAAAATCGTG GCGATAGCGG GTGTGCTCGC TGTCCTCGCG 
TCCGCCGCCG CGTGTGGGAG CTCGAAGAAG AGCTCCAACG GCAACGGCGC GACGAACCCG
GTCACGTCGG CCAGCAGCTC GTCTGCACCG GCCAAGCAGG GCGGCGTCGC ACACGTCGCG
GAGTGGCCGG CCGGCTCGAG CCCGGACGCG ATCTGGCCGT TCATGAGCAG CGAGCAGCTG
AGCACTCAGA ACGCGGGCCA GTTCCAGTAC TACTTCTACC GCCCGCTGTA TTTCGTCGGC
CTCAACGACA AGCTCGCGGT CAACTACGAC ATGGGTCCGG CGGAGAAGCC GACCTGGAGC
GCGGACGGCC TGACCATCAC GGTTCCGCTG AAGTCGACCT GGAACTGGAG CAACGGCGAG
AAGGTCACCG GCCAGGACGT CCAGTTCTGG CTGAACATGA TGAAGGCCGA GGAGAAGAAC
TCGGGCTACT ACAGCCCGCC GAACGCGGCC GCGGCGGTCA ACTACCTGCC GGACAACGTC
AAGTCCACGA CGGTCAGCGA CTCCAGCATC AGCATCACGT TCGACCAGCA GTACAACCAG
AACTACATCG TCGGCAACGC CCTGCAGACG GTCACGCCGA TGCCGCTGGC CTGGGACGTC
ACCGACGGCA ACGGGACCAA GGGCAAGTGT TCGACGGACA CCCTGACCTC CCCGACCCTG
CAGGCCGACT GCGACGCGGT CTGGAAGTAC ATGAACGCGG CCGGCAAGGA CGTCAAGACT
TTCGCCAGCA ACCCGCTGTG GAAGATCGTC GACGGCCCGT GGGTTCTGAA GGACTTCAAC
GCCACCTCCG GCGGGTTCTC CATCGTGCCG AACACCAAGT TCACCGGCGA GCACAAGCCG
GTCCTGGACG AGGTCGACTT CGTGCCGTTC CAGAGCCAGG ACGCGGAGTG GACGGCCCTG
AAGGCCGGCT CGACCGCCGC GAACTCGCTG CAGATCGGCG TGTTCCCGAA CGCCGACTCC
CCGCAGTACA ACGGTGACAA CCTGCAGGCG GGCAACCCGC TGCTCTCGGC CGGGTACGAC
GTCGAGAAGG GTCCGCTGCT GGACTCGATC GGCTACTACC AGGTGAACTT CGGCTCCAAG
AACCACGGGA ACCTGTTCAA GCAGCCGTAC TTCACCAAGG CGCTGCAGGA CGACATGGAC
CAGACCGGCG CCATCAAGGG CCCGTACAAG GGGTGGGGCT ACCCGACCAC CGGCATCGTG
CCCGGCTACC CCGACGGCAA CGTTCTGTCC CCGGCCGCCA AGGCCGCCGC GGCGACCTTC
AACCCGACCG AGGCCAAGTC GCTGATGCAG GCGCACGGCT GGGATCTGTC GACCACCCCG
GCCACCTGCA AGACCCCCGG TACCGGTGAC AACCAGTGCG GCGCGGGCAT CAACGCCGGC
GACAAGGCGG AGTTCACGCT GGAGTACCCC TCGGCGCACT CGGCCATGGA CACCATGCTG
GCCTCCTACA AGCAGACCGC GGCCCAGTCC GGCATCGGGA TCACCCTGAC CACCAAGACC
CAGAACACCC TGGGCGGCGA GCTGGTCGGC TGCGACCCCA GCACCCCGGC GGGCTGCCAG
TGGGACGCGA TTCTTTACGG CGGCTGGGTG TTCTCGCTGA ACCCGACCGC GGACTCGCTG
CTGACCACCG GCGCCGGCTC GAACATCTTC GGGTTCTCCG ACCCGAAGTT CGACGCCGCG
GTGGCCAAGA CCATCAAGAG CAGCGACCCG CAGGCCTGGT ACGACTACGA GGCCTACGCC
TCCAGCATCT CCCTGCCGCT GATCTTCATG AACAACGACA TCTGGCCGTT CGCCGTGGCG
AAGAACTTCC ACGACTCCGG TCAGGACGCG TTCCAGGGCT TCGAGCCTGA GTTCTGGTAC
TACACCCAGT GA
 
Protein sequence
MAVRNTRKIV AIAGVLAVLA SAAACGSSKK SSNGNGATNP VTSASSSSAP AKQGGVAHVA 
EWPAGSSPDA IWPFMSSEQL STQNAGQFQY YFYRPLYFVG LNDKLAVNYD MGPAEKPTWS
ADGLTITVPL KSTWNWSNGE KVTGQDVQFW LNMMKAEEKN SGYYSPPNAA AAVNYLPDNV
KSTTVSDSSI SITFDQQYNQ NYIVGNALQT VTPMPLAWDV TDGNGTKGKC STDTLTSPTL
QADCDAVWKY MNAAGKDVKT FASNPLWKIV DGPWVLKDFN ATSGGFSIVP NTKFTGEHKP
VLDEVDFVPF QSQDAEWTAL KAGSTAANSL QIGVFPNADS PQYNGDNLQA GNPLLSAGYD
VEKGPLLDSI GYYQVNFGSK NHGNLFKQPY FTKALQDDMD QTGAIKGPYK GWGYPTTGIV
PGYPDGNVLS PAAKAAAATF NPTEAKSLMQ AHGWDLSTTP ATCKTPGTGD NQCGAGINAG
DKAEFTLEYP SAHSAMDTML ASYKQTAAQS GIGITLTTKT QNTLGGELVG CDPSTPAGCQ
WDAILYGGWV FSLNPTADSL LTTGAGSNIF GFSDPKFDAA VAKTIKSSDP QAWYDYEAYA
SSISLPLIFM NNDIWPFAVA KNFHDSGQDA FQGFEPEFWY YTQ