Gene Caci_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1289 
Symbol 
ID8332624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1463641 
End bp1465473 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content66% 
IMG OID644954436 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003112055 
Protein GI256390491 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.172232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCT CCAAGCTGTT GGGAGGCGTC GCGCTCGCCG CGTGCGTCGC CCTGGCAGCC 
TCGGCGTGCT CCAGCTCGAA GACGAACTCC GGCTCGAGCG CGCCGACGAC TGCCGTCACC
ACCCTGCCTT CGGGCAACAC CAGCGCCGCC ACCGGCTTCA ACGCCGCCGT GACCGGCATC
GTCCGCCCGA CAGACGCCGA GGGCGGCACC ATGACACTGG TGGACCGCTC CGACTTCGAC
TCGCTCGACC CGGGCAACAC CTACGACGCC TTCTCCTGGT CGATCATCGG CGACTGGGCT
CGTCCGCTGA TGACCTACGC CCAGCAGCCC GGCAACGCCG GCGCCAAGGT CGTGCCGGAC
CTGGCCGAGG GTCCCGGTGT CGTGTCCAAC AACGGCATGA CCTGGACCTA CAAGATCAAG
CCGGGCGTGA AGTACCAGGA CGGCTCCGTG GTCACCTCCG CGGACGTCAA GTACGCGATC
GAGCGCTCCA ACTGGGGCCA GGACACCCTG GTGAACGGTC CGGCGTACCT GCCGAACTTC
ATCCAGGACA CCACGAAGTA CGCCGGTCCC TACAAGGACA AGAACCCGAA CGACGGCGTC
TCCGGCATCA CCACGCCGGA CAACCAGACC ATCGTGTTCA ACCTGACCTC GCCGTTCTCG
GACTTCGACT ACCTGATGGC GCTCCCGGGC TCGGCCCCGG TGCCGCGGGC GAAGGACACC
GGCGCGGACT ACTTCAAGAC GCTGCTGTCG ACCGGTCAGT ACAAGGTCGA CAACTACCAG
GTCGGCAACG AGCTGGACCT GTCGCCGAAC CCGAACTTCG ACAAGTCCAC GGACCCGGAC
AAGCTGCACG TCGTCCGCGC CTCGAAGATC GTGGTCAAGC TCAAGCAGGA CAAGGCGACG
ATCGACGACA CGCTGTTCGA CGGCTCGGCC AACGCCGACC TGACCGGCGT GGGCGTGCAG
CCGGCCACCC AGAGCAAGAT CCTGGGCGAC CCGAAGCTCA AGGCCGACTC GGACTCCGCG
TACGCGAACT CCACTGAGTA CTTCGCGATC AACGCGACTC AGAAGCCGTT CGACAACGTG
AACTGCCGCC AGGCCATCGA GTGGATCATC GACAAGGCCA CGCTGCAGAC CGAGGCCGGC
GGCTCGCAGG GCGGCGGCGA CATCGCCAGC ACCATCGACC CGCCGACGAT CCCGGGCTGG
AAGGCCGGCG ACCAGTACCT GACGGCGGGC AACAAGGGCG ACGCCACCAA GGCGAAGGCC
TCGCTGGCCC AGTGCAAGAC CGCTGAGCCG GACGCCTTCA ACGCCGACGG CAGCATCAAG
GACACCTTCG AGATCATGAC TCGCGACAAC TCCACCAAGG AGGCGACCAT GGTCCAGACG
ACGCAGACCA ACCTGAAGTC GATCGGCATC AACACCACCA TCGACACCAA GCCGTTCGAC
AAGTACAACT CGCAGTTCGC CGGCAACAAG ACGTACGTGG ACCAGCACAA GGTCGCCATC
AGCTTCATGA AGTGGGGCGC CGACTTCCCG TCGGGCTACG GCTTCATGTA CAGCCTGCTG
GCCTCCTCCG CGATCCACCC GTCCGGCGGC TACAACCTGT CCTGGTACAA GGACGACGCG
ATCGACCAGG GCTTCACCAA GGCTCTGGGC GAGAACGACC CGACCGCCCG TGGCGCCGAC
TACGCGGCGA TCGACCACCA GGCGCTGGCC GACGCGCTGG TCGTCCCGCT GATCTGGGAC
AAGAACCTGG TCTACCGGCC GGAGTCGACG ACCAACGTCA TCTTCAGCCA GGGCTACGGC
ATGTACATCT TCTCCGCGAT GGGCGTGAAG TAA
 
Protein sequence
MNRSKLLGGV ALAACVALAA SACSSSKTNS GSSAPTTAVT TLPSGNTSAA TGFNAAVTGI 
VRPTDAEGGT MTLVDRSDFD SLDPGNTYDA FSWSIIGDWA RPLMTYAQQP GNAGAKVVPD
LAEGPGVVSN NGMTWTYKIK PGVKYQDGSV VTSADVKYAI ERSNWGQDTL VNGPAYLPNF
IQDTTKYAGP YKDKNPNDGV SGITTPDNQT IVFNLTSPFS DFDYLMALPG SAPVPRAKDT
GADYFKTLLS TGQYKVDNYQ VGNELDLSPN PNFDKSTDPD KLHVVRASKI VVKLKQDKAT
IDDTLFDGSA NADLTGVGVQ PATQSKILGD PKLKADSDSA YANSTEYFAI NATQKPFDNV
NCRQAIEWII DKATLQTEAG GSQGGGDIAS TIDPPTIPGW KAGDQYLTAG NKGDATKAKA
SLAQCKTAEP DAFNADGSIK DTFEIMTRDN STKEATMVQT TQTNLKSIGI NTTIDTKPFD
KYNSQFAGNK TYVDQHKVAI SFMKWGADFP SGYGFMYSLL ASSAIHPSGG YNLSWYKDDA
IDQGFTKALG ENDPTARGAD YAAIDHQALA DALVVPLIWD KNLVYRPEST TNVIFSQGYG
MYIFSAMGVK