Gene Caci_1178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1178 
Symbol 
ID8332513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1326870 
End bp1328600 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content66% 
IMG OID644954325 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003111944 
Protein GI256390380 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.412652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGCA ATACGTCGTC ACTGCTCGCG GTCGCCATCT CCATAGGCCT GGCGGCCACC 
GCGTGCTCCA GCACCAAGCA CACCTCGGAC TCCAGCACTG GGGGCGGCTC CAGCGCGTCG
TCGTCGGCCG GGGGCTCGTC GACGGGAACC CAGACGAGCT ACAAGCAGGG CGGGACGCTC
ACCATCTCGA ACGAGCAGGG CCAGACCTGG CCGTGTTCGT TCAACCCGTA CAACTCGACG
TTCAACCTGG AGAGCCTCGG CTTCATCTAC GAGCCGCTCA TCTACACCAA CATCCTGCAG
GACGCCAAGG AGACGCCGAT GCTGGCGTCG GACTACAAGT GGAACGCGGA CAAGACCCAG
ATCACCTTCA CCATCCGCGA CGGGGTGAAG TGGAGCGACG GCCAGCCGCT GACCGCCGAC
GACGTGGCCT ACACGTACAA CCTGATGAAG CAGACCCCGG CGCTGGACAA CTACTCGCTG
TGGTCCGCGG CCGGCCTGAC GAGCGTCGCG GCCACCGGCA ACCAGGTCAC GATGACCTTC
AAGCAGAACG CGCAGGTGTA CTTCTACACC TTCGCCACGC TGGTCGGCAT CGTCCCCAAG
CACATCTGGT CCACCGGCGA CGCCGCGGCG CACCCGGACA CCTGGACCGA CCCGAACCCG
ATCGGCTCGG GTCCGTACAC GGTCAAGTGC ACGCCGAACA ACATGGAGTA CAAGGCCAAC
GCCAGCTACT GGCAGCCCGG CAAGCCCTAC GTGACCACCC TGGAGTACCC GGCCTACCTG
GACAACGGCC CGGCGAACCA GGACCTGGCC AGCGGCAAGG CGCAGTGGGG CTCGCAGTTC
ATCACCGGCA TCAAGTCCTT CTACCTGAAC AAGTCGCAGG ACAACCACAC CTGGTCCCCG
CCGGTGCTCA ACGTCTCGAT CATCCCGAAC CTGGACCCCT CGCACGCGGC GACCAGCAAG
CTCGGCGTCC GCCAGGCGAT CGCCTACGCG ATCGACAAGG CCAAGGTCTC GGCGATCGGT
GAGGACGGCC AGCAGCTGAC GGGCAACCAG AGCGGCGTGG TCACCCCGAC CTTCGACAAG
TTCAACGACG CGGCGGCGAT CTCCGCGGCC GGGTACGACA AGCCGGACAT GGCCAAGGCC
GCGGCGGCGC TGCAGGGTGC CGGGTACAGC CCGAGCAACC CGCTGAACCT GACCATCATC
TCGATCCAGG GCTACACCGA CTGGGACGCC TCGATCGCGA TCATCAAGGA CGAGCTCAAG
CCGCTGGGCA TCAACCTCAC CGAGAGCTCG CTGACCAACC AGACGTACTA CGACAAGCTC
TACAAGGGCG ACTTCGACCT GGCCTACGGC TCCCAGCCCT CCGCCGGACC GTCTCCCTAC
ACCGAACTGC GCGCGTGGCT GCACTCGGCC AACACCGCGC CGCTGGGCCA GAGCGCGTCG
GCGGGCAACT TCGAGCGGTA CAAGAACCCC GCCGTGGACA GCCTGCTCGA CCAGTACGCC
ACGGCCGGCT CGGAGGACCA GCAGGTCTCG ATGATCAAGC AGGTGAGCCA GCACGTGCTG
CAGGACCTGC CGTTCATCCC GGTGACCGAG TCCGCGGACT GGTTCCAGTA CAACACCAAG
AACTTCGGCG GCTGGCCGAC GTCGGACAAC CCGTACGCGC AGCCCGCGGC CTACAACTAC
CCGGACAACG AGCAGGTGCT GCTGCACCTG TACTACAAGC CGGCCCAGTA A
 
Protein sequence
MRRNTSSLLA VAISIGLAAT ACSSTKHTSD SSTGGGSSAS SSAGGSSTGT QTSYKQGGTL 
TISNEQGQTW PCSFNPYNST FNLESLGFIY EPLIYTNILQ DAKETPMLAS DYKWNADKTQ
ITFTIRDGVK WSDGQPLTAD DVAYTYNLMK QTPALDNYSL WSAAGLTSVA ATGNQVTMTF
KQNAQVYFYT FATLVGIVPK HIWSTGDAAA HPDTWTDPNP IGSGPYTVKC TPNNMEYKAN
ASYWQPGKPY VTTLEYPAYL DNGPANQDLA SGKAQWGSQF ITGIKSFYLN KSQDNHTWSP
PVLNVSIIPN LDPSHAATSK LGVRQAIAYA IDKAKVSAIG EDGQQLTGNQ SGVVTPTFDK
FNDAAAISAA GYDKPDMAKA AAALQGAGYS PSNPLNLTII SIQGYTDWDA SIAIIKDELK
PLGINLTESS LTNQTYYDKL YKGDFDLAYG SQPSAGPSPY TELRAWLHSA NTAPLGQSAS
AGNFERYKNP AVDSLLDQYA TAGSEDQQVS MIKQVSQHVL QDLPFIPVTE SADWFQYNTK
NFGGWPTSDN PYAQPAAYNY PDNEQVLLHL YYKPAQ