Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_8314 |
Symbol | |
ID | 8339693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9634922 |
End bp | 9636652 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644961400 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003118978 |
Protein GI | 256397414 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.5259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCA TCACCAAGAA GTCGCCGGCC ATCCTCGCGG TGCTGGCGAC GGCCGCCCTC GCGACGTCGG CCTGTTCCTC CAGCAAGAGC TCGTCCGCGG GCGGGTCGTC CTCCGGCGGC GGCAAGTCGA TCCCGGTCGC GACCGCCAAC GACATCAACG CCAAGGACGC CTCGACCCTC AAGGGCGGCA CCCTGACCCT CGCGATCGAC CAGTACTCGT CGCAGTGGAA CGGCATGACG AACAACGGCA ACGAGCAGGA CACCCAGAAG GTGCTGTCCA CGATGATGCC GCAGCTGTTC CACTTCGACG CCACCGGCAA GGCCACACCG AACGCGGACT ACCTGGTCTC CGCCGACGAG AGCACCGTCG CGGGCAAGCA GACGGTCACC TACAAGCTGA ACCCGAAGGC GAAGTGGTCG GACGGCACCC CGATCACCTA CAAGGACTTC GTCGCCACCT GGAAGGGCGA GGAGGGCTCG GCGGCCGGCT TCGACGTCGC GAGCTCCACC GGCTATGACC AGATGGCCTC GGTCGTGCGC GGCGCCGACG ACTTCACCGT CGTGGTGACC TACAAGACGC CGTTCTCGGA CTGGAAGTCC ATGTTCGACC AGTTCGACGG CGGCGGCCTG TACCCGGCCT CGAAGGTCTC CACGGCCGAC GGCTGGAACA AGTCCTACAT GAACGCCATC CCGGTCACCG CCGGCCCGTT CAAGCTCGAG GGCATGGACC CGACGAACAA GACCGTCTCG GTCATCGCCG ACCCGAACTG GTGGGGCGCC AAGCCGATCC TGAGCAAGAT CGTCTTCCGG GCGATCGAGG ACACCTCGGC CCAGGCCGAC GCCTACAACA ACCACGAGAT CGACGCCTTC GAGGTCGGCC CGCAGTCGGC GCTGTACGCG AAGATCAAGG ACACCACCGA CAGCACCGTG CACTACGCCG GCGGCCCGGA CTGGCGCCAC ATCTCGATGA ACACCCAGAG CCCGGCGCTG AAGGACGACG CCGTCCGCAA GGCGATCTAC CAGGCGCTGG ACCGCCAGCA GATCGCCGAC GTGGACCTGA AGAACCTCGG CACCTGGAAG CCCACGGTCC TGAACAACCG CTTCTTCGTG AACAACCAGA CCGGCTACCA GGACAACGGC GCCGACGTCG CCTTCAACCC GACCGCCGCC AACGCCGCCC TCGACGCCGC CGGCTGGGTC AAGGGCGGCG ACGGCATCCG CGCCAAGGGC GGCGTGAAGC TCAACCTGAA GTGGATCGAG CCCCAGGGCG TGAAGACCAC CTCCAACGAG GCCCAGATGG TCAAGGCGGA CCTCGCCAAG ATCGGCGTCG GCCTGGTGGA GACCCCGGTC AACAGCGACG ACTTCTTCGA CAAGTACATC AACACCGGCA ACTTCGACAT CACCGCCTTC GCCTACACCG GCAACCCCTT CCCGGTCAGC AGCGGCGCCC CCCAGATCCA GTCGGTGACC GACCCCAAGA ACATCCACAA CAACCCCCGC CTGGACTCCA ACCCGGCGAT CGACCAGGCC CTGACCAAGG CCCTGGAGGA CACCGACCCC ACCCAGGCCA TAGCCGACGC CAACGCCGCG GACAAGCTCG CCACCGACCA GGCCAGCCTG ATCCCGCTCT ACCAGCGCCC CCAGATCTGG GCCACCAAGA CGAACCTGGC CAACTTCGGC TCCTTCGGCT TCCAGGACTA CGACTGGACG AAGGTCGGCT TCACCAGCTG A
|
Protein sequence | MSSITKKSPA ILAVLATAAL ATSACSSSKS SSAGGSSSGG GKSIPVATAN DINAKDASTL KGGTLTLAID QYSSQWNGMT NNGNEQDTQK VLSTMMPQLF HFDATGKATP NADYLVSADE STVAGKQTVT YKLNPKAKWS DGTPITYKDF VATWKGEEGS AAGFDVASST GYDQMASVVR GADDFTVVVT YKTPFSDWKS MFDQFDGGGL YPASKVSTAD GWNKSYMNAI PVTAGPFKLE GMDPTNKTVS VIADPNWWGA KPILSKIVFR AIEDTSAQAD AYNNHEIDAF EVGPQSALYA KIKDTTDSTV HYAGGPDWRH ISMNTQSPAL KDDAVRKAIY QALDRQQIAD VDLKNLGTWK PTVLNNRFFV NNQTGYQDNG ADVAFNPTAA NAALDAAGWV KGGDGIRAKG GVKLNLKWIE PQGVKTTSNE AQMVKADLAK IGVGLVETPV NSDDFFDKYI NTGNFDITAF AYTGNPFPVS SGAPQIQSVT DPKNIHNNPR LDSNPAIDQA LTKALEDTDP TQAIADANAA DKLATDQASL IPLYQRPQIW ATKTNLANFG SFGFQDYDWT KVGFTS
|
| |