Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2124 |
Symbol | |
ID | 8333469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2409269 |
End bp | 2410471 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644955274 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003112884 |
Protein GI | 256391320 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | [TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACG TACTGTCCCG CCCGCGTACC CGCCGTGTCG GCGCGCTGAT CGCCGTCGGC GGCCTGACGC TGACCGGACT GACGGCTTGT GCCTCTTCCA AGAGCTCCTC CGGTGCCGCC GCCGGCAGCA CCGCCACCTC CGCCGCTGCT GCGGCCGCGT CCTCCTCCTG CCCGGCGCTC GGTGCCCCAG CGTCGACCGC GCCTGCGAAG GCCGACGGCT CCGGCGGGCA GGTGACCATC TACAGCGCCG ACGGTCTGTA CGACGCGAAG GATGACAAGA ACTGGTACAA CCAGGAATTC AAAAAGTTCA CCGCCCTGAC CGGCATCCAC GTGAACTACT CCGAGGACGG CTCCGGCGGC GTGGAGACCA AGGTCGACTC CGAGAAGTCG AACCCGAAGG CCGACGTCAT CGTGACCCTG CCGCCGTTCA TCCAGAAGGC TGAAGCCTCC GGGCTGCTGC AGGCGTACAG CCCGGCGTGT GTGGACAAGG TCGACCCCTC GCTCGTGGAC AAGAACGGCG AGTGGGAAGC GGTGATGGGC AACTACCTGT CCTTCATCTA CAACACCAAG GCGCTGCCCG ACGGCCCGCC GAAGACCTGG AACGACCTGC TGGACCCGAA GTTCAGCAAG AAGTTGCAGT ACTCGACGCC GGGTGTGGCC GGTGACGGGA CGGCGGTGAT GATCGCGGCG ATCCACGCCT TCGGCGACAA CCGCGACTCC GCCTGGAGCT TCTTCAAGCA GCTGCAGTCG AACAACGTCG GGCCGTCGAA GTCCACCGGC GCGCTGGAGA GCAAGGTCAA CACCGGCGAC CTGCTGGTGG CGAACGGCGA CGTGCAGATG AACTACGTCG ACAGCACGAC GCAGTACCCG AACAACAAGA TCTTCTTCCC GGCGGGCAAC GACGGCAAGC CAAGCACGTT CTCGCTTCCG TATATGGCGG GCTTGGTCAA GGGCGCGCCC CATGCCGACA ACGGCAAGAA GCTGATCGAC TTCCTGCTGT CCGAGGGCGC GCAGCTGGAC GCTTCCAAGG TGGCGTATGG CTTCCCGGCG CGTACCGACG TCAAGCCCAC GGACAGCAAC TACGCGGCTT TGAACGCGCT GCTTCAGGGC GTGACCGTCT TCCCGGTGGA CTGGAACGAG GTCGCGCAGA ACTACAACAG CGACGTCAAG GCGTGGGACA CCGCCACCGG CACGCCGAGC TGA
|
Protein sequence | MSHVLSRPRT RRVGALIAVG GLTLTGLTAC ASSKSSSGAA AGSTATSAAA AAASSSCPAL GAPASTAPAK ADGSGGQVTI YSADGLYDAK DDKNWYNQEF KKFTALTGIH VNYSEDGSGG VETKVDSEKS NPKADVIVTL PPFIQKAEAS GLLQAYSPAC VDKVDPSLVD KNGEWEAVMG NYLSFIYNTK ALPDGPPKTW NDLLDPKFSK KLQYSTPGVA GDGTAVMIAA IHAFGDNRDS AWSFFKQLQS NNVGPSKSTG ALESKVNTGD LLVANGDVQM NYVDSTTQYP NNKIFFPAGN DGKPSTFSLP YMAGLVKGAP HADNGKKLID FLLSEGAQLD ASKVAYGFPA RTDVKPTDSN YAALNALLQG VTVFPVDWNE VAQNYNSDVK AWDTATGTPS
|
| |