Gene Caci_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2159 
Symbol 
ID8333504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2448589 
End bp2449656 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID644955309 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003112919 
Protein GI256391355 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGA TGAACGGCAG GCGCCACAAC AGAATGCGCT TGTGGGGAGC AGCGGCGGCC 
GTGGCGGCCG TCGGGCTCGC CGGGGCGTGC AGCAGCACCA AAGGCGAGTC CGGCGCCGTC
ACGGTGCCCC GCTCCGGGCA GGGCATAGGC CCGGGCGCGC TTCCCAACGC CCCGAGCGTG
CCGCCGAAGG CCTGCGGGCC GGAGGCCACC GCGGCGACGT GGAACGGACC GCTGCCCGGC
CCGAACGACC CGGTCCCCGC CGGCGGGACC CTGGACAAGA TCCGCAAGCG CGGGTTCCTG
ATCGCCGGCA TCGACCTGAA CACCGAGCTG TTCGGCTACG ACCCGCAGCA CGACAACAAC
CCGCAGGGCT TCGACGTCGA CATGGCCCGG CAGATGGCGC GCGCCATCTT CGGCTCCGAC
GGCCACATCC AGTTCCGCGT GGTCACCCTC GGCGATCCGA AGACCGGCGA ATACGCGCAG
CTGCACGCCG GCAACGTGGA CCTGGTGGTG CAGACCACGA CGATCACCTG CGCGCGCATG
CAGGGCGCGC AGCGGATGAG TTTCTCCAAC CCCTACTACA CCGCGCAGCT GAAGCTCCTG
ATGGCGCTCG GCGACGACGG CAAGCCGCAG AGCGCGTCCC TGGAAAGCCT CAAGGGCAAG
AACGTCAAGG TCTGCGCGAC GGCGAACTCC ACCTCGATCG GCGAGATCGG CCAGGTGCTC
GGCAAGACGA ACGCCTTCCC GGCGCCCAAC GCCTTGGACT GCCTGGCATA TCTGCAACAA
GACGAGGTCG GCGGGATCTT CACCGACGAC GCCATCCTGC TGGGCATGAC GCGCCAGGAT
CCGCACGTCG CGATGACCAC CGCTCCGGCG GAGGAGAAGC AGCCGTACGG CATCGTCACG
AACTATGACG CCGGCAAGGC GAACGACCTG ACGCCCTTCG TGAACACCGC GCTGGCGAAC
ATGATCCAGG ACTCCGGGCC GAACGGCTGG CGCTCACTGT TTGCAAAGGA TTTGGGTATC
CAGCCCACGT CCCTGCCGGA GATCCCGGCG CAGTACCCTC TGGGATAG
 
Protein sequence
MTTMNGRRHN RMRLWGAAAA VAAVGLAGAC SSTKGESGAV TVPRSGQGIG PGALPNAPSV 
PPKACGPEAT AATWNGPLPG PNDPVPAGGT LDKIRKRGFL IAGIDLNTEL FGYDPQHDNN
PQGFDVDMAR QMARAIFGSD GHIQFRVVTL GDPKTGEYAQ LHAGNVDLVV QTTTITCARM
QGAQRMSFSN PYYTAQLKLL MALGDDGKPQ SASLESLKGK NVKVCATANS TSIGEIGQVL
GKTNAFPAPN ALDCLAYLQQ DEVGGIFTDD AILLGMTRQD PHVAMTTAPA EEKQPYGIVT
NYDAGKANDL TPFVNTALAN MIQDSGPNGW RSLFAKDLGI QPTSLPEIPA QYPLG