Gene Caci_3334 details

Gene Information

Locus tag: Caci_3334
Symbol:
ID: 8334687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3683442
End bp: 3684599
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 71%
IMG OID: 644956479
Product: extracellular solute-binding protein
Protein accession: YP_003114082
Protein GI: 256392518
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
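
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record does not state these conventions explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Caci_3334
length = gene_length_bp(3683442, 3684599)
print(length)                     # 1158
print(protein_length_aa(length))  # 385
```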


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATCGAC GTCTGACGGG CATCGCCGCC ACCGCGGTCG TCCTCGCCCT GAGCGCGACC 
GCCTGCTCGT CGAGCAAGAG CTCTGCCAGC GGCAGCAGCG CGGCGGCCGG GGGCGCCGGG
AAGAGCGCCG CCAAGGCCAC CACCGCCGCC GATCTGGGCG GCATGGACGC TCTCGTCGCC
GCCGCGAAGA AGGAAGGCAC GCTGAACGTC ATAGCGCTGC CGAAGACCTG GGCCAACTAC
GGCGCCATCA TGGACGCCTT CACCGCCAAG TACGGCATCA AGATCACCGA CGCCAACCCG
GACGGCTCCA GCCAGGACGA GCTGAACGCG ATCAAGCAGC TGGCCGGCAA GAGCAGCGCC
CCGGACGTCG TGGACGTCGG CCAGGCCGCC GCCACCTCCG GTGCCGCCGC CGGCCAGTTC
GCGCCCTACC AGGTCGCCAC CTGGTCGCAG ATCGCCGACG CGCAGAAGGA CTCCCAGGGT
CTTTGGTACA ACGACTACGG AGGCTACATC GCCATCGGCT ATGACGCCGA CAAGGTGAAG
AACCCGCCGA CCACCCTGAA GTCCCTGGAC GACCCGCAGT ACAAGTCGCA GGTCGCCCTC
AACGGCGACC CGACGAAGGC CAACGCGGCG CTGTCCGGCG TCCTGGCCGC CTCGCTGGCC
AGCGGCGGCA GCCTGGACAA CGCCCAGCCC GGCATCGACT ACTTCGCCAA GCTGAAGTCC
GACGGCGTCT TCGTCCCGGT CGCCGCGACC CAGGCCACCA TCCAGTCCGG CACCACGCCG
ATCACCATCG ACTGGGACTA CCTGCAGGCC TCCGCCGCCT CCGACCTGAA GGCCAAGGGC
GTCACCTGGA AGGTCGTCGT GCCCTCCGAC GGTCTGTTCG GCGGCTTCTA CAGCCAGGCC
ATCAGCGCCA CCGCGCCGCA CCCGGCCGCC GCGCGCCTGT GGGAGGAGTT CCTGTACTCC
GCCGACGGCC AGAACCTGTG GCTCAAGGGC ATGGCCCGCC CGGCCGAGCT CCCGGCGCTG
CAGAAGGACG GCACCGCCGA CGCCACCGCG CTGGCCGCGC TGCCCGCCGT CACCGGTACC
CCGCAGTTCG CCACCCAGGA CCAGATGACC GCCGCGTCCA AGCTCGTGGT GGCCGGCTGG
GCGAAGGCGA CTGGCTGA
 
Protein sequence
MNRRLTGIAA TAVVLALSAT ACSSSKSSAS GSSAAAGGAG KSAAKATTAA DLGGMDALVA 
AAKKEGTLNV IALPKTWANY GAIMDAFTAK YGIKITDANP DGSSQDELNA IKQLAGKSSA
PDVVDVGQAA ATSGAAAGQF APYQVATWSQ IADAQKDSQG LWYNDYGGYI AIGYDADKVK
NPPTTLKSLD DPQYKSQVAL NGDPTKANAA LSGVLAASLA SGGSLDNAQP GIDYFAKLKS
DGVFVPVAAT QATIQSGTTP ITIDWDYLQA SAASDLKAKG VTWKVVVPSD GLFGGFYSQA
ISATAPHPAA ARLWEEFLYS ADGQNLWLKG MARPAELPAL QKDGTADATA LAALPAVTGT
PQFATQDQMT AASKLVVAGW AKATG
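
The protein sequence appears to be the translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes that table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, that an initiating GTG is reported as methionine, and that the terminal stop codon is dropped from the protein. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Compact amino-acid string in the usual order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon at position 1 is read as methionine
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a single trailing stop codon, if present
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 nt of the gene sequence (begins with the GTG start codon)
print(translate_cds("GTGAATCGAC GTCTGACGGG CATCGCCGCC"))  # MNRRLTGIAA
# Last 18 nt of the gene sequence (in frame, ends with the TGA stop codon)
print(translate_cds("GCGAAGGCGA CTGGCTGA"))               # AKATG
```

Both outputs match the corresponding ends of the protein sequence listed above. The sketch does not handle internal stop codons or ambiguous bases.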