Gene Caci_3841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3841 
Symbol 
ID8335194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4348728 
End bp4350479 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content69% 
IMG OID644956977 
Productsulphate transporter 
Protein accessionYP_003114580 
Protein GI256393016 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.723364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0852274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCGAC GCGGCCGCCA CGAGGCCAGG GCCTGGCAGA AGCGGGCGGC GATTGAGGTG 
ACGGAATCCG GGAGGGCTGA GCGTGGTGAC CGGGACTGGC CGGTCGCCGC CTGGCTCCGC
GGCTATGACC GGCGCCGTTT GACACCGGAC ACCATCGGGG CGCTGACGGC TTGGGCGCTG
ATCGTGCCTG AGTGCGTGGC CTATGCGCAG ATCGCCGGCG TGCCGGTCCA GAACGCGTTT
TACGGGGCGC CTGTCGCGCT GCTCGCCTAT GCGTTGTTCG GGCGCAGCAA CCACCTGATC
GTCGGAGCGA CCTCGGCTGC CGCGGTGTTG TCGGCGTCCA CGGTCTCGGC GGTGTCGAGC
GATCCGGCCA AGGCCGCGAC GCTCTCGACG GCGCTGGCAC TGATCGCCGG CGCCGTGTTG
ATCGCAGCCG GTGTGGCCCG GCTCGGCTTC GTCGCCGACT TCCTGGCCGA GCCGGTACTG
CTCGGGTTCC TGTTCGGCAT GGCGTTGACC ATCATGGTCA GGCAACTGGG CAAGATCGCC
GGTGTCCCGA CCGGAGACGG GGACTTCTTC CAGCGGCTCT GGCACGTGTT CGGGCACGCT
TCGGACTGGA GCTGGACCTC GATCGCGGTC GGCGTGATCG CGATCGCCGC GTTGCTGGCG
CTGGAACGGT CGGTGCCCAA GCTGCCGGCG GCATTGGTGG TCCTGCTGGC CGGGATCGCG
GTCTCCGCGG CGCTGAATCT CCAGGACCAT GGCGTCGATG TGGTCGGGAA GATCCCGCGG
GCCGTGCCGA CGCCGTCGTG GCCGCACCTG TCGTCCTCGG ATTGGACGGC GCTGGTCGCC
GGTTCGTTCG GCGTCGCGCT GATCGTGTTC GCCGAGAGCT TCAGCATCGC CAAGGGACTG
GCCGCCAAGC ACGGCGAGCG TGTGGACGCC GGCCGGGAGA TGACGGCGAT GGGCGCCGCG
AACGCCGCCG TCGGCCTGTT CCGAGGCTTC GCCGTCTCCG GCAGCGCCTC CCGGTCGGCG
GCGGCCGAAG CGGCCGGAGC CTCGACGCCG ATGACGTCCG TGGTGGCGGC GGTCGCGGTA
CTGATCACCG GCCTCCTGCT GACGCCGCTG TTCACCGATC TGCCCGAGCC GGTTCTCGGA
GCCATCATCA TCGTCGCCGT ACGCAGCTTT TTGAAGGTCG CCGAACTGCG CCGCTACTGG
CATCGGGACC GCATGTCCTT CGCCGTGGCC GCCACCGCGC TGCTCGGTGT CCTCGTCTTC
GATCTCTTGC CCGGTCTGGT CATCGCGGTG GTGTTGTCCC TGGTGTTGTT CATCGCCTAT
TCGTCACGAC CGCGGCTCGC CGAACTCGGA CGTGTCGGCA CCGGACGGGT CTGGGGCGAC
ATCACGCTGC ACGACGACGC CCGAACAGTG CCGGGATTGC TGGTGCTCCG ACCAGACGCG
CAACTGTTCT TCGGCAACAC GCAAAGAGTC ACCGACGACG TACTGGCGCA TGTGGCGAAC
GCGAACCCCA AGCCGCGAAC CGTCGTCTTG GACCTGTCCG CCAGCTACGG AATGGGCCTG
CCGAGCCAGG ACGCGATCGA GGAACTGGTC GCACGCCTGA AACGCGAAGA CGTCGACGTG
TGGTTCGCAC ACGTGCGCCG GCACGACCCG GGCACGGCGG CGCCGGAATT GGGACGTCCA
GCTGAGATCT TTCCGGACGC CGATACCGCG GCGCAGACCT TCCAGAGTGT TGCGGACGAC
TCGCTGCGGT GA
 
Protein sequence
MLRRGRHEAR AWQKRAAIEV TESGRAERGD RDWPVAAWLR GYDRRRLTPD TIGALTAWAL 
IVPECVAYAQ IAGVPVQNAF YGAPVALLAY ALFGRSNHLI VGATSAAAVL SASTVSAVSS
DPAKAATLST ALALIAGAVL IAAGVARLGF VADFLAEPVL LGFLFGMALT IMVRQLGKIA
GVPTGDGDFF QRLWHVFGHA SDWSWTSIAV GVIAIAALLA LERSVPKLPA ALVVLLAGIA
VSAALNLQDH GVDVVGKIPR AVPTPSWPHL SSSDWTALVA GSFGVALIVF AESFSIAKGL
AAKHGERVDA GREMTAMGAA NAAVGLFRGF AVSGSASRSA AAEAAGASTP MTSVVAAVAV
LITGLLLTPL FTDLPEPVLG AIIIVAVRSF LKVAELRRYW HRDRMSFAVA ATALLGVLVF
DLLPGLVIAV VLSLVLFIAY SSRPRLAELG RVGTGRVWGD ITLHDDARTV PGLLVLRPDA
QLFFGNTQRV TDDVLAHVAN ANPKPRTVVL DLSASYGMGL PSQDAIEELV ARLKREDVDV
WFAHVRRHDP GTAAPELGRP AEIFPDADTA AQTFQSVADD SLR