Gene Caci_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4301 
Symbol 
ID8335655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4880095 
End bp4881192 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content67% 
IMG OID644957404 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_003115006 
Protein GI256393442 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.389589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000586034 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGAGACA TCAAGACTCT GATTCGTACG GTCGCGGCGG TCGGTGCCGC CCTCGGGATC 
GTCGCCTCGG CCGCCGCGTG CGGGTCGTCG AAGAGCTCTG CCTCGAGTAC GAAGGCCGCG
GCCGTCTCCA GCGGCTCGGC CGCCACGCAG GTGCGCCTGG GCTACTTCGC CAACGTCACG
CACGCCACCG CCGTGGTCGG CGTCGCACAC GGCGACTTCG CCAAGGCACT GGGCTCCACC
AAGCTCTCCA CGCAGGTCTA CAACGCCGGA CCCGCCGAGA TGACCGCCGT CCTCGGCGGA
CAACTGGACG CCGCCTACGT CGGACCGTCC TCGGCCCTGT CCGCCTTCGT CCAGTCGCAC
GGCGAGGCCC TGAAGATCGT CGCCGGCGCC ACCGAAGGCG GCGCGGAACT CGTGGTCAAG
CCCTCAATAG CCTCCGCGGC GGACCTCAAG GGCAAGACCC TCGCAACGCC GCAGAAGGGC
AACACCCAGG ACGTGGCCCT CCGCTTCTGG CTCAAGCAGC AGGGCCTGAC CGCCAACCCG
GACGGCTCCG GCGACGTATC GGTGAACCCC CAGGACAACG CCACCACCCT CGACCAGTTC
AAGGCCGGCC ACATCGACGG CGCCTGGCTC CCCGAACCCT GGGCCTCCCG CCTGGTCGAA
GAAGCCGGCG CGAAGGTCCT CGTCGACGAA CGCAGCCTGT GGCCCAACAG CCAGTTCTCC
ACCACCACCC TCGTCGTGGC GACCACCTTC CTGACCAAGC ACCCCGACAC AGTCAGGGCC
CTGATAGACG GCCAAATCGC CGCCAACACC TGGATCACCT CCAACCCCGC CGACGCCCAA
AAACTGGTCA ACAGCGAACT CAAGCGCCTC ACCGGCAAAG CCCTGACCGA CGCCGAAATC
CAGCGCTCCT TCAGCGAACA GAAGGTCACC AACAACCCCG ACGCATCAAC CCTCCAGACC
TCCCTGGACC ACGCAGTCGC AGTCAACCTC CTGAAGTCCA CCGACCTCCA CGGCATCTTC
GACCTCTCGA TCCTCAACGC CGAACTCACC AAGAACGGCC AGCCGACCGT CTCCGACGCC
GGACTGGCAA AGAAGTGA
 
Protein sequence
MRDIKTLIRT VAAVGAALGI VASAAACGSS KSSASSTKAA AVSSGSAATQ VRLGYFANVT 
HATAVVGVAH GDFAKALGST KLSTQVYNAG PAEMTAVLGG QLDAAYVGPS SALSAFVQSH
GEALKIVAGA TEGGAELVVK PSIASAADLK GKTLATPQKG NTQDVALRFW LKQQGLTANP
DGSGDVSVNP QDNATTLDQF KAGHIDGAWL PEPWASRLVE EAGAKVLVDE RSLWPNSQFS
TTTLVVATTF LTKHPDTVRA LIDGQIAANT WITSNPADAQ KLVNSELKRL TGKALTDAEI
QRSFSEQKVT NNPDASTLQT SLDHAVAVNL LKSTDLHGIF DLSILNAELT KNGQPTVSDA
GLAKK