Gene Caci_5603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5603 
Symbol 
ID8336963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6459093 
End bp6460223 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID644958707 
ProductABC transporter substrate-binding protein 
Protein accessionYP_003116303 
Protein GI256394739 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.069875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.939883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTG CCATATCCAA GAAATCGGCT GCGACGAAGA TGCTGATCGT GGCCGCCGCC 
GGCGCCCTGA GCCTGACCGC GCTGAGCGCC TGCTCGTCCT CGAAGAGCTC GAGCAGCTCC
GCGGCCTCCG GCACCGCCGG CGTCTCGAAG ACCACGCAGC ACGTCACGCT GATGGTCGGC
GGCATCGACA AGCAGATCTA CCTGCCCTAC AAGCTCGCCG ACCAGCTCGG CTTCTACAAG
AAGTACAACG TCGACGTCAC GCTGAGCACC GAGCAGGACG GCGGCGTCGG CGCCGAGGAG
GCCATGGTCT CCGGCCAGGT CGACATGGCC GGCGCGTGGT ACAACCACGC GATCGACTTC
CAGATGAAGC ACAAGAACGT CGAGGACCTC GTGCAGCTCT CCGGTGCCCC CGGCGAGCGC
GAGATGTGCG GCAACAAGGC CACTGTGCAC ACCGGGGCAG ACTTCGCGGG CAAGACGATG
GGCGTCACCG ACCTGGGCTC GGGCACCGAC ACCCTGACCC AGCTGATCGC GGCGCAGAGC
GGCGTGGCCA AGAACAAGTT CAGCCGCACC GGCGTCGGCG CCGGCTCCAC GGCGCTGGCC
GCGCTGAAGA ACGGCTCCAT CTCCTGCGTC ATGACCACCC AGCCGACGGT CACCGCCATC
GAGAAGCAGA ACCTCGGCTA CTCCGCGATC GACCTGGCCA CCACCGAGGG CGCCACCAAG
GCCCTGGGCG GCGCGTGGCC CGCGGCCGGC GTGCTGGCCC GCACCGACTG GGCCAACCAG
CACCAGGAGG CCGTGCAGGA CGTGGTGGAC GCCCTGGTGG CCACCATGCA CTGGATCAGC
ACGCACTCGG CGACCGACAT CGCCAACGCC CTGCCGGCGA GCTACACGAA CAACGCGATC
ATCTCCAAGG CCGACTACAT CGCCGGCCTG ACCATGGACA AGAGCCAGTT CCTGCCCGAC
GGCATCATGC CCGCCGGCGG CCCGAAGGTG GTCCTGACCA CCGAGAAGCT GATCGGCAAC
GCCGACGACT CGGTGAACCT CGGCGCCACG TTCACGAACA CCTACGCGAT CAAGGCCAAC
CAGCTCGAGG GCTTCACGAC CACCACGACG CCGGCCGGTC CCACCGGCTG A
 
Protein sequence
MSSAISKKSA ATKMLIVAAA GALSLTALSA CSSSKSSSSS AASGTAGVSK TTQHVTLMVG 
GIDKQIYLPY KLADQLGFYK KYNVDVTLST EQDGGVGAEE AMVSGQVDMA GAWYNHAIDF
QMKHKNVEDL VQLSGAPGER EMCGNKATVH TGADFAGKTM GVTDLGSGTD TLTQLIAAQS
GVAKNKFSRT GVGAGSTALA ALKNGSISCV MTTQPTVTAI EKQNLGYSAI DLATTEGATK
ALGGAWPAAG VLARTDWANQ HQEAVQDVVD ALVATMHWIS THSATDIANA LPASYTNNAI
ISKADYIAGL TMDKSQFLPD GIMPAGGPKV VLTTEKLIGN ADDSVNLGAT FTNTYAIKAN
QLEGFTTTTT PAGPTG