Gene Caci_5330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5330 
Symbol 
ID8336684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6144087 
End bp6145403 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content67% 
IMG OID644958428 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003116030 
Protein GI256394466 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.489769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.341368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGGG CACGGAAGCG TTCTGCCGCG GCAATCGCCC TGCTCGTCTC CGGCGCGATG 
CTCGCGTCCG GGTGCAGCAG CAGCAAGTCG TCCTCGTCAA GCACCAAGAC CACCACCGAC
AGCGGCCAGC AGATCACGCT GAAGGTCGGC CTGTTCGGGA CGTTCGGCTT CAAGGAGGCC
GGACTTTACG ACCAGTACAT GAAGCTGCAC CCGAACATCA AGATCGTCGA AGACAGCGTC
GAGGACGAGG GCCAGTACTA CACCTCGCTG CAGACCCACC TGTCGGCCGG CAGCGGCCTG
GACGACATCC AGGGCATCGA GGTCGGCCGC ATCGCCGACG TCGACCAGAA CCTCTCCAGC
AAGTTCGTCG ACCTGAACTC CCTGGGCGCG GCGAGCCTGA AGAGCAACTT CTACCCGGCG
AAGTGGTCCG CCGCGACCAC CTCCGACGGC AAGGTCATGG CGCTGGGCAC GGACTACGGA
CCGCTGGCCA TCTGCTACCG CACCGACCTG TTCAAGGCCG CGGGCCTGCC GACCGACAGC
GCCGGTGTCA GCGCGCTGTG GCCGGACTGG AACAGCTACG TCCAGACCGG GCTGAAGTAC
AAGGCGAAGG CTCCGGCCAA CCAGGCCTGG ACCGACACCG CCGGCGGTAC GTTCAACGCG
ATCGTCGGAC AGTCGGCGAA CCAGTACTAC GACAGCTCCG GCAAGGAGAT CGCGGACACC
AACCCGGCCG TGCAGAACGC CTGGAACATC GCGATGCAGC TGTCCACCCA GGGTCTGACC
GCGAAGCTGA GCCAGTTCAC GCCGGCCTGG AACCAGGCCT TCACCACCGG CTCGTTCGCC
ACGATCGCGT GCCCGGCGTG GATGACCGGC TACATCAAGA GCGAGGCCGG GGCGATGACC
GGCGACTGGG GCGTGGCCAA GATCCCCGGC GGCACCGGCG ACTGGGGCGG GTCCTACCTG
GCCATCCCCA AGGCTTCCAA GCACCAGAAG GAGGCCTACG ACCTGATCAA CTGGCTGACC
AACCCGGACC AGCAGAAGAC CATGTTCACC AGCCAGGGCC ACTTCCCGTC CTCGCAGACG
GCGGCCCAGG ACCCGTCCAT CGCCTCGCAC ACCGACCCCT ACTTCGGCGA CTCGCCGCTG
GGTCAGATCT ACGCCGCCTC CGCGGCCACG ATCCCGCAGG CCGTGCTCGG CGCCAAGGAC
GGGACGATCA AGGACACCTT CTCCAAGGCG ATCACCCGCG TGGAGGCCCA GGGCCAGGCG
CCGCAAGCCT CGTGGACGAA GGCCTTGTCC GACATCAAGG CCGCTACCAG CGGCTGA
 
Protein sequence
MLRARKRSAA AIALLVSGAM LASGCSSSKS SSSSTKTTTD SGQQITLKVG LFGTFGFKEA 
GLYDQYMKLH PNIKIVEDSV EDEGQYYTSL QTHLSAGSGL DDIQGIEVGR IADVDQNLSS
KFVDLNSLGA ASLKSNFYPA KWSAATTSDG KVMALGTDYG PLAICYRTDL FKAAGLPTDS
AGVSALWPDW NSYVQTGLKY KAKAPANQAW TDTAGGTFNA IVGQSANQYY DSSGKEIADT
NPAVQNAWNI AMQLSTQGLT AKLSQFTPAW NQAFTTGSFA TIACPAWMTG YIKSEAGAMT
GDWGVAKIPG GTGDWGGSYL AIPKASKHQK EAYDLINWLT NPDQQKTMFT SQGHFPSSQT
AAQDPSIASH TDPYFGDSPL GQIYAASAAT IPQAVLGAKD GTIKDTFSKA ITRVEAQGQA
PQASWTKALS DIKAATSG