Gene Caci_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2151 
Symbol 
ID8333496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2437299 
End bp2438699 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content70% 
IMG OID644955301 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003112911 
Protein GI256391347 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAC GCTCCCACCG GTCCGTCACG GTTGTCATAG CCGTCGGACT GGTCTCCGGC 
CTGGCACTGG CCGGCTGCTC CTCCAGCTCC TCCAAGCCCT CGGCGGGCGC GTCGACCTCC
TCGGCCGCCG GCACCTCGGC GCCGAGCACG TCCGCGGCTT CCTCCTCCGC CGCGGCCGCC
GGGCCCGCGC TGCCGGACCT GTCCGGCAAG TCGATCCAGG TGCTGGCCGA GTGGTCCGGC
CAGGAGCAGC AGGACTTCCA GAAGGTGATC GACGCCTTCA CCGCCAAGAC CCACGCCAAG
GTCAGCTACC AGGGCGCCGG CGACCAGACC CCGACCGTCC TGCGCAGCAA GCTGGCCGGC
GGCGGCGCCC CCGACGTGGC GCTGCTGGCC CAGCCCGGCG CCATCGCGCA GTTCGCGAAG
GCCGGGCAGA TCAAGCCGCT GGGCGCCAAC GTGCTCTCGG AGATCGACGC CAACTACGAC
CCGAGCTGGA AGAAGCTCGG CACGGTCAAC GGCCAGGTCT ACTCGATCAT GTTCAAGGCG
GCGAACAAGT CGACCTTCTG GTACAACACC GCGCAGTTCT CGCAGGCCGG CATCACGCCG
CCGAAGACCT GGGCCGACTT CCTCAAGGAC TGCCAGGCGC TCTCCGACGC CGGCATCACC
CCGGTCTCGA TCGGCGGCGC CGACGGCTGG ACGCTCACCG ACTGGTTCGA GAACGTCTAC
CTCTCCCAGG CCGGCGCGGA CAACTACGAC AAGCTCGCGC ACCACCAGAT CCCCTGGACC
GACCCCACGG TGGTCCAGGC GCTGACCACG ATGAAGCAGC TGTTCGGCAA CGACCAGTTC
ATGGCCGGCG GCAAGGCCGG GGCGCTGCAG ACCTCGTTCA ACGACTCGGT CACCCAGACC
TTCAAGAGCC CGCCGAAGGG CGCGATGGTC TACGAGGGCG ACTTCTCCGG CTCGGTGATC
ACCTCGACCA CCTCGGCCAA GCTGGGCACC GACGCCAAGT TCTTCGCCTT CCCGGCGGCC
GGGTCGCTGA CCAACTTCGT GGACGGCGGC GGCGACGCCG CGCTGGCCAC CAACGACAAC
CCGGCGACGA TGGCGTTCAT CCAGTTCCTG GCCTCCCCGG AGGCGGCCGA GGCGTGGGCG
TCGGCCGGCG GCTTCGTCTC GCCGAACAAG AACGTCCCGA TGTCCTCCTA CCCCGACGAC
ACCACCCGCG CCGAGGCGCA GATGCTCGTC AGCGCCGGCG ACGGCTTCCG CTTCGACATG
TCCGACCAGG CTCCGGTCGG CTTCGGCGGG ACCAAGGGCG CCGGGGAGTG GAAGGACCTG
CAGGACTTCC TGAGCAACGG CGACGTCAAC GGCACCGCCG CGCAACTGGA GAAGGACGCG
GCGAAGGAGA CCTGGCAGTA G
 
Protein sequence
MGARSHRSVT VVIAVGLVSG LALAGCSSSS SKPSAGASTS SAAGTSAPST SAASSSAAAA 
GPALPDLSGK SIQVLAEWSG QEQQDFQKVI DAFTAKTHAK VSYQGAGDQT PTVLRSKLAG
GGAPDVALLA QPGAIAQFAK AGQIKPLGAN VLSEIDANYD PSWKKLGTVN GQVYSIMFKA
ANKSTFWYNT AQFSQAGITP PKTWADFLKD CQALSDAGIT PVSIGGADGW TLTDWFENVY
LSQAGADNYD KLAHHQIPWT DPTVVQALTT MKQLFGNDQF MAGGKAGALQ TSFNDSVTQT
FKSPPKGAMV YEGDFSGSVI TSTTSAKLGT DAKFFAFPAA GSLTNFVDGG GDAALATNDN
PATMAFIQFL ASPEAAEAWA SAGGFVSPNK NVPMSSYPDD TTRAEAQMLV SAGDGFRFDM
SDQAPVGFGG TKGAGEWKDL QDFLSNGDVN GTAAQLEKDA AKETWQ