Gene Caci_2622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2622 
Symbol 
ID8333971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3004944 
End bp3006257 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID644955774 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003113380 
Protein GI256391816 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0574924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.833644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA CTCCTTTGAA CGCCGCAGCC TCACGCCGCT CGTTCCTCCT CGGCGGCCTG 
TCCATAGTCG GCGCGGCCGC CCTGTCCGGG TGCAGCGTCA CCGGCACCTC GCAGAAGAAG
GGCTCCGCGG GCTCCGGCTC CGGCACCATC AATGTCCTGT TCATGCAGCA GGCCGGCTAC
AGCACCGACG ACGTCACGAA GATGACCGCC GCGTTCACCA AGCAGTACCC GGACATCAAG
GTCAACCCCA CCTACGTCGC CTACGAGGCG CTGCACGACA AGATCGTCAC CGCCGCCGCG
GCCGGCACCT ACGACGTGGT GCTCATCGAC GTCATCTGGC CGGCGGAGTT CGGCAAGAAG
AACATCGTCG CCGACGTCAC CTCCCGCTAC CCGGCGGACT GGAAGGACAC GATGCTCGGC
GGCGCGCTGC TGACCGCCGA CTACGACGGC AAGCAGTACG GCGTGCCGTG GGGGATGGAC
ACCAAGTTCT TCTACTACAA CAAGGCCCTG CTGGCGAAGG CCGGCGTCGA CGCCTCCACG
CTGGGCACTT GGAGCGGCGT GCTCCAGGCG GCCAAGGCGC TGAAGCAAGC CAAGGTCGTG
GAGTACCCGC TGGCGTGGAG CTGGTCGCAG GCCGAGGCCA TCATGTGCGA CTACACGCAG
CTGGTCGGCG CGTTCGGCGG GTCGTTCACC GACAGCGCCG GCAACCTCAC CCTGAACAAG
GGCGGCGCGG TGGACGCGCT GGCCTGGATG CGCCAGAGCA TCGTCGACGG TCTGACCAAC
CCCTCCTCCA CCACGTTCCT GGAAGCCGAC GTGGAGAAGA CGATGAACAA CGGCCAGGCG
GCGTTCGGTC TGAACTGGAC CTACTACCTG GGCTCCTCCA ACGACCCGAA GAACTCCCAG
GTCCCCGGGC AGATCGTGGT CGCCCAGACC CCGGCCGGCC CGAGCGGGAA GCGCCCGGGC
GTCAACGGCG CGATGGCGCT GTCGGTGTCC ACGGGCAGCA AGAACCAGGA CGCCGCCTGG
AAGTACATCT CCTGGATCGC CGGGGAGGAC CAGGTCGACC AGTTCGCCAA GGACGAGATG
CCGATCTGGA AGAAGTCCTT CACCACCCCC TCGGTGGTCT CCTCGGCGCC GGACATGTTC
GCCGTCGCCG CCAAGCAGCT CGACGACCTG GTCGTGCGCC CGCAGTTCGT GAACTACAAC
GCGGTCTCCC AGGTCATCCA GGTCGAGCTG CAGAACGCGC TGCTGGGCAA GAAGCCCGCG
CAGCAGGCGC TGGACGACGC GGTGAAGGCC GCGCAGCCGC TGATGGGGGG CTGA
 
Protein sequence
MNATPLNAAA SRRSFLLGGL SIVGAAALSG CSVTGTSQKK GSAGSGSGTI NVLFMQQAGY 
STDDVTKMTA AFTKQYPDIK VNPTYVAYEA LHDKIVTAAA AGTYDVVLID VIWPAEFGKK
NIVADVTSRY PADWKDTMLG GALLTADYDG KQYGVPWGMD TKFFYYNKAL LAKAGVDAST
LGTWSGVLQA AKALKQAKVV EYPLAWSWSQ AEAIMCDYTQ LVGAFGGSFT DSAGNLTLNK
GGAVDALAWM RQSIVDGLTN PSSTTFLEAD VEKTMNNGQA AFGLNWTYYL GSSNDPKNSQ
VPGQIVVAQT PAGPSGKRPG VNGAMALSVS TGSKNQDAAW KYISWIAGED QVDQFAKDEM
PIWKKSFTTP SVVSSAPDMF AVAAKQLDDL VVRPQFVNYN AVSQVIQVEL QNALLGKKPA
QQALDDAVKA AQPLMGG