Gene Caci_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3821 
Symbol 
ID8335174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4321811 
End bp4323472 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content65% 
IMG OID644956960 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003114563 
Protein GI256392999 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.302994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0761262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCAGA TATCCCGGCG CGGCTTCCTG AGCGTGTCCG CCGGCGTGGC CGGTCTGTCC 
CTCGCTGCCT GCGGGGGCGG CGGCGACGGC GGTTCGAAGC CGAGCAGCAA GCTGACCGCG
AACCGCACCG GCGCCATGGC GAAGTACGGC GTCGGGGACC AGTTCAAGGC CACGGTCCCG
CTGTCGTTCT CGGCCATGCT GCTGAGCAAC GCGAACTACC CCTACAAGGC CGACTGGGAA
TTCTGGTCGG AGCTGACCAA GCGCACCAAC GTGACGCTGC AGCCGACGGT GATCCCGGCC
AGCGACTACA ACCAGAAGCG AAGCGTCATG GTCAGCGCGG GCAACGCCCC GACGCTCATT
CCGAAGACGT ACCACCCGGA CGAGGAGGCG TACATCTCCG GCGGCGCGAT CCTGCCGGTC
AGCGACTACC TGGACCTGAT GCCGAACTTC CAGGACAAGG TCGCCAAGTG GAACCTGGCC
GGCGACCTGG ATCAGCTGCG CGAGGCCGAC GGCAAGTTCT ACCTGCTGCC CGGACTGCAC
CAGGACGTGT GGAAGGACTA CTCGCTGGCC ATCCGAACCG ACATCCTCAA GCAGCTGAAC
CTGCAAGTCC CGCAGACCTG GGACGACCTG ACCACAGTGC TGCGCACGAT GAAGCTGACC
TACCCGGACC GGTACCCGTT CTCCGACCGC TGGAGCACGG GGAGCACGAC GCCGCAGCCG
GGCGCCAACA ACCTGCTGGC CATCCTCGGC GAGGCCCACG GCGTCTGGGC CGGCTGGAGC
TACCAGCACG CGAACTGGAA CGCCGACGCG GGCAGGTTCG AGTACACCGG CGCCACGGAC
CAGTACAAGG CGATGATCCA GTATCTCAAC ACCCTGGTGA GCGAGAAGCT GCTGGACCCG
GAGAGCTTCA CCCAGAGCGA CGATCAGGCC CGGCAGAAGT TCGCCGACGG CCAGTCCTTC
GTGATCAGCG CCAACGCCCA GGAGCTGGTC AACCACTACC GCAAGGACAT CGCCAAGATC
TCCGGCGCCA CGGTGGCCAA GATCCCGGTG CCGATCGGCC CGATCGGCGC GGCCAAGACC
GGCTACCGCA CCGAGAACGG CATGATGATC TCCAACAAGG CCAAGGACGG CAAGGACTTC
GTCGCGCTGA TGCAGTTCAT CGACTGGCTC TGGTACTCCG ACGAGGGCCA GATGTTCGCC
AAGTGGGGCG TGCCGGGCAC CACCTACACC GGCAGCGTCG ACGACGGCAC GTTCAAGCTG
GCCCCGGACG TCACCTGGGC CGGGGTCAAC CCTTCGGGCA CCAAGAACCT CCAGGTCGAC
TACGGGTTCT TCAACGGAGT GTTCGCCTAC GGCGGCAGCA CCAAGCTGCT CGACTCTCAG
TTCCCCCCGG AGGAATTGGA GTTCCAGAAG GTGATGGACG CGCGCAAGAC GCTGCCATTG
GCCCCGCCCG CACCGCTGAG CTCCGACGAC CGTGAGCAGG CGACGCTGTG GACGACGTCG
CTGAAGGACT ACGTCGACCA GGAGACGCTC AAGTTCATCC TCGGCAAGCG TCCACTCTCG
GAGTGGACGG CCTACGTCTC CGAGCTCAAG GGCAAGAACA GCGACCAGTA CATCAAGCTC
GTGAACCAGG CCTACCAGGA CTTCAAGAAG AACCACGGCT GA
 
Protein sequence
MNQISRRGFL SVSAGVAGLS LAACGGGGDG GSKPSSKLTA NRTGAMAKYG VGDQFKATVP 
LSFSAMLLSN ANYPYKADWE FWSELTKRTN VTLQPTVIPA SDYNQKRSVM VSAGNAPTLI
PKTYHPDEEA YISGGAILPV SDYLDLMPNF QDKVAKWNLA GDLDQLREAD GKFYLLPGLH
QDVWKDYSLA IRTDILKQLN LQVPQTWDDL TTVLRTMKLT YPDRYPFSDR WSTGSTTPQP
GANNLLAILG EAHGVWAGWS YQHANWNADA GRFEYTGATD QYKAMIQYLN TLVSEKLLDP
ESFTQSDDQA RQKFADGQSF VISANAQELV NHYRKDIAKI SGATVAKIPV PIGPIGAAKT
GYRTENGMMI SNKAKDGKDF VALMQFIDWL WYSDEGQMFA KWGVPGTTYT GSVDDGTFKL
APDVTWAGVN PSGTKNLQVD YGFFNGVFAY GGSTKLLDSQ FPPEELEFQK VMDARKTLPL
APPAPLSSDD REQATLWTTS LKDYVDQETL KFILGKRPLS EWTAYVSELK GKNSDQYIKL
VNQAYQDFKK NHG