Gene Caci_6904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6904 
Symbol 
ID8338270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7972883 
End bp7974097 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content65% 
IMG OID644959991 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003117582 
Protein GI256396018 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000382644 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00751503 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACACT CGCGCTGGGC GGCGACGCTC GTCGCTGCCG GTATCGTCGC GGCGGGCCTC 
AGCGGCTGTT CCTCAAGCTC AGGGGGCGGC AGTTCGGACT CTCTGACGGT CCAGGACTAC
TACGCGGAAC CGCAGGCCGG CCAGATGAAG GCGATCTACG ACTCGTGCGC CTCGCAGCTG
GGCGTCAAGA TCAACATCGT GCACGTGGCA TCGAACGGTC TGATCGCCAA GGTGCTGCAG
CAGGTCTCCT CGAAGACGAT GCCCGACGTT CTGATGCTCG ACAACCCGAA CGTGCAGCAG
ATAGCGGCCT CCGGAGCTCT CGCGCCACTG TCGCAATTCG GGATCACCGG AGACGGATTC
GCCAAGGGAG CCGTGTCTGC GGGCTCGTAC AACGGCAAGC TGTACGCAGT GCCACCCGTG
CTGAACTCGA TCTCGCTGTT CTACAACAAG GACATCCTGT CGCAGGCGGG TATCACGCCG
CCGAAGACCT GGGACGAGCT GGCCGCGGAC GCCAAGCAGC TGACGAAGCC CGGCCGTTAT
GGCTTCGCGT TCAGCGCTGC CAACACCGGC GAGGGCACGT GGACGTTCCT GCCGTTCATG
TGGAGCAACG GCGGGGACGA GACGAACATC GCCACCCCCC AGACGGCGCA GGCCCTGCAG
TACCTCACCG GTCTCGTCAG CAGCGGTTCC GCATCCAAGA GCGTGGTCAA CTGGACACAG
GCGGATGTGA ACGACCAGTT CATCGCCGGC AAGGCGGCCA TGATGATCAA TGGTCCCTGG
CAGATTCCGG CGCTGGACAA GGCCGGTGTG CACTGGGCCA GCGTGAGCAT CCCGACGCGC
GAGGCCGGCC AGACCGTGGT TTCGCCGCTG GGTGGGGAGA CGTTCAGCGT TCCCAACACC
GGGCACTCCG CGTCGATGAA AAAGGCCGCA CAGTTCGTGA GCTGCCTGAC CAACGACCAG
AACGAGGCGA CGAAAGCGGC CAATGAGGAC GCGGTCCCCT CGCGAACGGA TGCCGCAGCC
AAGTTCGCCT CATCCAATCC GGAGCTGGCG TCCTTCGTGA GCATCGTGGC CGACGGCCGC
TCTCGCACGG CGCAGTTGGG CGCGAAGTGG CCCGCGACGG AAACGGCGAT CTACACAGCG
GTGCAGGCGG CCATCACGGG CGAGGCGTCG CCCCAGGCCG CACTTCAGCA GGCGCAGTCG
CAGATCAGCA AGTAG
 
Protein sequence
MKHSRWAATL VAAGIVAAGL SGCSSSSGGG SSDSLTVQDY YAEPQAGQMK AIYDSCASQL 
GVKINIVHVA SNGLIAKVLQ QVSSKTMPDV LMLDNPNVQQ IAASGALAPL SQFGITGDGF
AKGAVSAGSY NGKLYAVPPV LNSISLFYNK DILSQAGITP PKTWDELAAD AKQLTKPGRY
GFAFSAANTG EGTWTFLPFM WSNGGDETNI ATPQTAQALQ YLTGLVSSGS ASKSVVNWTQ
ADVNDQFIAG KAAMMINGPW QIPALDKAGV HWASVSIPTR EAGQTVVSPL GGETFSVPNT
GHSASMKKAA QFVSCLTNDQ NEATKAANED AVPSRTDAAA KFASSNPELA SFVSIVADGR
SRTAQLGAKW PATETAIYTA VQAAITGEAS PQAALQQAQS QISK