Gene Caci_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2554 
Symbol 
ID8333903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2890829 
End bp2892667 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content67% 
IMG OID644955707 
ProductCarbohydrate binding family 6 
Protein accessionYP_003113313 
Protein GI256391749 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTCAT CCTCGTACTT TCGCACTCCC AAAATCGCCG CCGCCGTCGT GGTCCTGGCT 
GTCGGCAGCC TCGCCTCGGC TGCCCTGACC GCAGCCGGTC CGGCCAACGC CGCGCAGGCA
GCGCCGCCGC CGACCACTAC CGCCAACGCC GCCACGGTCG CCGCCACGCC CTTCATGGGC
TGGTCGAGCT GGAGCATGCA GTCCTCGTCT TACCCGGGAC TGAACCCGAA CGGCAACTAC
AGCTACCTCA CCGAAGCGAA CGTCCTGAAG CAGACCGACG CCCTGGCCGC CAAGCTCAAG
GCGTACGGCT ACGACCACGT CGACATCGAC GCCGGCTGGT GGCGCGACAA CAACTGGACG
CCGGAGTACG ACCAGAACGC CCGGCAGACC CCTGACCCGG TCCGCTTCCC GCACGGTATG
CAGTCGATCG CCGACCACAT CCACTCCCAG GGACTCAAGG CCGGTATCTA CCTGCCGGTC
GGTCTGGAGA AGGAGGCGTA CGGCGGCGGC ACCGTGCCGA TCGCGAACGC TCCCGGCTGC
ACCACCGCCG ACATCGTCTA CCCGGACCTT CGCACCACCA ACGGCTGGGA CAGCTCCTAC
AAGCTGAACT TCGCCAACGC CTGCGCGCAG AAGTACGTCG ACTCCCAGGC GCAGATGCTC
GCCGGCTGGG GCTACGACTT CCTCAAGATC GACGGCGTCG GTCCCGGCTC GGGCAAGTCC
GGCGACAACT ACGACAACAC CGCCGACGTG GCCGCCTGGA ACCAGGCGAT CGCCGCCACC
GGCCGTCCGA TCCACCTGGA ACTGTCCTGG TCCCTGGACC GGGGCAACGC CGCCAACTGG
AAGCAGTACT CCAACGGCTG GCGCGTCGAC ACCGACGTGG AGTGCTACTG CAACACGCTG
GTCACCTGGG ACAACTCGGT CAAGGCCCGC TGGAACGACG CCCCGGTGTG GAGCGACGTG
GCAGGTCCCG GCGGCTGGAA CGACCTGGAC TCCCTCGACG TCGGCAACGG CACGATGGAC
GGCCTGACCA ACGCCGAGCG GCAGAGCTAC ATGACGCTGT GGGCGATCGA GAAGTCACCG
CTGTTCACCG GTGACGACCT CACCCAGCTG GACAGCTACG GTCTGTCGCT GCTCACCAAC
CGGGAAGTCA TCGGCATCGA CCAGAACACC TCGCCGGTGG CGCGTCCGGT CAGCACGATG
CGCGACCAGC AGGTCTGGGC GACCAAGAAC GCTGACGGCA GCTACACCGT CGCGCTGTTC
AACATGGCCG CCGCGCCCGA ATCGGTCAGC GCCTACTGGG CCGCGCTCGG ATTCCAGGGC
AACGCCAGCG TCCACGACCT GTGGAACCAC CAGAACCTCG GCTCCTTCAC CAACCAGATC
ACCGAGGCAC TGCCCGCCCA CGGCTCGCGG CTGTTCACCA TCACCCCGGC CGGCAGCACC
AAGCCGGTGA AGACCACGAG CTACGAAGCC GAGTCCACGA ACAACACACT GACCGGCGGC
GCCTCGCTCA CCGCCTGCAC CGCCTGTTCC GGCGGCTCAC GGGTCGGCAA CCTCTACGGA
AGCGCCAAGC TTCAGGTCAA CAACGTCACC GTCAAGAAGG ACGGGATCTA CACCATCACC
GTCTCCTACG TCGACGGCAG CACGGACCGG ACCGCGACCA TCTCCTCCAA CACCGGCAGC
GGGACAAGCA TCGCCTTCCC CTCGACAGGC GACTGGAACA CCGTCCACTC CATCAGCTTC
CAGCTGGGTC TGAAGGCGGG CTCGAACTCC ATCACCTTCG ACAGCGCCGG CTGGTACTCG
CCCGACATCG ACAAGATCGA CGTCCCGGTT TCCTCCTGA
 
Protein sequence
MRSSSYFRTP KIAAAVVVLA VGSLASAALT AAGPANAAQA APPPTTTANA ATVAATPFMG 
WSSWSMQSSS YPGLNPNGNY SYLTEANVLK QTDALAAKLK AYGYDHVDID AGWWRDNNWT
PEYDQNARQT PDPVRFPHGM QSIADHIHSQ GLKAGIYLPV GLEKEAYGGG TVPIANAPGC
TTADIVYPDL RTTNGWDSSY KLNFANACAQ KYVDSQAQML AGWGYDFLKI DGVGPGSGKS
GDNYDNTADV AAWNQAIAAT GRPIHLELSW SLDRGNAANW KQYSNGWRVD TDVECYCNTL
VTWDNSVKAR WNDAPVWSDV AGPGGWNDLD SLDVGNGTMD GLTNAERQSY MTLWAIEKSP
LFTGDDLTQL DSYGLSLLTN REVIGIDQNT SPVARPVSTM RDQQVWATKN ADGSYTVALF
NMAAAPESVS AYWAALGFQG NASVHDLWNH QNLGSFTNQI TEALPAHGSR LFTITPAGST
KPVKTTSYEA ESTNNTLTGG ASLTACTACS GGSRVGNLYG SAKLQVNNVT VKKDGIYTIT
VSYVDGSTDR TATISSNTGS GTSIAFPSTG DWNTVHSISF QLGLKAGSNS ITFDSAGWYS
PDIDKIDVPV SS