Gene Caci_8075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8075 
Symbol 
ID8339453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9368438 
End bp9369643 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID644961160 
ProductCarbohydrate binding family 6 
Protein accessionYP_003118739 
Protein GI256397175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCGA GCCGTAAGCG CCTCGCCGTG GCAGCCGCCG CGGCGATCGC GATCGCCACC 
CAGCTCGCCA TCACGCACAC CGCCCAAGCG GCCACGGCGC ACCCCGCCGC CAAGGCCGGC
TCGGCGGCGG CGCCGGCCGC CGCCGCGGCC GACTACACCG CCGCCCAGGT TCTGGCCGGC
GTGCAGAAGA ACTCCACGTC CTCGACCCAG GTCAACAGCA AGCCCCACAT CAACACCATG
ACGCGGTCGA TGAACGTGAA CGTGTACCAG CCCGCTCCGG GCGTGTACTC CTACACCTCC
AGCATGGCCA TCGACGACGA CGGCAGCGAC CCGGACCCGG ATCCCGACCA CCAGGGCGAG
ACCACCTTCC AGGACAGCAA CGGGGCGCAG CTGGCCGCGC ACCACGTGCC GTTCTTCGTC
CTGGGCGACG ACTGCTGGGA CAAGAAGACG CCGTGCCCGC ACTTCTTCTA CAAGGAACAC
GGCATGTCCG GCCGTCAGTT CGCGCTGATG TTCTACAAGG GCAAGGTCAT CGGCTCGATC
TTCGGTGACA CCCAGACCGG GAACAGCCAG ACCACCTCGG ACAACGACTC GCGCGAGCTC
GGCGAGGCGT CCGTGAAGGC CGCCTCCCTG CTCGGCATCC CGAGCAGCGG CACCACCGGC
GGCGTGGACA ACGGCGTGAC CGTGGTCATG TTCTCCGGCC CGTCCTGGGT CGTGAACGGC
AGCAACGCCA ACCTGAGCAA CAACGCCCAG GCCCTGGTGC AGAAGGCGCT GAACACCCTC
GGCGCGGCCA TGGACGGGGG CGGCACGACC CCGCCGCCGC CGACCGGCAC GCTCTTCGAG
GCCGAGACCG GCTCGATGTC CTCCGGCGGC ACATTCGACT CCAACCACAC CGGCTTCACC
GGCTCCGGGT TCGCCAACCC GGCCAACGCG GCCGGCTCCT ACCTGGACAT CCCGGTCACC
GCGGACTCCG CGGGCACCAA GACCCTGACG TTCCGGTACT CGGACGGCAC CAGCTCGGCG
CGCCCGGCGA CCATCTCGGT CAACGGCACC TCGCACGGCA CGCTGAACTT CCCGGTCACC
TCGGACTGGA ACACCTGGTC CACCGCGACC ATCTCGGTGC CCCTGACCGC CGGCGCCAAC
ACCATCCGGG TCACCGGCAC GGTCGCGGAC GGCCCGGCCA ACATCGACTC GGTGACCGTC
TCCTAG
 
Protein sequence
MSSSRKRLAV AAAAAIAIAT QLAITHTAQA ATAHPAAKAG SAAAPAAAAA DYTAAQVLAG 
VQKNSTSSTQ VNSKPHINTM TRSMNVNVYQ PAPGVYSYTS SMAIDDDGSD PDPDPDHQGE
TTFQDSNGAQ LAAHHVPFFV LGDDCWDKKT PCPHFFYKEH GMSGRQFALM FYKGKVIGSI
FGDTQTGNSQ TTSDNDSREL GEASVKAASL LGIPSSGTTG GVDNGVTVVM FSGPSWVVNG
SNANLSNNAQ ALVQKALNTL GAAMDGGGTT PPPPTGTLFE AETGSMSSGG TFDSNHTGFT
GSGFANPANA AGSYLDIPVT ADSAGTKTLT FRYSDGTSSA RPATISVNGT SHGTLNFPVT
SDWNTWSTAT ISVPLTAGAN TIRVTGTVAD GPANIDSVTV S