Gene Caci_4775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4775 
Symbol 
ID8336129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5437065 
End bp5438912 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content71% 
IMG OID644957875 
Productvon Willebrand factor type A 
Protein accessionYP_003115477 
Protein GI256393913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0165447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGGCA GGCATCGGTC CTATGAGGGC CCCGGCAACA CCCCCCGCGG CCGCGGCGGC 
GGCTCCGGCG GATCGTCGTT CCCGACCGGG TTGGTCGCGA TCGGCGCGGT CCTCGTGCTC
GCCGCCGGCG GCGGCTACGT GTATTACACG AAAAAACACG ACACCACGGC GACCGCGGGC
AACAGCGGCA CGTCCTCGGC GGCCAACGGC TCGTGCGCGG CGCCCACCAC GCTGAACGTC
GACGCCAACC CCGACGTCTA CACCGCGGTC AAGGCGGTCG CCGACGGCAT GGCGGACCCG
TGCGTGCACG TCAACGTCAG CAGCGCCGAG GCCTCGGCGG TCGAGGCGTT CCTGGCCGGC
AGCGCCAAGG GCGGGGACGT CACCAGCGCT CCGGACGTGT GGATCCCGGA CAGCAGCATG
TGGATCGACA TCGCGCACAC CGGTGGCGTG AAGTCGCTGG CCGCCAACCC CGCGCCGGTG
GCGACCAGCC CGCTGGTGAT CGGGATGCCC AAGCCGGTGG CCGCAGCCGC CGGCTGGCCG
GCCAAGCCCT TCGGCTGGGC GGACCTGCTG GCCAACTTCA AGACCACCAA GCTGCAGACC
GCCGTTCCGG ACCCGACCAC CTCAGGGCCC GGACTCGCGG CGATCACCAT GCTGCGCGCG
GCCGTGCTCG GCCCGGCCGG GACCGACAAG GCCAAGCAGA GCCAGGCGCT GCAGAACCTC
ACGCTGGTCT ACCGGGTCAT GAGCACCTCG GTCTCCAGCT CGATGAGCGC CCTGCTCACC
GGGCTGCCGA CGCAGGGTGC CACCGCGGCC GGAGCCGGCG GTATAGCGGC GTTCCCGTCC
ACCGAGCAGA AGATCGCGGC GTACAACACG GCCAGTCCCG CCACGCCGCT CGTCGCGCTG
TATCCCTCGG ATATGGGCAC GATGATGATG GACTACCCGT ACACCATCAG CTCCACCCTG
GACGCCGCGC ACGCCAAGGC CGCAGCGGAC TTCCAGACGC TGTTGCACAG CCCTGCGGCC
GTCAACACCC TGCAGAAGGC CGGCTTCCGC GATCCCAAGG GCGCCGCCGC AGGAATCCTC
ACCTCCGCGA ACGGCGTCAA CCCGGCGGTA CCGGCGCTGG CTCCGGCGGA CACCACCCAC
ACCGCCGCCG GCTCGGCGCT GTCGGTGTGG AAGGTGACCA GCGAGCAGAC CCGCGGCCTG
GTGGTCATGG ACGTCTCCGG CTCGATGGGC CTGACCGTGG ACGGGCAGGT CGACCCGAAT
ACCCACACCC CGCTGAGCCG GCTGCAGATC ACCGCCGCGG CGTGCCTGAC CGGGCTGCCG
CTGTTCGGCG ACAGCTCGCA GCTGGGCCTG TGGACGTTCA CCACCAAGAA CACCGCGGAC
GGCGGCGGGA CCGTGCACAA GGAGCTGGTC CCGATGGGCC CGCTGTCGGC ACCGGTGGGC
GCCTTCCCCA GCCGGCGCGC GGCGCTGAAC GCGGCGCTGG GACAGCTGAG CATCCAGCCG
GGCAGCCGCA ACGGGCTCTA CGACACCATC CTGGACGCCT ACCAGACGGT GCTGACCGGC
TGGGCGCCGA ACGAGTCCAA CGCGATCGTG GTCTTCACCG ACGGCAAGGA CGACGGCCTG
AACTCGATGA GCGCCGACCA GCTGATCACC AAGCTGAACG CGCTCAAGGC CGCGAACCCG
AACCACCCGG TCCGGGTCAT GATCGTGGCC CTGGGCAGCG GCGTGGACCT CACCACCCTG
TCGAAGATCA CCGGCGCCGC CAACGGCCAG GCGCTGCACG CCGACACCCC CGCCGACATC
GGCTCGGCGG TGATCGCCGG CTTCGCGGGC CGCCTGTCCG ACCAGTGA
 
Protein sequence
MAGRHRSYEG PGNTPRGRGG GSGGSSFPTG LVAIGAVLVL AAGGGYVYYT KKHDTTATAG 
NSGTSSAANG SCAAPTTLNV DANPDVYTAV KAVADGMADP CVHVNVSSAE ASAVEAFLAG
SAKGGDVTSA PDVWIPDSSM WIDIAHTGGV KSLAANPAPV ATSPLVIGMP KPVAAAAGWP
AKPFGWADLL ANFKTTKLQT AVPDPTTSGP GLAAITMLRA AVLGPAGTDK AKQSQALQNL
TLVYRVMSTS VSSSMSALLT GLPTQGATAA GAGGIAAFPS TEQKIAAYNT ASPATPLVAL
YPSDMGTMMM DYPYTISSTL DAAHAKAAAD FQTLLHSPAA VNTLQKAGFR DPKGAAAGIL
TSANGVNPAV PALAPADTTH TAAGSALSVW KVTSEQTRGL VVMDVSGSMG LTVDGQVDPN
THTPLSRLQI TAAACLTGLP LFGDSSQLGL WTFTTKNTAD GGGTVHKELV PMGPLSAPVG
AFPSRRAALN AALGQLSIQP GSRNGLYDTI LDAYQTVLTG WAPNESNAIV VFTDGKDDGL
NSMSADQLIT KLNALKAANP NHPVRVMIVA LGSGVDLTTL SKITGAANGQ ALHADTPADI
GSAVIAGFAG RLSDQ