Gene Caci_4460 details


Gene Information

Locus tag: Caci_4460
Symbol:
ID: 8335814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5077972
End bp: 5081073
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 71%
IMG OID: 644957562
Product: Vault protein inter-alpha-trypsin domain protein
Protein accession: YP_003115164
Protein GI: 256393600
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
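
The coordinate and length fields in this record agree with each other, which can be checked with a short sketch. It assumes (the record does not state this) that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5077972
END_BP = 5081073
GENE_LENGTH_BP = 3102
PROTEIN_LENGTH_AA = 1033


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 3102
print(protein_length(GENE_LENGTH_BP))  # 1033
```

Under these assumptions both computed values match the Gene Length and Protein Length fields.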


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.743014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTTG TTCAAACCGG TGTCGTCGAC GTCGTCGAGG AAGCGCCGCG CCCGGAGCCG 
GACGGCGGCC TCGGGGCGCT GCGGACCGAG CGCGGGAACC TGCCGCTGGA GCTGATCGAG
GTGCGCTCGG CGGTGGCGGG GCTGGCCGTG CGCACCGAGC TGGCGCAGGG CTTCCGCAAT
CCGTACGACG TGCCGCTGGA GGCGACGTAC ATCTTCCCGC TGCCGGACCG CGCCGCGGTG
ACGCGGCTGC GCATGGAGGC CGCGGACCGC GTGATCGAGG GCGTCATCAA GGAGCGGGAG
GCGGCGCGCG CCGACTACGA CGCGGCGATC TCCGCGGGCC AGCGCGCCTC GATCGCCGAG
GAGGAGCGCC CGGGCGTCTT CACGCTGCGG GTCGGCAACA TCATGCCCGG CGAGCGCGTG
GTGATCCGTA CGTCGCTGTC GGGGCGCCTG CCGTATGAGG ACGGACAGGC CACGTTCCGG
TTCCCCTTGG TCGTCGCGCC GCGCTACATC CCCGGCGCCG ACCTGCCCGG GGAGCAGGTG
GGCAGCGGCA CGGCGTCGGA CACCGACCAG GTACCCGATG CCTCGCGCAT CAGCCCACCG
ATCCTGCTGC CCGGGTTCCC GAACCCGGTC CGGCTGTCGA TCGAAGTGGC GGTCGACCCG
GTGGGGCTGC CGCTGGCCGG GCTGACGTCG AGCCTGCACG GGGTGAGCGT GGAGGAGTCG
GAGGGCTCTC GATATCTGGT GCGGCTGAAC CCCGGGGCGC GCGCCGACCG GGACTTCGTG
CTGCGCTTGG GGTACGGCGG TTCGGGCGCG GCGACATCCT TGGCCGTCGC ATGGGACAGC
GAGTCCGCGA ACGAGGTCGC GAAGGAATCC GCGAAAGCGA CGCCCACTGA CATCGGCACA
TTCCTGCTGA CCGTCCTGCC CCCGGAACCG ACCGGCGCCA CCCGCCCGCG TGACGTCGCG
CTGATCCTGG ACCGCTCCGG CAGCATGGGC GGCTGGAAGA TGACCGCCGC CCGCCGCGCC
GCAGCCCGCA TCGTCGACAC CCTGACCGCC GAAGACCGCT TCGCCGTCCT GACCTTCGAC
GACCAGATGG AAACCCCCGA CGGCCTCCCC ACCGGCCTGT CCGAAGCAAC CGACCGCCAC
CGCTTCCGCG CCGTCCAGCA CCTGGCAACC GTCGACGCCC GCGGCGGCAC CGAGATGGAA
CCACCCCTCC GCCGAGCCGC CACCCTCCTG TCCGACGACA ACCCCGACCG CGACAGAGTC
CTGATCCTCA TCACCGACGG CCAAGTAGGC AACGAGGACC GCCTCCTCAC CACCCTCAGC
CCCAAACTGA CCCACATCCG CGTCCACACA GTCGGCATCG ACACCGCCGT GAACGCCGCC
TTCCTCCAGC GCCTATCCAC CCTCGGCGGC GGCCACTGCG AACTCGTCGA ATCCGAAGAC
CGCCTCGACG ACGCCATGGA CGCCATTCAC CACCGCATCG CCACCCCCCT CGTGACCGGC
CTCCACCTGA CCGGAGTCGG CGGCCTCGAG CTAGAGCAGA ACTCGGTCAC GCCGACCCGA
CTGCCAGACC TCTTCGCCGG AGCACCGCTG GTGGTAGCCG GGCGACTCGG CGGGAAGATG
GTCGCGCTCG GCGCGACTGC GAGCGACAAG AATGCGGCGG CTGGCGAGAG CGCGGCAACT
AGCGCGAGCG CGGTCGCGCT CAGCGACTCG GCGGCGACCA GCAAGAGTAC GGCTGCGATC
GAGGGCTCGG CGGCGACCAG CAAGAGTACG GCTGCGATCG AGGGCTCGGC GGCGGCTGGC
GAGAATGCGG CTGCGGTCGG CGACTCGGTG GAGGCTGGCA AGAGTGCGGC TGTGCTCGGC
GACTCGGCGG AGGCTGGCGG GAGAGCGTCC GCACTCGGCA ATTCTGCGGC GAGCGACACG
AATGCAGCGG CCGTCGAGGG CGCGGCGGCT GGCGAGGCTG CGGCTGTGCT CGGCGGCTCG
GCAGAGGCTG GCAAGAGTGC GGATGCGGTC GATAGCTCGG CGACGGCTGG CGAGAATGCG
GCTGTGCTCG GCGACTCGGC GACGGCTGGC GAGACTGTGG CTGCGGTCGG CGGCGCGGCG
GCTGGCTCGG GCGAAGATGC GGCTGCGGCT GACACCATCC CCGGCATCGT CATCACCGGC
ACGGCGGCCG ATGGGTCCGC GTGGAGTCGC CGTGTGGCTG GGGTTGGGAC TGCGGATGCG
GGGCTTGGGC AGTTTTGGGC TCGGGGGCGG ATTCGTGATC TGGAGGATCG GTATGTCTCG
ACCTATGGCG GGCAGGCGGA GATCGAGCGG GCGATTGTTG CTGCCTCGGT TGAGCACAGT
GTGCTGTCGC GGTTCACGGC TTTCGTGGTG GTCGACAGTC GGGTGGTGAA CGCGGGCGGC
GCTCACAAGC AGGTCACGCA GCCGGTCGAT CTGCCTGATG GTTGGGCGGC GGACTTCGGG
GGTGCTCTGC CGGTCAGTGC TTCGATGCCC GCGCCGGCCG CCGATGCCCA TTACTCGAGT
CTCGACTTCG GCGCGCGGGT GTCGCGCAAG TCTGTACGTG CGCGGAGCGT CGGCGCGCCG
AAGCCCGGGC CGTTGGCGCC CGGCGGCGGA CCGGGGTACG GCGGCGCCAT GCCGGCTCCC
GGTGCGCCGA TGCCGTCGGT CCCGCCGTCG GTCCAGCCGT TGGCGCGGCC GTTGCTGCCG
CCGCTGGCGG AGCCCGCGAT CTCGGTCGAC TCGGCGGACG ACATGGCTGC CCTGCCCGAC
TTCTTGAAGG AATCCACCTC CGCCGCGGAC GAACTCCGCG CCCTCGCGGC GACCTGGCTC
CCCCGCCTCA CGGCCGCCGC GAACGACACC GCCGAGGCCC AGGAGAAGCT GCTGACCGAA
TTCGCCACGG AGCTGCGCGA GCTTGCCCCG CGCCTGTCGT CCACGCGCTC CGCCGATGCC
GGTGACCTCC TCAGCACCCT GACTGCCCTC GACAAGCCGT TCGACAAGCC GCTCGACAAG
CGCCGCCAGA GTGCCATCGC GTTCCTGGAA TCGGTGCAGC AGGCAGTGTC AGATTCGGAG
CCGACGACCA AGCCGCGGCG CGCGCCGTTC TGGAAGCGGT AG
 
Protein sequence
MTVVQTGVVD VVEEAPRPEP DGGLGALRTE RGNLPLELIE VRSAVAGLAV RTELAQGFRN 
PYDVPLEATY IFPLPDRAAV TRLRMEAADR VIEGVIKERE AARADYDAAI SAGQRASIAE
EERPGVFTLR VGNIMPGERV VIRTSLSGRL PYEDGQATFR FPLVVAPRYI PGADLPGEQV
GSGTASDTDQ VPDASRISPP ILLPGFPNPV RLSIEVAVDP VGLPLAGLTS SLHGVSVEES
EGSRYLVRLN PGARADRDFV LRLGYGGSGA ATSLAVAWDS ESANEVAKES AKATPTDIGT
FLLTVLPPEP TGATRPRDVA LILDRSGSMG GWKMTAARRA AARIVDTLTA EDRFAVLTFD
DQMETPDGLP TGLSEATDRH RFRAVQHLAT VDARGGTEME PPLRRAATLL SDDNPDRDRV
LILITDGQVG NEDRLLTTLS PKLTHIRVHT VGIDTAVNAA FLQRLSTLGG GHCELVESED
RLDDAMDAIH HRIATPLVTG LHLTGVGGLE LEQNSVTPTR LPDLFAGAPL VVAGRLGGKM
VALGATASDK NAAAGESAAT SASAVALSDS AATSKSTAAI EGSAATSKST AAIEGSAAAG
ENAAAVGDSV EAGKSAAVLG DSAEAGGRAS ALGNSAASDT NAAAVEGAAA GEAAAVLGGS
AEAGKSADAV DSSATAGENA AVLGDSATAG ETVAAVGGAA AGSGEDAAAA DTIPGIVITG
TAADGSAWSR RVAGVGTADA GLGQFWARGR IRDLEDRYVS TYGGQAEIER AIVAASVEHS
VLSRFTAFVV VDSRVVNAGG AHKQVTQPVD LPDGWAADFG GALPVSASMP APAADAHYSS
LDFGARVSRK SVRARSVGAP KPGPLAPGGG PGYGGAMPAP GAPMPSVPPS VQPLARPLLP
PLAEPAISVD SADDMAALPD FLKESTSAAD ELRALAATWL PRLTAAANDT AEAQEKLLTE
FATELRELAP RLSSTRSADA GDLLSTLTAL DKPFDKPLDK RRQSAIAFLE SVQQAVSDSE
PTTKPRRAPF WKR
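
To illustrate how the two sequences relate, the sketch below strips the 10-character block spacing and translates a nucleotide string. It assumes the gene sequence is written 5' to 3' on the coding strand. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons they permit, and the sketch does not model that). The function names are hypothetical, and only the first three blocks and the final blocks of the gene are used as input.

```python
from itertools import product

# Standard amino acid assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGACTGTTG TTCAAACCGG TGTCGTCGAC"))  # MTVVQTGVVD
print(translate("TGGAAGCGGT AG"))                     # WKR
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence listed above.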