Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4460 |
Symbol | |
ID | 8335814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5077972 |
End bp | 5081073 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957562 |
Product | Vault protein inter-alpha-trypsin domain protein |
Protein accession | YP_003115164 |
Protein GI | 256393600 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.743014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTTG TTCAAACCGG TGTCGTCGAC GTCGTCGAGG AAGCGCCGCG CCCGGAGCCG GACGGCGGCC TCGGGGCGCT GCGGACCGAG CGCGGGAACC TGCCGCTGGA GCTGATCGAG GTGCGCTCGG CGGTGGCGGG GCTGGCCGTG CGCACCGAGC TGGCGCAGGG CTTCCGCAAT CCGTACGACG TGCCGCTGGA GGCGACGTAC ATCTTCCCGC TGCCGGACCG CGCCGCGGTG ACGCGGCTGC GCATGGAGGC CGCGGACCGC GTGATCGAGG GCGTCATCAA GGAGCGGGAG GCGGCGCGCG CCGACTACGA CGCGGCGATC TCCGCGGGCC AGCGCGCCTC GATCGCCGAG GAGGAGCGCC CGGGCGTCTT CACGCTGCGG GTCGGCAACA TCATGCCCGG CGAGCGCGTG GTGATCCGTA CGTCGCTGTC GGGGCGCCTG CCGTATGAGG ACGGACAGGC CACGTTCCGG TTCCCCTTGG TCGTCGCGCC GCGCTACATC CCCGGCGCCG ACCTGCCCGG GGAGCAGGTG GGCAGCGGCA CGGCGTCGGA CACCGACCAG GTACCCGATG CCTCGCGCAT CAGCCCACCG ATCCTGCTGC CCGGGTTCCC GAACCCGGTC CGGCTGTCGA TCGAAGTGGC GGTCGACCCG GTGGGGCTGC CGCTGGCCGG GCTGACGTCG AGCCTGCACG GGGTGAGCGT GGAGGAGTCG GAGGGCTCTC GATATCTGGT GCGGCTGAAC CCCGGGGCGC GCGCCGACCG GGACTTCGTG CTGCGCTTGG GGTACGGCGG TTCGGGCGCG GCGACATCCT TGGCCGTCGC ATGGGACAGC GAGTCCGCGA ACGAGGTCGC GAAGGAATCC GCGAAAGCGA CGCCCACTGA CATCGGCACA TTCCTGCTGA CCGTCCTGCC CCCGGAACCG ACCGGCGCCA CCCGCCCGCG TGACGTCGCG CTGATCCTGG ACCGCTCCGG CAGCATGGGC GGCTGGAAGA TGACCGCCGC CCGCCGCGCC GCAGCCCGCA TCGTCGACAC CCTGACCGCC GAAGACCGCT TCGCCGTCCT GACCTTCGAC GACCAGATGG AAACCCCCGA CGGCCTCCCC ACCGGCCTGT CCGAAGCAAC CGACCGCCAC CGCTTCCGCG CCGTCCAGCA CCTGGCAACC GTCGACGCCC GCGGCGGCAC CGAGATGGAA CCACCCCTCC GCCGAGCCGC CACCCTCCTG TCCGACGACA ACCCCGACCG CGACAGAGTC CTGATCCTCA TCACCGACGG CCAAGTAGGC AACGAGGACC GCCTCCTCAC CACCCTCAGC CCCAAACTGA CCCACATCCG CGTCCACACA GTCGGCATCG ACACCGCCGT GAACGCCGCC TTCCTCCAGC GCCTATCCAC CCTCGGCGGC GGCCACTGCG AACTCGTCGA ATCCGAAGAC CGCCTCGACG ACGCCATGGA CGCCATTCAC CACCGCATCG CCACCCCCCT CGTGACCGGC CTCCACCTGA CCGGAGTCGG CGGCCTCGAG CTAGAGCAGA ACTCGGTCAC GCCGACCCGA CTGCCAGACC TCTTCGCCGG AGCACCGCTG GTGGTAGCCG GGCGACTCGG CGGGAAGATG GTCGCGCTCG GCGCGACTGC GAGCGACAAG AATGCGGCGG CTGGCGAGAG CGCGGCAACT AGCGCGAGCG CGGTCGCGCT CAGCGACTCG GCGGCGACCA GCAAGAGTAC GGCTGCGATC GAGGGCTCGG CGGCGACCAG CAAGAGTACG GCTGCGATCG AGGGCTCGGC GGCGGCTGGC GAGAATGCGG CTGCGGTCGG CGACTCGGTG GAGGCTGGCA AGAGTGCGGC TGTGCTCGGC GACTCGGCGG AGGCTGGCGG GAGAGCGTCC GCACTCGGCA ATTCTGCGGC GAGCGACACG AATGCAGCGG CCGTCGAGGG CGCGGCGGCT GGCGAGGCTG CGGCTGTGCT CGGCGGCTCG GCAGAGGCTG GCAAGAGTGC GGATGCGGTC GATAGCTCGG CGACGGCTGG CGAGAATGCG GCTGTGCTCG GCGACTCGGC GACGGCTGGC GAGACTGTGG CTGCGGTCGG CGGCGCGGCG GCTGGCTCGG GCGAAGATGC GGCTGCGGCT GACACCATCC CCGGCATCGT CATCACCGGC ACGGCGGCCG ATGGGTCCGC GTGGAGTCGC CGTGTGGCTG GGGTTGGGAC TGCGGATGCG GGGCTTGGGC AGTTTTGGGC TCGGGGGCGG ATTCGTGATC TGGAGGATCG GTATGTCTCG ACCTATGGCG GGCAGGCGGA GATCGAGCGG GCGATTGTTG CTGCCTCGGT TGAGCACAGT GTGCTGTCGC GGTTCACGGC TTTCGTGGTG GTCGACAGTC GGGTGGTGAA CGCGGGCGGC GCTCACAAGC AGGTCACGCA GCCGGTCGAT CTGCCTGATG GTTGGGCGGC GGACTTCGGG GGTGCTCTGC CGGTCAGTGC TTCGATGCCC GCGCCGGCCG CCGATGCCCA TTACTCGAGT CTCGACTTCG GCGCGCGGGT GTCGCGCAAG TCTGTACGTG CGCGGAGCGT CGGCGCGCCG AAGCCCGGGC CGTTGGCGCC CGGCGGCGGA CCGGGGTACG GCGGCGCCAT GCCGGCTCCC GGTGCGCCGA TGCCGTCGGT CCCGCCGTCG GTCCAGCCGT TGGCGCGGCC GTTGCTGCCG CCGCTGGCGG AGCCCGCGAT CTCGGTCGAC TCGGCGGACG ACATGGCTGC CCTGCCCGAC TTCTTGAAGG AATCCACCTC CGCCGCGGAC GAACTCCGCG CCCTCGCGGC GACCTGGCTC CCCCGCCTCA CGGCCGCCGC GAACGACACC GCCGAGGCCC AGGAGAAGCT GCTGACCGAA TTCGCCACGG AGCTGCGCGA GCTTGCCCCG CGCCTGTCGT CCACGCGCTC CGCCGATGCC GGTGACCTCC TCAGCACCCT GACTGCCCTC GACAAGCCGT TCGACAAGCC GCTCGACAAG CGCCGCCAGA GTGCCATCGC GTTCCTGGAA TCGGTGCAGC AGGCAGTGTC AGATTCGGAG CCGACGACCA AGCCGCGGCG CGCGCCGTTC TGGAAGCGGT AG
|
Protein sequence | MTVVQTGVVD VVEEAPRPEP DGGLGALRTE RGNLPLELIE VRSAVAGLAV RTELAQGFRN PYDVPLEATY IFPLPDRAAV TRLRMEAADR VIEGVIKERE AARADYDAAI SAGQRASIAE EERPGVFTLR VGNIMPGERV VIRTSLSGRL PYEDGQATFR FPLVVAPRYI PGADLPGEQV GSGTASDTDQ VPDASRISPP ILLPGFPNPV RLSIEVAVDP VGLPLAGLTS SLHGVSVEES EGSRYLVRLN PGARADRDFV LRLGYGGSGA ATSLAVAWDS ESANEVAKES AKATPTDIGT FLLTVLPPEP TGATRPRDVA LILDRSGSMG GWKMTAARRA AARIVDTLTA EDRFAVLTFD DQMETPDGLP TGLSEATDRH RFRAVQHLAT VDARGGTEME PPLRRAATLL SDDNPDRDRV LILITDGQVG NEDRLLTTLS PKLTHIRVHT VGIDTAVNAA FLQRLSTLGG GHCELVESED RLDDAMDAIH HRIATPLVTG LHLTGVGGLE LEQNSVTPTR LPDLFAGAPL VVAGRLGGKM VALGATASDK NAAAGESAAT SASAVALSDS AATSKSTAAI EGSAATSKST AAIEGSAAAG ENAAAVGDSV EAGKSAAVLG DSAEAGGRAS ALGNSAASDT NAAAVEGAAA GEAAAVLGGS AEAGKSADAV DSSATAGENA AVLGDSATAG ETVAAVGGAA AGSGEDAAAA DTIPGIVITG TAADGSAWSR RVAGVGTADA GLGQFWARGR IRDLEDRYVS TYGGQAEIER AIVAASVEHS VLSRFTAFVV VDSRVVNAGG AHKQVTQPVD LPDGWAADFG GALPVSASMP APAADAHYSS LDFGARVSRK SVRARSVGAP KPGPLAPGGG PGYGGAMPAP GAPMPSVPPS VQPLARPLLP PLAEPAISVD SADDMAALPD FLKESTSAAD ELRALAATWL PRLTAAANDT AEAQEKLLTE FATELRELAP RLSSTRSADA GDLLSTLTAL DKPFDKPLDK RRQSAIAFLE SVQQAVSDSE PTTKPRRAPF WKR
|
| |