Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_4584 |
Symbol | vpR |
ID | 2818051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 4163410 |
End bp | 4166163 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637791277 |
Product | minor extracellular protease VpR |
Protein accession | YP_021229 |
Protein GI | 47529880 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTACATCTAT ACTTTTAAGT ATGGCACTCG TCTTTTCTAG TTTTGGAGCT TTAAGCGCCC ATGCTGAATC ACTGCAAAAG GAGAAGCAAT TTAGTCCACA ACTAAAAACA ACGATTGAAC AATGGGGAGA AAGTAAAATT GCTCAACACG TTGAAACAAA AACAACAAAA GAAATATCTG TAATTGTAGA ATTGCAACAT GCACCTCTTG CTGCGCAAAG TAACATTCAG CATGCTCCAG ATTTACAAAA TAGTCATGCA CAGTCTTATC ATACCGAGCT AAAAAAAGCA CAAGAAGAGA CGACTAAGAA AATAAAAGAA AAAGCACCTG GTGCAAAAAT TAAAGAAGTT TATAATACGT TATTTTCTGG TTTCTCTATT TCAGTTCCAG GAGATCAAAT TACTGCTCTT GCTTCTTTAC CTGAAGTAAA GACAGTCTAT CCGAACTTAA CATATAAATT GCATGAAACA ACGAAAAGCG CTACTAGCGA AGAAGCACCT AATATTGGTG GACCGACAAT TGGTGCTCCT GAAGCATGGA ATTTAAAAGA CCCATCTGGT AAATCTCTTG ATGGAAAAGG TATGAAAGTA GCGATTATCG ATTCTGGCGT AGACTATACA CATCCTGACT TAAAGGCAAA TTATATTGGT GGATACGACA CAGTGGATGA AGATGCTGAT CCAATGGATG GTAACGTACA CGGTACTCAT GTAGCTGGAA TTATTGCTGG TAATGGAAAA ATTAAAGGCG TTGCTCCAAA TGCTTCTATT TTAGCTTATC GTGTCATGAA TGACGGTGGT ACTGGTACAA CAGATGATAT TATTCAAGGT ATTGAACGAG CAATTCAAGA TGGTGCTGAT GTGTTAAATC TATCCCTTGG ACAAGATTTA AATGTACCTG ATCAGCCTGT AACTTTAACG TTAGAACGTG CAGCAAAGCT TGGGATTACT GCAGTCGTTT CCAATGGAAA TGATGGTCCA AAACCTTGGT CTGTTGATGC ACCTGGAAAT GCAAGTAGCG TTATATCAGT TGGAGCATCT ACAGTTTCTA TTCCGTTTCC AACATTCCAA GTAGCTGGTT CCAGCAAAAC ATACCAAGGA TTATCGTTAT CAAAATCAGA TTTCCCAATA GGAAATGATT CTCCACTTGT ATATGTTGGC TATGGTAATC CAAGCGATTA TGCAAAACAA GATGTGAAAG GAAAATTTGC ACTTGTTTTA CAAGGTACTT CTAGCACGTT AGTAAAAGCT GAACAAGCGA AGCAAGCTGG TGCAATTGGT GTACTATTCA TTTCTACAGA AAAAGAAATG AATAGTATGC CTGAATACTT TACACGTGAA AACCTAGCCC TTCCAGTTAT GCAACTATCC AATGTAAACG GTGAAGAGTT GAAAAATTTA ATTACAAAGC GTAAGAAAAA TATAAAGATC GGACAACCGG TTCCGACTGA ATTAATTGGA AACTTTAGTT CCAGAGGTCC ATCACAAGGA AGCTGGCTTA TAAAACCAGA TATCGTTGCA CCTGGCGTAC AAATTACTAG TACCGTACCA CGAGGTGGCT ATGAATCTCA TAACGGAACA AGTATGGCTG CTCCGCAAGT AGCTGGAGCG GTTGCCCTCC TGCGTCAAAT GCACCCTGAT TGGACGACGC AACAATTAAA AGCATCACTT GCCAATACCG CAAAAACTTT AAAAGATGTG AATGAAAATA CATATCCTAT TATGACACAA GGATCTGGTT TAATTAACAT CCCGAAAGCA GCTCAAACAG ATGTATTAGT CAAACCTAAC AATGTGAGCT TCGGTCTTAT TAAGCCAAAC AGTGGTAAAG TAAAACTGAC GCAAAATATT ACGTTACAAA ATCTATCTAG TAAAAAGAAA AGTTTTTCAA CTCGTGTGGA ATTGCTAGAT ACAAACACAA AAACAAAAGT AAAAACTTCT GTACCTTCAT CGATTAGCAT ACAGCCGAAT AGTAGTACCG AAAAACCATT TACTATCACT GTCGATAGCT CACTACCACA AGGTGTGTAT ACTGGGAATG TATATGTAAA AGAGCAGGGA GCGAAAGAAG AAACTCGCAT TCCATTTACA TTTAGTATCG ATCCTAAAGA TTACAAACGT ATAGATGGAC TCGAAATTAT TAATTCTACA TTTAGTCCAA ATGGTGACCA AATATTAGAC GATAATCTCA TCAACTACTA TTTAGTTGCA CCTGTGGATG ATGTAACATT GCATGCAAAT TTAGTTACAA AAGAACGTGT AACGTACCAA GGGATTATCC ATCAAGCTAA AAATGAAACA GCTGGATACA AACCTTTCAA ATGGGATGGT ACAAAAGCAG ATGGTACTCC GTTAGCTGAC GGTCTATACC AAATCGAAGC AGTTGCCTCT AATTCTGGTG GGGAAACAAA ACAAACAGCT GCTGTATTTC TTGATAGAAC TGCACCTAAG TTAACATACG AAATTGACCA AGAAAACCTT GTAATCACAG GAAAAGTGGA TGATATCTTA CTAGATTGGA TGACAGAATC TGGTTGGGTA GCACCTGGAA TTCCAGTGAG ACTACAGTAT GAAATCAACG GAAATGGTGT ATGGGAAAGT GCATTCCTAA ATCCTTGGGA GAAAAATTAC GGCATTTATT TAGATCGTAC TCAATTACAA GAAGGTAAAA ATACAATTCA CATTGTAGCA ACAGATGCAG CTGGAAATAC ATCTAATTTA AATGTTGATT TAGATGTGAA ATAA
|
Protein sequence | MKKTTSILLS MALVFSSFGA LSAHAESLQK EKQFSPQLKT TIEQWGESKI AQHVETKTTK EISVIVELQH APLAAQSNIQ HAPDLQNSHA QSYHTELKKA QEETTKKIKE KAPGAKIKEV YNTLFSGFSI SVPGDQITAL ASLPEVKTVY PNLTYKLHET TKSATSEEAP NIGGPTIGAP EAWNLKDPSG KSLDGKGMKV AIIDSGVDYT HPDLKANYIG GYDTVDEDAD PMDGNVHGTH VAGIIAGNGK IKGVAPNASI LAYRVMNDGG TGTTDDIIQG IERAIQDGAD VLNLSLGQDL NVPDQPVTLT LERAAKLGIT AVVSNGNDGP KPWSVDAPGN ASSVISVGAS TVSIPFPTFQ VAGSSKTYQG LSLSKSDFPI GNDSPLVYVG YGNPSDYAKQ DVKGKFALVL QGTSSTLVKA EQAKQAGAIG VLFISTEKEM NSMPEYFTRE NLALPVMQLS NVNGEELKNL ITKRKKNIKI GQPVPTELIG NFSSRGPSQG SWLIKPDIVA PGVQITSTVP RGGYESHNGT SMAAPQVAGA VALLRQMHPD WTTQQLKASL ANTAKTLKDV NENTYPIMTQ GSGLINIPKA AQTDVLVKPN NVSFGLIKPN SGKVKLTQNI TLQNLSSKKK SFSTRVELLD TNTKTKVKTS VPSSISIQPN SSTEKPFTIT VDSSLPQGVY TGNVYVKEQG AKEETRIPFT FSIDPKDYKR IDGLEIINST FSPNGDQILD DNLINYYLVA PVDDVTLHAN LVTKERVTYQ GIIHQAKNET AGYKPFKWDG TKADGTPLAD GLYQIEAVAS NSGGETKQTA AVFLDRTAPK LTYEIDQENL VITGKVDDIL LDWMTESGWV APGIPVRLQY EINGNGVWES AFLNPWEKNY GIYLDRTQLQ EGKNTIHIVA TDAAGNTSNL NVDLDVK
|
| |