Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1994 |
Symbol | |
ID | 8535153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2136601 |
End bp | 2138616 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646384376 |
Product | Vault protein inter-alpha-trypsin domain protein |
Protein accession | YP_003263863 |
Protein GI | 261856580 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.204714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGG GACTGTTATT CCTATTTGCC TGCCTGATTT TTTTCTCTGC TGCCTACGCA TTCGCCGACA GTGGGCGCGA ATGCACCAAC TGCGACCACA GCCTGGCGCC ACGATTATGG ATTCCAGACG GCGATCCTAA TACCGATCAC CTGCCACTCA AATCCAGCTC GGCCGATATC GTCATTGATG GTCCGATTGC TCGGGTAACC ATCACCCAGC GTTACCGAAA CGAAGGCACG CGCCCAATCA ATGCACGCTA CGTCTTTCCC GGATCGACGC ACGCTGCGGT TCAAAGCCTC ACCATGAACA TTGGCGATCG GATCATCAAA GCCAAAATCA AGGAAAAGGA AGAAGCCAAG AAAATATACG AAGTAGCGAA ACAAGCCGGA AAACATGCCG CATTGCTGGA ACAAAAACGT CCGAACGTAT TCATGATGAA CGTTGCGAAC ATCATGCCTG GCGACACCGT TGAACTGGTG CTGCAATACA GCGAATTACT GATTCCCGAT GATGGCGTAT ACCAGTTGGT CTATCCGACC GTGGTCGGGC CTCGTTACGG TGGAGATCCG ATTCGGGCCA CACCCCATAA CCGATGGATA GCCAATCCCT ACGCCAAAGA TAACACTGAC GGTTCAAACC CGGCCCAGAT CAAGACCGAC ATTCATGTGC GCATTGCCAG CCCGATTCCA ATCAGTGACC TGCGTTCGGC GCAACACAAG ATCGTCACGC ATTGGTTGAA TGACAAATCG GCGGAAATCA GTCTCGATCC ATCCGAAACA CACACCGGTA ATCGGGACTT CATCCTGAGT TTCAGACTCC AGGGCGCCAA GATCAATTCA GGCCTGATGA CCTACGAATG GAACGGCGAG CATTATTTTT TGATGATGGC TCAACCGCCC AAGCGGGTTG CACCCACAGA AGTGATGAAG CGCGAATATC TGTTCGTGGT CGATGTCTCC GGCTCCATGT ACGGCTTCCC CCTCAACACG GCCAGCGACC TGATGCGCGA ACTGCTTAGC AGCCTGAAGC CACAGGAAAC CTTCAACATA CTGTTCTTCT CGGGTGGTTC ACGCGTGCTG TCACCCACCC CGTTGCAAGC AACACCAGAA AACCTGCAAC GGGCCATGAC CATGATGCGG AGCATTCAGG GTGGCGGCGG CACCGAATTG CTCCCGGCAC TGAAAACAGC GTTTGCGATG CCGCGCACGG AAGATACCGC CCGCAGTATC GTTGTCATCA CGGACGGTTA TGTCGACGTC GAGCGGCAAG CCTACGATTT GATCAAACAA AACCTGAACT CGACCAATCT CTTTGCCTTC GGCATCGGAT CGTCGGTCAA TCGCTATCTG ATGGAGAGCA TGGCCCATGC CGGTCAGGGC GAACCGTTCA TCATCACAGG CCCGAATGAC GTACCGGGTG TCGGTGCACG GTTCCGTCGC TATGTGGACG CACCGGTTCT CAGCCATATC AAAATCCGGG GCAATGGCGT GGAACTGTAC GATACGGAGC CGTCTGAAAT CCCGGTCATG CTCGCCGAGA GACCCATCGT AATCTTCGGC AAATACCGCA ATGCGCAACC CGGCGCCACG CTGGAACTCA CGGGCACCCG TGCCACGGGG GAATACCGCG CTACCCTATC GTTGGACGAC TCCAACGGGC AGGCCGACAA GAATCAAGCC GAACTGCTGC CCGTACTGTG GGCCCGCCAA CGACTGATGT ATCTTTCCGA TCTTCAGGGC GACGACGACG CGCATCGGGA TGAAATCATC CGTCTCGGCC TGCGTTACTC TCTGCTGACG CGATACACCT CGTTCGTCGC GGTCGATGAA ACGATCAGCA ACCCGAACGG CAATACCACG GACGTCAAAC AACCGCTGCC GCTGCCACAA GGTGTGTCCG AGTTGGCCGT GGCACAGCCC GTACCCGAAC CCAGCCTGTA CTGGTTGCTG CTTGCGCTGG CTGTCCTGTT CGCTTCAGAT CGACTGTTCC GGAAGAATCG CCATGTTCCG CACTGA
|
Protein sequence | MKRGLLFLFA CLIFFSAAYA FADSGRECTN CDHSLAPRLW IPDGDPNTDH LPLKSSSADI VIDGPIARVT ITQRYRNEGT RPINARYVFP GSTHAAVQSL TMNIGDRIIK AKIKEKEEAK KIYEVAKQAG KHAALLEQKR PNVFMMNVAN IMPGDTVELV LQYSELLIPD DGVYQLVYPT VVGPRYGGDP IRATPHNRWI ANPYAKDNTD GSNPAQIKTD IHVRIASPIP ISDLRSAQHK IVTHWLNDKS AEISLDPSET HTGNRDFILS FRLQGAKINS GLMTYEWNGE HYFLMMAQPP KRVAPTEVMK REYLFVVDVS GSMYGFPLNT ASDLMRELLS SLKPQETFNI LFFSGGSRVL SPTPLQATPE NLQRAMTMMR SIQGGGGTEL LPALKTAFAM PRTEDTARSI VVITDGYVDV ERQAYDLIKQ NLNSTNLFAF GIGSSVNRYL MESMAHAGQG EPFIITGPND VPGVGARFRR YVDAPVLSHI KIRGNGVELY DTEPSEIPVM LAERPIVIFG KYRNAQPGAT LELTGTRATG EYRATLSLDD SNGQADKNQA ELLPVLWARQ RLMYLSDLQG DDDAHRDEII RLGLRYSLLT RYTSFVAVDE TISNPNGNTT DVKQPLPLPQ GVSELAVAQP VPEPSLYWLL LALAVLFASD RLFRKNRHVP H
|
| |