Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_2383 |
Symbol | |
ID | 8883578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 2526660 |
End bp | 2529155 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Vault protein inter-alpha-trypsin domain-containing protein |
Protein accession | YP_003511161 |
Protein GI | 291299883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.151834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00156967 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTTGC GCATCGCCGA CCTGCCCGTC TTCAACACCG AGCCGACCGA GGCGACGGAG GGCTCCGGCC TGGGAACGCT GGCCACCGAG CGCGGCAACC TGCCACTGCG CGGCCTGGAC ATCAACTGCC ACGTCACCGG CCTGGGCGTC CGCACCGTCG TCACGCAGCG ATTCCACAAT CCACACGGCG AGCCCATCGA GGCGACCTAC ATCTTCCCGC TGCCCGAACG CGCGGCGGTC ACCGATATGA CGATGACGGT CGCCGAGCGC ACGGTCACCG CCGAGCTCCA CGAACGCGCC AAGGCACGGC AGCTGTACGA CACCGCGATC AGCGAGGGCA AACGCGCCTC CATCGCCGAG GCCGAACGCG CCGACGTGTT CACCATGCGG GTCGGGAACC TCGGTGCCGG CGAGGAAGCC GTCGTCACCC TCACCCTCGT CGGCCCACTG GCCTTCGAGG ACAACGAGGC GACGCTGCGG CTGCCGCTGG TCGTCGCGCC CCGCTACATC CCCGGCCAGC CCACCGGTGC CGCACCGGTG GGGGAGGGCT ATGCCGAGGA CACCGACGCG GTCCCCGACG CCTCCCGCAT CACCCCGCCG GTGCTGTTGC CCGGCTTCCC CAACCCGGTG CGACTGTCCA TCGAGGTCAC CATCGACCCG GCCGGACTGC CGTTGCGGCA ACTGCGTTCC AGCCTGCACG CCGTCACAGT GGACGAGACC GGCGAGGTCA CCCGGGTGCG GATCGAACCC GGCGAGCGGG TCAACCGTGA TTTCATCCTG CGCTTCGACT ACGGCGAGTC CGGCGACGTC GCCGGCTCCC TGCTGACCGC TCCGGACGAG AACGAGCCGA CCAGCGGCAC CTTCCAGCTG ACCGCCATCC CGCCGTCCGA CCTGCCCCGG GCCCGGCCCC GCGACGTCGT GGTCCTGCTG GACCGTTCCG GCAGCATGGG CGGCTGGAAG ATGGTCGCCG CCCGTCGCGC CGCCGCCCGC ATCGTCGACA CCCTGTCCAG CGCCGACCGC TTCGCCGTCC GCTGCTTCGA CACCGCCATG ACCAGCCCGG AAGGCTTGGA CCCCAACGGT TTGAGCGCCG GAACCGACCG CAACCGGTTC CGCGCCGTCG AACACCTCGC GGGCACCGAG ACCCGCGGCG GCACCGACAT CCTCAAGCCC CTGTCCACGG CTGTCGACCT GCTGACGGCG GGCGAGAAGG GCCGCGACCG CGTCATCATC CTGGTCACCG ACGGCCAGGT CGGCAACGAG GACCAGATCC TGCGCGAACT CACCGGACGG CTGTCGGGCA TGCGGGTCCA CGTCGTCGGC ATCGACAAGG CCGTGAACGC CGGTTTCCTG CACCGGCTGG CCCTGGTGGG CCGGGGCCGC TGCGAACTCG TCGAATCCGA GGACCGGCTC GACGAGGCCA CCGCCCACAT CCACCGCCGG ATCGTCGCCC CCGTCGTCAC CGACCTCACC GTCACCGGCG AGGGCCTCGA CCTGGAACCC GAAACCTTGG CACCCCACCG CATCCCCGAC CTGTTCACCG GCGCCCCGCT GATCATCAGC GGCCGCTACC ACGGCGCGGG AACCACACCC CGCCTGAAGC TCACCGGCAC CAGCCAGGAC GGCACCCCGT GGACCAGCGA ACTCGCGGCC CGCACCGACG ACACCGCCCT GACCTGCCCC GCCTGGGCCC GCGCCCACCT GCGCGACCTG GAGGACCGCT ACGCCTCAGC CCCCGGCTCA CCCGACCTGG GCGACCTCGA GAAACGCATC GTCGACGTCT CCCTGCGCCA CCGGGTGCTG TCCCGGTTCA CCTCCTTCGT CGCCGTGGAC AGCCGCGTCG TCACCGAAGG CGGCAAACCC CGCACGGTCG TCCAGCCGGT CGAGATGCCG GAAGGCTGGG ACATGCCCAC CCCCGCCGCC CCCTCGCCCT ACCTGGGAGC CAGCGTCCGC ATGATGGCCG CTCCCGCCGC CGCCCCCGAG GCGATGGCCC AGGGCTACGG CGCCGCCCCG CCCCCGCCCG CCCAACCCGG CGGAGCCGCC CCGTCCTTCG CCCGCCCCGC CGCCCCCCGC AAGGCTCCCA ACCGCGGCTT CGGCAAATCG GCGGGCGGTC CCGGCCAACC CCTCATGGAC GTCGACCAGA TCCGCCAGAT CCTGCACGAC GAATGGCGCA TCCTCGACCC CGAAGTCCAC ACCATGGAGT TCAAGGCCCC CGAGGTCGTC GCCCGGGAAC GCCGAGCGGC CCTGTCCGAC CTGGCCACCC GCCTCGGCGT CGTCATCGAC GCCATGAGGG ACACGGGAGC CTTCGACGGC GACACGGTGA CCGCCCTACG CGACCTCCTG CCCCGCATGG AGGCCTGCGA ACGCCCCAAC CCACCAACCG GAGACGACCT GGCATCGCTG TGGCAGCGCA CCATCGAACT CCTGCGAACC CTGAGCGAGA CCGGCGGCGC CACCCCGCCG TCGTCAACGG ATCCCAAACC GTTCTGGAAG CGGTGA
|
Protein sequence | MSLRIADLPV FNTEPTEATE GSGLGTLATE RGNLPLRGLD INCHVTGLGV RTVVTQRFHN PHGEPIEATY IFPLPERAAV TDMTMTVAER TVTAELHERA KARQLYDTAI SEGKRASIAE AERADVFTMR VGNLGAGEEA VVTLTLVGPL AFEDNEATLR LPLVVAPRYI PGQPTGAAPV GEGYAEDTDA VPDASRITPP VLLPGFPNPV RLSIEVTIDP AGLPLRQLRS SLHAVTVDET GEVTRVRIEP GERVNRDFIL RFDYGESGDV AGSLLTAPDE NEPTSGTFQL TAIPPSDLPR ARPRDVVVLL DRSGSMGGWK MVAARRAAAR IVDTLSSADR FAVRCFDTAM TSPEGLDPNG LSAGTDRNRF RAVEHLAGTE TRGGTDILKP LSTAVDLLTA GEKGRDRVII LVTDGQVGNE DQILRELTGR LSGMRVHVVG IDKAVNAGFL HRLALVGRGR CELVESEDRL DEATAHIHRR IVAPVVTDLT VTGEGLDLEP ETLAPHRIPD LFTGAPLIIS GRYHGAGTTP RLKLTGTSQD GTPWTSELAA RTDDTALTCP AWARAHLRDL EDRYASAPGS PDLGDLEKRI VDVSLRHRVL SRFTSFVAVD SRVVTEGGKP RTVVQPVEMP EGWDMPTPAA PSPYLGASVR MMAAPAAAPE AMAQGYGAAP PPPAQPGGAA PSFARPAAPR KAPNRGFGKS AGGPGQPLMD VDQIRQILHD EWRILDPEVH TMEFKAPEVV ARERRAALSD LATRLGVVID AMRDTGAFDG DTVTALRDLL PRMEACERPN PPTGDDLASL WQRTIELLRT LSETGGATPP SSTDPKPFWK R
|
| |