Gene Information |
Locus tag | Sbal_1843 |
Symbol | |
ID | 4845687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 2140893 |
End bp | 2143208 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640119064 |
Product | vault protein inter-alpha-trypsin subunit |
Protein accession | YP_001050218 |
Protein GI | 126174069 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
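The coordinate and length fields above agree with each other, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in Protein Length. The record does not state either assumption, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Sbal_1843.
length = gene_length_bp(2140893, 2143208)
print(length)                     # 2316, matches "Gene Length"
print(protein_length_aa(length))  # 771, matches "Protein Length"
```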
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.327396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACAG GGAAAAGGGT AAAAGAAATC AGCGAAAGCT TAGCGATATT AATGATCAGT GTCGCCGTAT GTTGGGGCTT GCCCTTTGTG GCATTAGCCT CACCCAATAG CACGGGCACA TTGTCTGAGC CGCAGAATAT TGCCTCGGCG CTGGTCGGCC AAAATACTGC TCAAAGTACG GCAGAATCAC AGGTCATTAA TTACGATGAC ATTACTCAGG GTTTGTTGCT TTATCGTCAG CCGAGCGGCG CTTGGGTGCC ATCTTTGCCA CTCGATACAC AAGTGTCGAT GCAAGTCTCT GGCTTAAGTA ATCGCGTGAG TGTGAAGCAG GTTTTTCGCA ATAACACTGA ATTTGTGTTG AATGGGCAAT ATCTGTTCCC GCTGCCCAAT GAGGCCGCGG TGGATTCGCT GCGATTACAT ATCGGCCAAA GGGTCATCGA AGGGCAAATT CATCCTAAGG CCGAGGCTAA GCAGATTTTT GAGCAAGCCA AAGCGGAGGG CAAACGCGCC AGCTTAGTGA GCCAAGAGCG GCCGAATATG TTCACGACCG AAGTGGCCAA CCTAGCACCG CAGGAAGAGT TGATTGTCGA AATCAGCTAT CAGGAGAATA TCAAATATGA AGACGGCTTA TTTAGCCTAC GTTTTCCGCT GGTGGTCGCG CCTCGGTATA TCCCGGGCTT AACCTCTGAC GCAAGTGCTA GCGATTCAAA TAATCCTATA GCACAGAGCA GCCAATCGAG CCGAGTGACC AGCTCGCAGG TGTTTGATGC CGACCGGATT GTGGCGCCTG TGCGGGACGG CGCCAGTGGA CGCGATCCTG TGCTCAAGGC GGACATTCAA GTCTTGCTGG CAAAGGGGGT TGATAAGGCG TCAATTGAGA GTCCTTACCA TGACATCAAA CTCAAGCAAA CCAATAGCGG CGTCGATGTT TCACTGGCAC AACGCGTGCC TGCCAATCGT GACTTTGTAC TGCAATGGCG CGTACAGCAG GGTACGAGTC CAATGGCTTG GGTGTTTAAT CAACAGGGTA AGACCCATAA ACCCGATGGC GATAATTTGT CTCAGGACAC GCTTGAAACC AGCAAAGCTA ACGGCGTAAA TGAAGATAAT TATAGTCTCG TCATGGTGTT GCCTCCTAAG GTCGAAAAGA GCACCCAACC AAGTTTGCCC CGTGAATTAA TTTTGGTGAT TGATACTTCG GGCTCTATGG CGGGGGATTC TATCGTTCAA GCCAAAAATG CACTGCTTTA TGCATTAAAA GGACTCAAGC CCGAGGACAG TTTTAACATT ATCGAATTTA ACTCCAGTCT GTCTCAGTTT TCAGCGACAC CATTACCTGC GACTTCGTCT AATTTGTCCC GCGCCCGTCA ATTTGTGAGC CGTTTACAAG CCGATGGTGG CACTGAAATG GCGCTGGCAC TGGATGCGGC CTTGCCAAAA TCTTTGGGCA GTGCCCCGTC CGATGCTGTG CAGCCTTTGC GCCAAGTGAT CTTTATGACC GACGGCTCTG TGGGCAATGA GCAAGCCTTG TTCGATTTGA TCCGCTATCA AATCGGAGAG AGCCGCTTAT TTACTGTCGG CATAGGCTCA GCGCCTAATT CTCACTTTAT GCAAAGGGCG GCAGAGCTGG GTCGTGGCAC TTTTACTTAT ATTGGCAAAG TGGATGAAGT CGGCGAGAAG ATCAGCGCCT TGCTGAGCAA AATTCAATAT CCCTTATTAA CCGATATTCA AGTGCGTTTT GATGATGGCA GTGTGCCCGA TTACTGGCCA TCACCGATTG CCGATCTGTA TCGCGGCGAG 
CCTGTGCTCG TAAGTCTAAA ACGCAGCGCC CGTGAGCCGC AGGAATTAGT GATCTTAGGT CGCCAAGGGC ATAAAAACTG GCAACAGTCG TTATCGCTTC AGGACAACTC AGCGGGACTT ATCACCAATC AAGGTGCGGG TTTAGATTTA CTGTGGGCGC GTAAACAGAT TGGTGCATTG GAGCTGAGTA AAAATGGCGC AAATGATGAC AAGGTAAAGC AGCAAGTCAC GGCTTTGTCG ATGAATTATC ACTTAGTCAG CGCTTATACC AGCTTAGTGG CGGTGGACTT AACCCCCATA GATTCAACTG CAATGAGCCG CGATGCAGTG GTGCGTCAGC ACTTGCCCTT AGGTTGGCAG CCTTTTGGTG CCTTGCCACA AACTGCGACC TCGAGCCGTT TGGATATGTT ATTGGGTGCG CTTACCTTGC TGCTGGCCTT GGTACTTATG GGATCGATTC TGCGACAAAA ATGTGAAGAG AAGGCAGCCA TGATGGCGCT TGCCTGTCAT CGATGA
|
Protein sequence | MITGKRVKEI SESLAILMIS VAVCWGLPFV ALASPNSTGT LSEPQNIASA LVGQNTAQST AESQVINYDD ITQGLLLYRQ PSGAWVPSLP LDTQVSMQVS GLSNRVSVKQ VFRNNTEFVL NGQYLFPLPN EAAVDSLRLH IGQRVIEGQI HPKAEAKQIF EQAKAEGKRA SLVSQERPNM FTTEVANLAP QEELIVEISY QENIKYEDGL FSLRFPLVVA PRYIPGLTSD ASASDSNNPI AQSSQSSRVT SSQVFDADRI VAPVRDGASG RDPVLKADIQ VLLAKGVDKA SIESPYHDIK LKQTNSGVDV SLAQRVPANR DFVLQWRVQQ GTSPMAWVFN QQGKTHKPDG DNLSQDTLET SKANGVNEDN YSLVMVLPPK VEKSTQPSLP RELILVIDTS GSMAGDSIVQ AKNALLYALK GLKPEDSFNI IEFNSSLSQF SATPLPATSS NLSRARQFVS RLQADGGTEM ALALDAALPK SLGSAPSDAV QPLRQVIFMT DGSVGNEQAL FDLIRYQIGE SRLFTVGIGS APNSHFMQRA AELGRGTFTY IGKVDEVGEK ISALLSKIQY PLLTDIQVRF DDGSVPDYWP SPIADLYRGE PVLVSLKRSA REPQELVILG RQGHKNWQQS LSLQDNSAGL ITNQGAGLDL LWARKQIGAL ELSKNGANDD KVKQQVTALS MNYHLVSAYT SLVAVDLTPI DSTAMSRDAV VRQHLPLGWQ PFGALPQTAT SSRLDMLLGA LTLLLALVLM GSILRQKCEE KAAMMALACH R
|
| |
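The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch, assuming the spacing is display-only, of stripping that whitespace and computing a GC fraction that could be compared against the reported GC content of 50%. The helper names are hypothetical, and only the first three blocks of the gene sequence are shown as input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence; paste the full field to check the whole gene.
prefix = clean_sequence("ATGATCACAG GGAAAAGGGT AAAAGAAATC")
print(len(prefix))  # 30
print(round(gc_fraction(prefix), 3))
```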