Gene Information |
Locus tag | Sbal223_3252 |
Symbol | |
ID | 7087630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 3857380 |
End bp | 3859263 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643462135 |
Product | von Willebrand factor type A |
Protein accession | YP_002359159 |
Protein GI | 217974408 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
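The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3857380
END_BP = 3859263
GENE_LENGTH_BP = 1884
PROTEIN_LENGTH_AA = 627


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 1884
print(coding_length_aa(GENE_LENGTH_BP))   # 627
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows of the record.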
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00246065 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACAA AATATTCCCC AAGAACCGTT TCGGTTTCCA ATCCAACTTT CTACCTGCAA CAGGGATACG GTATTCCATC GGCGACCAAT ATGGCTGCTC TATTACTGGT CGCAGTGAGT TTAACGGCCT GTGGCGGTAA AGGCGCCGAA GTTCAACATC GACAAGCCAA GCAGCAAGCC GAGCAACGTC ATCAAGAAGC GTCGCAGCGC CAAGCTGAAA TGCGTGATGC TGCTAAGGTT GAGATGGCGA GAGTTGCGGC GCCGATGCAA ATGTCTAGCA ATGGGGCAGT AATGGGAATG AGCATAGCGC CAATGCCGCG TGATTATGCC GCGATTCCGT TAGCGCAAAA TAAATTCGAG CAGCAAGTGC AAAATGGCAT CATGGTGGCA GGAGAGATCC CGGTATCGAC GTTTTCCATC GATGTCGATA CTGGCAGTTA TGCCACTTTA AGGCGGATGC TGAGGGAAGG GCGCTTACCC GAGAAAGGCA TTGTCAGAGT TGAGGAAATG CTCAATTATT TTGCCTACGA TTATCCCTTA CCCGCTAAAA ACGCGGCGCC GTTTAGTGTG ACGACAGAGC TTGCGCCCTC ACCCTATAAC GATGACATGA TGCTGCTGCG GATTGGTCTT AAAGGTTATG ACTTACCTAA ATCTCAGTTA GGCGCCAGCA ACTTAGTCTT TTTGCTCGAT GTGTCAGGCT CTATGGCTTC AACGGATAAA TTACCTTTAC TGCAAACGGC GTTAAAGCTG CTAACAGCGC AATTAAGTGC GCAGGATAAA GTCTCTATTG TGGTTTATGC TGGGGCTGCT GGTGTGGTGC TCGATGGCGC GTCGGGGAAC GACACTCAAA CCTTGACCTA TGCGCTAGAG CAATTAAGTG CCGGTGGTTC AACCAATGGT GGGCAAGGGA TCACGCAAGC CTATCAATTG GCCAAAAAGC ATTTTATCCC CAATGGCATT AATCGAGTCA TCCTTGCGAC CGATGGTGAT TTCAATGTTG GCGTGACAGA TTTTGATGAT TTGATTGCCT TGATTGAAAA GGAAAAAGAT CATGGCATTG GCCTGACAAC CTTAGGGTTT GGCTTGGGCA ATTATAACGA TCAACTGATG GAGCAATTGG CGGACAAGGG CAATGGCAAC TATGCCTATA TTGATACGCT GAATGAAGCG CGAAAAGTGC TGGTGGACGA GTTGAGTTCG ACCTTATTCA CTATCGCCAA AGATGTGAAA GTGCAGGTGG AGTTTAATCC AGCCTTAGTC TCGGAATACC GTTTGATTGG CTATGAGAAT CGCGCCTTAG CACGGGAAGA TTTTAATAAC GATAAGGTGG ATGCGGGCGA GATTGGCGCG GGTCATACAG TAACAGCCTT GTATGAATTA AGGTACGTTG AAGCTGGGAA TAGGATGAAT GATAAACTTA GATATGGCGT TGATGCTCAA ACGGGGAAAG AGAAATACAG CCGTGAAGAA ATTGCTTTCC TTAAATTGAG ATACAAGTTG CCAGCACAAA CGCAGAGCCA ATTACTGAGT TATCCCATCA GGTTAGATCA AAGCGTTAAA CAGCTAGAGC AAGCAAGTGA TGATTTTAGA TTTGCCGCCG CGGTTGCAGG GTTAGGGCAA TTACTGAATG GCAGTCACTA TCTACATCAA TTTGATTATA CTAAGTTAAG CTTACTCGCA CGTTCAGCAT TAGGGGATGA TCCCTTTGGT TATCGACATG AATTTGTGCA GTTAATGGAA ACCGCAGCGG CGATAGAGCA ATCTAATCAG CTGCCAATTA ACAAAACATT CGATGGCTCG 
GATAAACCTT TTCCTCCACA AGATAAACTC CATGGCGAGC CAATGAGAGA TAAGAGCAAT CCGCGTAATG AGAGATTACA ATAG
|
Protein sequence | MKTKYSPRTV SVSNPTFYLQ QGYGIPSATN MAALLLVAVS LTACGGKGAE VQHRQAKQQA EQRHQEASQR QAEMRDAAKV EMARVAAPMQ MSSNGAVMGM SIAPMPRDYA AIPLAQNKFE QQVQNGIMVA GEIPVSTFSI DVDTGSYATL RRMLREGRLP EKGIVRVEEM LNYFAYDYPL PAKNAAPFSV TTELAPSPYN DDMMLLRIGL KGYDLPKSQL GASNLVFLLD VSGSMASTDK LPLLQTALKL LTAQLSAQDK VSIVVYAGAA GVVLDGASGN DTQTLTYALE QLSAGGSTNG GQGITQAYQL AKKHFIPNGI NRVILATDGD FNVGVTDFDD LIALIEKEKD HGIGLTTLGF GLGNYNDQLM EQLADKGNGN YAYIDTLNEA RKVLVDELSS TLFTIAKDVK VQVEFNPALV SEYRLIGYEN RALAREDFNN DKVDAGEIGA GHTVTALYEL RYVEAGNRMN DKLRYGVDAQ TGKEKYSREE IAFLKLRYKL PAQTQSQLLS YPIRLDQSVK QLEQASDDFR FAAAVAGLGQ LLNGSHYLHQ FDYTKLSLLA RSALGDDPFG YRHEFVQLME TAAAIEQSNQ LPINKTFDGS DKPFPPQDKL HGEPMRDKSN PRNERLQ
|
| |
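The gene and protein sequences above can be compared by translating the nucleotide sequence, and the GC content row can be recomputed from it. The following is a minimal sketch using only the Python standard library. It assumes the gene sequence is given in the coding (5' to 3') orientation, which is consistent with it starting with ATG, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAACAA AATATTCCCC AAGAACCGTT"))  # MKTKYSPRTV
```

Passing the full Gene sequence value to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content rows; the sketch does not handle ambiguous bases such as N.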