Gene Sbal_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal_1038 
Symbol 
ID4841413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS155 
KingdomBacteria 
Replicon accessionNC_009052 
Strand
Start bp1195310 
End bp1197238 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content47% 
IMG OID640118262 
Productvon Willebrand factor type A 
Protein accessionYP_001049431 
Protein GI126173282 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGCCG TTTACAGCTT TAGGTCAACA TCAAAGGCAT TCAACATGAA AACACAATAT 
TCCCCAAAAA CTGTTTCAGT TTCCGATCCA ACTCTCTACC TGCAACGGGG AAACGGTATC
CCATCGGCGA GCAATACGGC TGCTCTATTA CTGGTCGCAG TGAGTTTAAC GGCCTGTGGT
GGTAAAGGCG CCGAAGTTGA ACATCGACAA GCTGAGCAGC AAGCCGAGCA ACGTCATCAA
GTAGCGTCGC AGCGCCAAGC TGAAATGCGT GATGCCGCTA AGGTTGAGAT GGCGAGAGTG
GCGGCGCCGA TGCAAATGTC TAGCAATGGG GCTGTGATGG GGATGAGCAT AGCGCCAATG
CCGCGTGACT ATGCCGCGAT TCCGTTAGCG CAAAATAAAT TCGAGCAGCA AGTGCAAAAT
GGCATCATGG TGGCAGGGGA GATCCCTGTA TCGACGTTTT TCATCGATGT CGATACTGGC
AGTTATGCCA CCTTAAGACG GATGTTGAGG GAAGGGCGCT TACCCGAGAA AGGCACTGTC
AGAGTTGAGG AAATGCTCAA TTATTTTGCC TACGATTATC CCTTACCCGC TAAAAACGCG
GCACCGTTTA GCGTGACGAC AGAGCTTGCT CCCTCACCCT ATAACGATGA CATGATGCTG
CTGCGGATTG GCCTTAAAGG TTATGACTTA CCTAAATCTC AATTAGGCGC GAGTAACTTA
GTCTTTTTAC TCGATGTGTC AGGCTCTATG GCGTCAGCGG ATAAATTGCC TTTACTGCAA
ACGGCGTTAA AGCTGCTAAC AGCGCAATTA AGTGCGCAGG ATAAAGTCTC TATTGTGGTT
TATGCTGGGG CTGCTGGTGT GGTGCTCGAT GGCGTGTCGG GGAACGACAC TCAAACCTTG
ACCTATGCGT TAGAACAATT AAGTGCCGGT GGTTCAATCA ATGGTGGGCA AGGGATCACG
CAAGCCTATC AATTGGCCAA AAAGCATTTT ATCCCTAATG GCATAAATCG AGTCATCCTC
GCGACCGATG GCGATTTTAA TGTTGGCGTG ACAGATTTTG ATGATTTGAT TGCCTTGATT
GAAAAGGAAA AAGATCATGG TATTGGCCTA ACGACCTTAG GGTTTGGTTT GGGCAATTAT
AACGATCAAC TGATGGAGCA ATTGGCGGAC AAGGGCAATG GCAACTATGC CTATATTGAT
ACGCTGAATG AAGCGCGAAA AGTGCTGGTG GACGAGTTGA GTTCGACCTT ATTTACTATC
GCCAAAGATG TAAAAGTGCA GGTGGAGTTT AATCCGGCCT TAGTCTCGGA ATACCGTTTG
ATTGGCTATG AGAATCGCGC CTTAGCACGG GAAGATTTTA ATAACTATAA GGTGGATGCG
GGCGAGATTG GCGCGGGTCA TACAGTAACA GCCTTGTATG AATTAAGGTA CGTTGAAGCC
GGGAATAGGA TGAATGATAA ACTTAGATAT GGCGTTGATG CTCAAACGGG GAAAGAGAAA
TACAGCCGTA AAGAAATTGC TTTCCTTAAA TTGAGATACA AGTTGCCAGC ACAAACGCAG
AGCCAATTAC TAAGTTATCC CATCAGGTTA GATCAAAGCG TTAAACAGCT CGAGCAAGCA
AGTGATGATT TTAGATTTGC CGCCGCTGTT GCAGGGCTAG GACAATTACT GAATGGCAGT
CACTATCTAC ATCAATTTGA TTATACTAAG TTAAGCTTAC TCGCACGTTC AGCATTAGGG
GATGATCCCT TTGGTTATCG CCATGAGTTT GTGCAGTTAA TGGAAACCGC AGCGGCGATA
GAGCAATCTA ATCAGCTGCC AATTAAGAAA GCATTCGATG GCTCGGATAA ACCTTTTCCT
CCCCAAGATA AACTCCATGG CGAGCCAATG AGAGATAAGA GCAATCCGCG TAATGAGAGA
TTACAATAG
 
Protein sequence
MRAVYSFRST SKAFNMKTQY SPKTVSVSDP TLYLQRGNGI PSASNTAALL LVAVSLTACG 
GKGAEVEHRQ AEQQAEQRHQ VASQRQAEMR DAAKVEMARV AAPMQMSSNG AVMGMSIAPM
PRDYAAIPLA QNKFEQQVQN GIMVAGEIPV STFFIDVDTG SYATLRRMLR EGRLPEKGTV
RVEEMLNYFA YDYPLPAKNA APFSVTTELA PSPYNDDMML LRIGLKGYDL PKSQLGASNL
VFLLDVSGSM ASADKLPLLQ TALKLLTAQL SAQDKVSIVV YAGAAGVVLD GVSGNDTQTL
TYALEQLSAG GSINGGQGIT QAYQLAKKHF IPNGINRVIL ATDGDFNVGV TDFDDLIALI
EKEKDHGIGL TTLGFGLGNY NDQLMEQLAD KGNGNYAYID TLNEARKVLV DELSSTLFTI
AKDVKVQVEF NPALVSEYRL IGYENRALAR EDFNNYKVDA GEIGAGHTVT ALYELRYVEA
GNRMNDKLRY GVDAQTGKEK YSRKEIAFLK LRYKLPAQTQ SQLLSYPIRL DQSVKQLEQA
SDDFRFAAAV AGLGQLLNGS HYLHQFDYTK LSLLARSALG DDPFGYRHEF VQLMETAAAI
EQSNQLPIKK AFDGSDKPFP PQDKLHGEPM RDKSNPRNER LQ