Gene Sbal195_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_1139 
Symbol 
ID5752866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp1350170 
End bp1352098 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content47% 
IMG OID641287409 
Productvon Willebrand factor type A 
Protein accessionYP_001553575 
Protein GI160874259 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCCG TTTACAGCTT TAGGTCAACA TCAAAGGCAT TTAACATGAA AACAAAATAT 
TCCCCAAAAA CTGTTTCAGT TTCCGATCCA ACTCTCTACC TGCAACGGGG AAACGGTATC
CCATCGGCGA GCAATACGGC TGCTCTATTA CTGGTCGCAG TGAGTTTAAC TGCCTGTAGT
GGTAAAGGCG CCGAAGTTGA ACATCGACAA GCCAAGCAGC AAGCCGAGCA ACGTCATCAA
GTAGCGTCGC AGCGCCAAGC TGAAATGCGT GATGCTGCTA AGGTTGAGAT GGCGAGAGTG
GCGGCGCCGA TGCAAATGTC TAGCAATGGG GCAGTGATGG GAATGAGCAT AGCGCCAATG
CCGCGTGATT ATGCCGCGAT TCCGTTAGCG CAAAATAAAT TCGAGCAGCA AGTGCAAAAT
GGCATCATGG TGGCAGGGGA GATCCCGGTA TCGACGTTTT CCATCGATGT CGATACTGGC
AGTTATGCCA CTTTAAGGCG GATGCTGAGG GAAGGGCACT TACCTGAGAA AGGCACTGTC
AGAGTTGAGG AAATGCTCAA TTATTTTGCC TACGATTATC CCTTACCCGC TAAAAACGCG
GCACCGTTTA GCGTGACGAC AGAGCTTGCT CCCTCACCCT ATAACGATGA CATGATGCTG
CTGCGGATTG GCCTTAAAGG TTATGACTTA CCTAAATCTC AGTTAGGCGC CAGCAACTTA
GTCTTTTTGC TCGACGTGTC AGGCTCTATG GCGTCAGTGG ATAAATTGCC TTTACTGCAA
ACGGCATTAA AGCTGCTAAC AGCGCAGTTA AGCGCGCAGG ATAAAGTCTC TATTGTGGTT
TATGCAGGGG CGGCTGGTGT AGTGCTCGAT GGCGCGTCGG GGAACGACAC TCAAACTCTG
AACTATGCGC TAGAGCAATT GAGTGCCGGT GGTTCAACCA ACGGTGGTCA AGGGATCACG
CAAGCCTATC AATTGGCCAA AAAGCATTTT ATCCCCAATG GCATTAATCG AGTCATCCTT
GCGACCGATG GCGATTTCAA TGTTGGCGTG ACAGATTTTG ATGATTTGAT TGCCTTGATT
GAAAAGGAAA AAGATCATGG CATTGGCCTA ACGACCTTAG GGTTTGGTTT GGGCAATTAT
AACGATCAAC TGATGGAGCA ATTGGCGGAC AAGGGCAATG GCAATTATGC CTATATTGAT
ACGCTGAATG AAGCGCGAAA AGTGCTGGTG GACGAGTTGA GTTCGACCTT ATTTACTATC
GCCAAAGATG TGAAAGTGCA GGTGGAGTTT AATCCGGCCT TAGTCTCGGA ATACCGTTTG
ATTGGTTACG AGAATCGCGC CTTAGCGCGG GAGGATTTTA ATAACGATAA GGTGGATGCG
GGCGAGATTG GCGCGGGTCA TACCGTTACA GCCTTGTATG AATTAAGGTA CGTTGAAGCT
GGGAATAGGA TGAATGATAA ACTCAGATAT GGCGTTGATG CTCAAACGGG GAAAGAGAAA
TACAGCCGTG AAGAAATTGC TTTCCTTAAA TTGAGATACA AGTTGCCAGC ACAAACGCAG
AGCCAATTAC TGAGTTATCC CATCAGGTTA GATCAAAGCG TTAAACAGCT CGAGCAAGCA
AGTGATGATT TTAGATTTGC CGCCGCGGTT GCAGGGTTAG GGCAATTACT GAATGGCAGT
CACTATCTCC ATCAATTTGA TTATACTAAG TTAAGCTTAC TCGCACGTTC AGCATTAGGG
GATGATCCCT TTGGTTATCG TCATGAGTTT GTGCAGTTAA TGGAAACCGC AGCGGCGATT
GAGCAATCTA ATCAGCTGCC AAGTAAGAAA ACATTCGATG GCTCGGATAA ACCTTTTCCT
CCACAAGATA AACTCCATGG CGAGCCAATG AGAGATAAGA GCAATCCGCG TAATGAGAGA
TTACAATAG
 
Protein sequence
MRAVYSFRST SKAFNMKTKY SPKTVSVSDP TLYLQRGNGI PSASNTAALL LVAVSLTACS 
GKGAEVEHRQ AKQQAEQRHQ VASQRQAEMR DAAKVEMARV AAPMQMSSNG AVMGMSIAPM
PRDYAAIPLA QNKFEQQVQN GIMVAGEIPV STFSIDVDTG SYATLRRMLR EGHLPEKGTV
RVEEMLNYFA YDYPLPAKNA APFSVTTELA PSPYNDDMML LRIGLKGYDL PKSQLGASNL
VFLLDVSGSM ASVDKLPLLQ TALKLLTAQL SAQDKVSIVV YAGAAGVVLD GASGNDTQTL
NYALEQLSAG GSTNGGQGIT QAYQLAKKHF IPNGINRVIL ATDGDFNVGV TDFDDLIALI
EKEKDHGIGL TTLGFGLGNY NDQLMEQLAD KGNGNYAYID TLNEARKVLV DELSSTLFTI
AKDVKVQVEF NPALVSEYRL IGYENRALAR EDFNNDKVDA GEIGAGHTVT ALYELRYVEA
GNRMNDKLRY GVDAQTGKEK YSREEIAFLK LRYKLPAQTQ SQLLSYPIRL DQSVKQLEQA
SDDFRFAAAV AGLGQLLNGS HYLHQFDYTK LSLLARSALG DDPFGYRHEF VQLMETAAAI
EQSNQLPSKK TFDGSDKPFP PQDKLHGEPM RDKSNPRNER LQ