Gene Snas_3738 details

Gene Information

Locus tag: Snas_3738
Symbol:
ID: 8884937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3984148
End bp: 3985977
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003512488
Protein GI: 291301210
COG category:
COG ID:
TIGRFAM ID:
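
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3984148
end_bp = 3985977
gene_length_bp = 1830
protein_length_aa = 609

# Assumption: both coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span)                      # 1830
print(gene_length_bp % 3)        # 0
print(expected_protein_length)   # 609
```

Under those assumptions the three fields agree: the coordinate span equals the gene length, the length is a whole number of codons, and 610 codons minus one stop codon gives the 609 residues listed.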


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.105397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTG GCCGCAGCCA ATCGGGTGTC GTTAGGTCGC ATCGGCGCGG TCGCCGCAGG 
CAGGTCATCC GCCGCCGTCT GCCCGTCGCT CCGTGGATCG TCGTGACCAC CGTCGTCGCG
CTGGCCGCCA GCGGCCTGGT CGGCGGCTAC GCGTTCCTGC TGAGCACCGG TTGCAGCGGC
GACCCGATCA AGGCGGTCGT GGCGGCCCCC AAGGAGATCA GCGGAACCCT CCAGACCGCC
GCGCGCTCCT GGGCCGGTAC CGAGCCGTCG GTGGACGGGG GACAGTGCAT CAGCGTGGAA
GTGCGCGAGC AGGCCTCCCA GGACGTCGTG TCGGCGCTGT CGGGCTCCAG CACGGGCTCC
AAGAAGGATC TGCCGCACGT GTGGATCCCG GAGTCGATGG CGTGGCTGGA GATGGCCAAG
ATCTCCGACC GTGGCAAGAA GATGCTGCCG GACTCGCCGC CGCTGGTCGC GACCAGTCCG
ACCGTGATCG CGATGCCCAC CGAGGCCGCC AAGGCCCTGG GCTGGCGCAA CGAGGACAAA
CCGGTCGACA AGGCCGGTAA ACCCACCTGG GGCAACCTGC TGAAACTGGC CGAGGACTCC
GATTGGAGCC AGTTCGGCAA GGACAAGTGG GGCGACATCA CCGTCGGCAT GAGCGACCCG
ATGGCCTCCA CCGCCGACCT GCACGCGCTG TTGAGCATCG TGGACAAGAA CCGCAGCGCC
GGTGTCGACG CCGAGGAACT GGGCAACGTC TCCAAACTGA AAAGCACCGT CCACAAGGAG
ACCCCCAGCG TCGAGGAGAT GATGGGCAAG GTCTCCGAGG CGAAGGCGAA GGGTGACCCG
GTCGGCTTCG TGTCGGCGTT CCCGGCGCTG GAGCGCGACG TCTGGAAGAA CAACTTCTCC
GGTACCGAAT CGCCGTTGAC GGCGGTGTAC CCGGCAGACG GCAGCCTCGA CGCCGACTAC
CCGCTGGCGG TGCTGCAGAA CGTGTCCTGG ACCGACGCGA CCCACCAGGA GATCGGGAAG
CAGTTCGGCG AGTACCTGCT CGGTGAAGGC CAGAAGGAGA TCAAGAAGGG CGGCTTCCGC
GACGGCACCC GTCGCGAGGC CAGCAGCGAA CTGACCGGTA CCGAGGGCCT GGCCACCCAG
ATCACCGCGA CGCAACGCGA CAAGGTCGAC TCCGAGAGTG TCCAGACCAC GCTGGCGACC
TGGCAGGCGG TGGCCCGTCC GGCCAACGTC CTGGTCGTGG TCGACTCCTC GTCGTCGATG
AGCACCGAGG AACCCTACGA CGGCGAGAAA CTGTCGCGTA TGGACATCAT TCGCAAGTCG
CTGGAGAGAT CGCTGGACCT GTTCGGCGAA CAGGCCAATG TGGGCCTGTG GCGATATCCG
TACGACGATC CGGTCGCCGG TACGGCGTAC CAGAAACTGG TGGAGATCGG TGAGTTCGAC
AAGTCGCGGC AGGATGACAT CGAATCGCAG CTCAGTGCCG TGGAACCGGC CGCGGACGGC
GGCCTCAACG ACACCGTCGT GGAGGCGTAC AAGAACGTCC TGGACAACTA CAACAAGACC
ACCGGGGCGA TCAACCTGGT GGTGGTGATA TCCGACGGCG GCAGCGAGTC GGACGCCAGT
CTGAGTAATG AGGACGTCAC GGAGGAACTG AAGGACCTGT CGGCCAAGGA CCGTGACAAG
GAAGCCTCGA TCATGACGAT CGGTTACGGC AAGGACGCCG ACAAGGATCA CCTGGACGCG
ATCGCGACGG CCACGCAGGG TCGCTACTAC CCGGCGAAGT GGAACGACGA GATCAACATG
CAGATCCTCA ACGCGCTGTA CTACAACTGA
 
Protein sequence
MARGRSQSGV VRSHRRGRRR QVIRRRLPVA PWIVVTTVVA LAASGLVGGY AFLLSTGCSG 
DPIKAVVAAP KEISGTLQTA ARSWAGTEPS VDGGQCISVE VREQASQDVV SALSGSSTGS
KKDLPHVWIP ESMAWLEMAK ISDRGKKMLP DSPPLVATSP TVIAMPTEAA KALGWRNEDK
PVDKAGKPTW GNLLKLAEDS DWSQFGKDKW GDITVGMSDP MASTADLHAL LSIVDKNRSA
GVDAEELGNV SKLKSTVHKE TPSVEEMMGK VSEAKAKGDP VGFVSAFPAL ERDVWKNNFS
GTESPLTAVY PADGSLDADY PLAVLQNVSW TDATHQEIGK QFGEYLLGEG QKEIKKGGFR
DGTRREASSE LTGTEGLATQ ITATQRDKVD SESVQTTLAT WQAVARPANV LVVVDSSSSM
STEEPYDGEK LSRMDIIRKS LERSLDLFGE QANVGLWRYP YDDPVAGTAY QKLVEIGEFD
KSRQDDIESQ LSAVEPAADG GLNDTVVEAY KNVLDNYNKT TGAINLVVVI SDGGSESDAS
LSNEDVTEEL KDLSAKDRDK EASIMTIGYG KDADKDHLDA IATATQGRYY PAKWNDEINM
QILNALYYN
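
The sequences above are printed in blocks of ten residues, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand rather than taken from a library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table; alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate from the first base and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGGCACGTG GCCGCAGCCA ATCGGGTGTC")
last_bases = clean_sequence("CAGATCCTCA ACGCGCTGTA CTACAACTGA")

print(translate(first_bases))  # MARGRSQSGV
print(translate(last_bases))   # QILNALYYN
```

The two fragments translate to the first and last residues of the protein sequence shown above. Passing the whole gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.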