Gene Snas_4557 details


Gene Information

Locus tag: Snas_4557
Symbol:
ID: 8885762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4860729
End bp: 4862006
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003513294
Protein GI: 291302016
COG category:
COG ID:
TIGRFAM ID:
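
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4860729
END_BP = 4862006


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1278 425
```

Under these assumptions the result agrees with the reported gene length (1278 bp) and protein length (425 aa).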


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.54416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.659076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGAT TCAGCATCGA CATCGACCAC AACGAATATC TCAGCGAGGG TGACACGATC 
GTCGACGCGA TCGTGACCGT CACCTCCGCC GGGAACGGCT CGGGCACGGT CACCGCCGAC
GCCGCCGATT TCGCGCAGGT CATCATGGTC GACTGCTCCG GCTCCATGAC CGGTTCCCGC
ATCGCCGAAG CGAAACGCGC CACCATCGCC GCCATCGAGT CCCTGGACGA GGGCTGCCGG
TTCGCGATCG TCAAGGGCAC CGACGAGGCC CAGATGGTGT ACCCCGACGA CGAGACCACC
GCCGTAGTGA AGTCCAGCAC CCGCAGCGCC GCCGTCAAAC GGGTCCAGAT CCTGCGGGCC
GGTGGCGGCA CCGCGATGGG CACCTGGCTG GCGAAGACCT CCCGGATCCT GTCCGGCACC
GACGCCGCCG TCAAGTACGG GCTGCTGTTG ACCGACGGCC GCAACCAGCA CGAGACCGAG
GAGGAACTGC GCGAGCACCT CGACGACTGC GAGGGCGTCT TCACCTGCGA CGCCTGCGGC
ATCGGCGAGG ACTGGAGCGC CACCGAGGTG CGGCACATCG CCGACACCCT GCTGGGCGAG
GCCAACGGCC TGCCCGAGCC GCGTGAGTGG GTGGAACGGT TCACTCAGCT GACGAACGGG
GCCCGCGCCA AGACCCTCGC CGACGTGAAC CTGCGGGTGT TCACCCCGGC CCGCAACCGG
ATCCGTTTCG TCAAGCAGAT GAGCCCGCAC ATCAGCGACC TGACCTCGCG CCGCCGCGAA
CTCGACGACA AGACCGGCGA CTACCCCACC GGCTCCTGGG GAGCCGAGGC CCGCGAGTTC
CACATCAGCG TCGAGGTGCC GCCGATCCAG GTCGGCGACG ACCGGCTCGC GGCCCGGGTG
TCGGCGATGT CCGGCGACAC CGAACTGATC CGGCACATGG TCACCGCCCG CTGGACCGGC
GACCAGATGC TGTCGACCCG CATCAACCCC AAGGTGGCGC TCCACACCGG ACAGGCCGAG
CTGGCTCGGG CGATCCAGGA GGGCGTCGCC GCCGCCCGGT CCGGCGACAC CGACACCGCT
ACCGACAGAC TCGGCCGCGC CGTCGCCCTG GCGGCCGAGG CCGGTAACGA CAACACCGCC
CGGCTGTTGG CCAAGGTCGT GGAGGTGGAT CCGGACACCG GCACCGTCCA GATGAAATCG
GCCGTCAGCT CCGTCGACCT GGAAATGGCC GATGTGGAGT CGGTGAAGAC GGTGACGGTC
CGTAAGTCCA GTAAGTAG
 
Protein sequence
MTGFSIDIDH NEYLSEGDTI VDAIVTVTSA GNGSGTVTAD AADFAQVIMV DCSGSMTGSR 
IAEAKRATIA AIESLDEGCR FAIVKGTDEA QMVYPDDETT AVVKSSTRSA AVKRVQILRA
GGGTAMGTWL AKTSRILSGT DAAVKYGLLL TDGRNQHETE EELREHLDDC EGVFTCDACG
IGEDWSATEV RHIADTLLGE ANGLPEPREW VERFTQLTNG ARAKTLADVN LRVFTPARNR
IRFVKQMSPH ISDLTSRRRE LDDKTGDYPT GSWGAEAREF HISVEVPPIQ VGDDRLAARV
SAMSGDTELI RHMVTARWTG DQMLSTRINP KVALHTGQAE LARAIQEGVA AARSGDTDTA
TDRLGRAVAL AAEAGNDNTA RLLAKVVEVD PDTGTVQMKS AVSSVDLEMA DVESVKTVTV
RKSSK
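
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the amino acid assignments of translation table 11, which match the standard code for elongation codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping trailing stop codons."""
    seq = clean(seq)
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)]
    return "".join(residues).rstrip("*")


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence listed above.
print(translate("ATGACCGGAT TCAGCATCGA CATCGACCAC"))  # MTGFSIDIDH
print(translate("CGTAAGTCCA GTAAGTAG"))               # RKSSK
```

Pasting the full gene sequence into `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.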