Gene Shewmr4_2973 details


Gene Information

Locus tag: Shewmr4_2973
Symbol:
ID: 4253544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3550455
End bp: 3552329
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 48%
IMG OID: 638119609
Product: von Willebrand factor, type A
Protein accession: YP_735101
Protein GI: 113971308
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
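
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3550455, 3552329)
print(length)                  # 1875
print(protein_length(length))  # 624
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.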


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00160061
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATATC CATCTTCAGT CTTTCGCCGT AAAACGCTCT CAAGCATTGT CGTTTCGGGA 
TTAACACTCG CTATTTTGCT CGGCTTGAAT GGTTGCAGTG ATAAATCCGA CGACCAGCAA
AAGCGTGCTG AATTAGCCGA TCAAACAAAA CTCGCCGCCG AGCAACAGGC TGAGCTTAAG
CAGCAAGTAG AGTTAAAGGA CAGTGTTGAG CGTCAGGCCA ATAGGCAAAG AGATGCTGCA
ATCGCCATAC ATGAACAAGC CACGTCAACA AAATTGCGGA CAATGAATGC TGAGCATCGA
GCCTATATCG CTCAGCCTGC TGCCACTATC AGTGCGGCGC CCGCGTTAAA CGGCGATTGG
CCTGGGGCTG TGCCACCAGA GCGCAATCGA TTTGAGAAAC AAGTGCAAAA CGGCATCATG
GTTGCGGGGG AAACCCCTGT CTCGACCTTT GCTATCGATG TCGATACTGG TAGTTACACG
ACGTTAAGGC GAATGCTAAA GGAGGGGCGA TTACCGCAAA AGGACACGCT GCGGGTCGAA
GAAATGCTGA ATTATTTTTC CTATGACTAT CCATTACCGA GTAAAAATGA GGCGCCATTT
AGTGTTACCA CTGAGCTTGC ACCATCGCCC TATAACTATG ACATGATGTT ACTTCGCATC
GGTTTGAAGG GATATGAGCA GAGTAAAGCA GAACTCGGCG CCAGTAACTT AGTGTTTCTG
CTGGATGTGT CAGGGTCGAT GGCATCGCCC GATAAATTAC CTCTATTGCA AACTGCCTTG
AAAATGCTGA CTCAGCAACT GGGTGCTCAG GATAAGGTAT CGATTGTCGT CTACGCTGGC
GCAGCTGGTG TGGTGTTAGA TGGCGCGGCG GGTAACGACA GTCAAACCCT TAACTATGCG
TTAGAGCAGC TCAGTGCGGG TGGTTCTACC AATGGGGCGC AGGGTATTCA GCTTGCCTAT
CAGCTTGCGA AAAAGCACTT GGTTGAAGGC GGCATCAATC GAGTGATATT TGCGACCGAC
GGTGACTTTA ATGTCGGCAC GACTAACCTC GATGAGTTAA TCGATTTGGT TAGCGCGCAG
AAGCAACTGG GCATTGGGCT AACGACGCTC GGCTTTGGTA TGGGCGACTA CAATGACCAT
CTAATGGAGC AATTAGCCGA TAAAGGCAAT GGACAATACG CCTATATTGA TTCCCTCAAT
GAAGCGAGAA AAGTGCTGGT GGAACAGTTA AGTGCAACCT TACTGACCAT AGCCAAAGAG
GTGAAAGTGC AGGTCGAGTT TAATCCCGCC CTTGTGGCTG AGTATCGTCT TATTGGTTAT
GAGAACCGTG CCTTAGCGCG TGAAGATTTT AATAATGATA AGGTGGATGC GGGCGAAATA
GGCGCTGGGC ATACTGTGAC GGCGCTATAT GAGCTGCGTT ATGTTGATGC GGGAAATTTG
GCTAATGATA AACTTCGCTA TGGCTATAAT CCCAAGACTG GCAATGAAAA ATATAGCCGT
GATGAAATCG CCTTTCTGAA ATTACGTTAT CAGCTACCGG ATGCGACTCA AAGCCAGTTA
CTGAGTTATC CAATTCGTGC CGACCAAAGG GCAAACTCAT TAGCGCAGGC GAGTGACGAT
TTTCGTTTTG CCGCTGCAGT GGCAGGATTA GGACAGTTAC TGAATCAAAG CCACTATTTG
CATCAATTTG ATTATAATAA GCTTAGTGCG CTCACACGTT CTGCGCTGGG GGAAGATACT
AGCGGCTACC GACATGAATT TATGCAACTT GTCGATACCG CTGCGATACT CGCACAAACA
CAGCGAGTGC CAATCAAAAA ATCCTTTGAT GCCGGAGATA AACCTTTCCC GCCCGAGGAT
AAACCGCATC AGTGA
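
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any calculation. The sketch below, in Python, strips the whitespace and computes a GC percentage as (G + C) / total bases. How the record's own GC content figure was computed is not stated, so this formula is an assumption, and the helper names are hypothetical. Only the first displayed row is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence, copied as shown on the page.
first_row = "ATGAGATATC CATCTTCAGT CTTTCGCCGT AAAACGCTCT CAAGCATTGT CGTTTCGGGA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1875 bp sequence through the same two functions would give a value to compare with the GC content field.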
 
Protein sequence
MRYPSSVFRR KTLSSIVVSG LTLAILLGLN GCSDKSDDQQ KRAELADQTK LAAEQQAELK 
QQVELKDSVE RQANRQRDAA IAIHEQATST KLRTMNAEHR AYIAQPAATI SAAPALNGDW
PGAVPPERNR FEKQVQNGIM VAGETPVSTF AIDVDTGSYT TLRRMLKEGR LPQKDTLRVE
EMLNYFSYDY PLPSKNEAPF SVTTELAPSP YNYDMMLLRI GLKGYEQSKA ELGASNLVFL
LDVSGSMASP DKLPLLQTAL KMLTQQLGAQ DKVSIVVYAG AAGVVLDGAA GNDSQTLNYA
LEQLSAGGST NGAQGIQLAY QLAKKHLVEG GINRVIFATD GDFNVGTTNL DELIDLVSAQ
KQLGIGLTTL GFGMGDYNDH LMEQLADKGN GQYAYIDSLN EARKVLVEQL SATLLTIAKE
VKVQVEFNPA LVAEYRLIGY ENRALAREDF NNDKVDAGEI GAGHTVTALY ELRYVDAGNL
ANDKLRYGYN PKTGNEKYSR DEIAFLKLRY QLPDATQSQL LSYPIRADQR ANSLAQASDD
FRFAAAVAGL GQLLNQSHYL HQFDYNKLSA LTRSALGEDT SGYRHEFMQL VDTAAILAQT
QRVPIKKSFD AGDKPFPPED KPHQ
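
The protein sequence can be compared with a translation of the gene sequence. The following is a rough sketch in Python, not the database's own method. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence as displayed.
print(translate("ATGAGATATC CATCTTCAGT CTTTCGCCGT"))  # MRYPSSVFRR
print(translate("AAACCGCATC AGTGA"))                  # KPHQ
```

Both fragments match the start and end of the protein sequence shown above. The final row can be translated on its own because the preceding 1860 bases are a whole number of codons.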