Gene Shewmr4_3385 details

Gene Information

Locus tag: Shewmr4_3385
Symbol:
ID: 4253951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4038887
End bp: 4040608
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 53%
IMG OID: 638120023
Product: von Willebrand factor, type A
Protein accession: YP_735508
Protein GI: 113971715
COG category:
COG ID:
TIGRFAM ID:
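
The coordinate and length fields above can be checked against one another. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4038887, 4040608)
print(length_bp)                  # 1722
print(protein_length(length_bp))  # 573
```

Both results agree with the Gene Length and Protein Length fields in the record.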


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.465184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT GGCATTTAGA ACTGCAAATG TTGGCGCAGT TTCATTTTAT TCGGCCACTC 
TGGTTATTGA CCCTCATTCC ACTCGCCATT GTGCTGATGT TGCGTTGGCG CCGGGATGAT
GTGCAGCAAC GGCTAGTATT TTTTCCCAAC CATTTGCGCA GTGCGCTCAC GCTGAATCAA
GGCGGTTGGC GCAGTCAATT ACCTTTGAAA ATCTTAATGT TATTGCTGTT ATTGGCAGTG
ATTATCTGTG CGGGACCGAC CTGGGAGCGT GAAGCGTCGC CCTTCGGCGA GGACGATGCC
GCGCTGATGG TGTTGCTCGA CAGCAGCGAG AGCATGAAAC AGCAGGACGT GGCGCCGGAT
AGACTCAGCC GCGCTAAACA TAAGATCTTA GATTTAATCG CGGCGCGAAG CGGCGGTAAG
ACGGGGTTGA TGGTGTTTGC GGGCAGCGCC CATGTGGCTA TGCCTGTTAC CAGCGACGCT
AAGGTGCTGC AGCCTTACCT TGAGGCGATC AGCCCTGAGG TGATGCCGTT ATCGGGTAAG
GCGGCGCAAA CCGCACTGAG TCAGCTCGCT GAGCAATTAC CCGCCAATGC GGGCAACAGT
GTCTTACTGC TCACCGACGG CGTTGACCAA CTCACTATCG ATGCGTTTGA GCGGTATTTT
ACTGAGCAGT TTGAACATCC TCCCTATCAA TTGCTGATCT TGGCGATCGG CGATCCCGAT
GTTCAATCGC AGGTGCCAGT GGATGTTGAC TCCCTTGCCA ACTTGGCCGA TAGCACGGGC
GGCAGTCTGT ACCGCATGAC AATAGATGAT GCGGATATTC AGGCTCTTGA GCGCAAAATT
GAGCGCTTTA GCATGCTCAA TAATGATTCC AGCATGCCTT GGTTAGATGA GGGCTATTGG
TTGCTCTGGC CCTTAGCCTT GCTCAGTTTA CTGTGGTTTC GCCGGGGTTG GTTGGTGAAA
TGGAGCCTAG TGTTAGCGTT AATGCTACCG AGTATTGCGC CGCAACAGGT CTATGCCGAA
ATCACCGTTT CTAAGGCCGC CACCGAGACC CAAGTGACAC AAGTCAGCTT TGCCGAGCGG
AGTTGGCAAT GGTGGTTGGA TCTGTGGCTA ACGCCGGATC AACAAGGCGC ACTCTGGTTT
AGTAGGGGCG AATTTGCCAA GGCGGCAGCG GCTTACCATT CGGTGCTCAA CAAGGGCATT
GCCTACTACT ATGGCGGCGA GTATAAGCTC GCCCATTCGG CCTTTATGCA GGTACAAACC
GATCTCGGCG CCTATTATGC CGCCAGCGCA TTAGCGCGGC AGCGGGAATA TATCGCGGCG
CGTAAGCTAT TGAAGACGCT GGCGAAAAAG CAGGATATTG CTCCAGATCT AAAAGCCGAT
ATCGAGCATA ATCTTAAGGT TATCCAAGGG CTTATTGATG AAATCAATCA AGCGAGTGCC
TCCCAAGCCA ACAGTATGGG CGATCAGGAA ACCTCTATCG AGTTGCCGGA CGATCAGCCA
CAGACCGCCG AAGGCGCCGA TGAACAAACC TCACAGGATA AAATGCAGTC GCAAAACCTG
ACGGCGGAGC AGATGTTAGG CGATCCTAAA TTGGCTGAAG TCTGGCTTAA GCGAGTCGAG
GCTAATCCCG AACAATTTTT GCGGGCGAAG TTTCAGCTGC AAAATCTGCA ACCTAAGGAC
GCACAGAGCA CTGATAACGC CAAGGGAGGG TTGCAGCCAT GA
 
Protein sequence
MSDWHLELQM LAQFHFIRPL WLLTLIPLAI VLMLRWRRDD VQQRLVFFPN HLRSALTLNQ 
GGWRSQLPLK ILMLLLLLAV IICAGPTWER EASPFGEDDA ALMVLLDSSE SMKQQDVAPD
RLSRAKHKIL DLIAARSGGK TGLMVFAGSA HVAMPVTSDA KVLQPYLEAI SPEVMPLSGK
AAQTALSQLA EQLPANAGNS VLLLTDGVDQ LTIDAFERYF TEQFEHPPYQ LLILAIGDPD
VQSQVPVDVD SLANLADSTG GSLYRMTIDD ADIQALERKI ERFSMLNNDS SMPWLDEGYW
LLWPLALLSL LWFRRGWLVK WSLVLALMLP SIAPQQVYAE ITVSKAATET QVTQVSFAER
SWQWWLDLWL TPDQQGALWF SRGEFAKAAA AYHSVLNKGI AYYYGGEYKL AHSAFMQVQT
DLGAYYAASA LARQREYIAA RKLLKTLAKK QDIAPDLKAD IEHNLKVIQG LIDEINQASA
SQANSMGDQE TSIELPDDQP QTAEGADEQT SQDKMQSQNL TAEQMLGDPK LAEVWLKRVE
ANPEQFLRAK FQLQNLQPKD AQSTDNAKGG LQP
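
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used by the database: it builds the codon table by hand, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons. Because this gene starts with ATG, the sketch does no special start-codon handling.

```python
from itertools import product

# Codon table in TCAG order; these assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGATT GGCATTTAGA ACTGCAAATG"))  # MSDWHLELQM
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` gives a value to compare with the GC content field.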