Gene Shewana3_3555 details

Gene Information

Locus tag: Shewana3_3555
Symbol:
ID: 4476155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 4262666
End bp: 4264387
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 53%
IMG OID: 639728164
Product: von Willebrand factor, type A
Protein accession: YP_871184
Protein GI: 117921992
COG category:
COG ID:
TIGRFAM ID:
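
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4262666, 4264387)
print(length_bp)                  # 1722
print(protein_length(length_bp))  # 573
```

Both results match the listed Gene Length (1722 bp) and Protein Length (573 aa).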


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT GGCATTTAGA ACTGCAAATG TTGGCGCAGT TTCATTTTAT TCGGCCACTC 
TGGTTATTAA CCCTCATTCC ACTCGCCATT GTGCTGATGC TGCGCTGGCG CCGGGATGAT
GTGCAGCAGC GGCTGGTGTT TTTCCCCAAT CATTTACGCA GTGCGCTCAC GCTGAATCAA
GGCGGTTGGC GCAGTCAATT ACCGTTGAAA ATCTTAATGT TATTGCTGTT ATTGGCGGTG
ATTATCTGTG CGGGACCGAC CTGGGAGCGC GAAGCGTCGC CCTTCGGCGA GGATGATGCC
GCGCTTATGG TGTTACTCGA CAGCAGTGAG AGCATGAAAC AGCAGGACGT GGCACCGGAT
AGACTCAGTC GCGCTAAACA TAAGATCTTG GATTTAATCG CAGCGCGAAG CGGCGGTAAG
ACGGGGTTGA TGGTGTTTGC GGGCAGCGCC CATGTGGCTA TGCCCGTCAC CAGCGATGCT
AAGGTGTTGC AGCCTTACCT TGAGGCGATC AGCCCTGAGG TGATGCCGTT ATCGGGCAAG
GCGGCGCAAA CAGCATTGAG TCAGCTCGCT GAGCAATTAC CCGCGAATGC AGGCAACAGT
GTTTTACTGC TCACAGACGG CGTTGATCAA CTCACTATCG ATGCGTTTGA GCGGTATTTT
ACCGAGCAGT TTGAACAGCC TCCCTATCAA CTGTTGATCT TGGCGATCGG CGATCCCGAT
GTTCAATCGC AGGTGCCGGT GGACTTTGAC TCCCTTGCCA ACTTGGCCGA TAGCACGGGC
GGTAGTCTGT ATCGCATGAC AATAGATGAT GCGGATATTC AGGCACTTGA GCGCAAAATT
GAGCGCTTTA GCATGCTCAA TAATGACTCC AGCATGCCTT GGTTAGATGA AGGCTATTGG
TTGCTTTGGC CCTTAGCTTT GCTCAGTTTG TTGTGGTTTC GTCGGGGCTG GTTGGTGAAG
TGGAGCCTAG TGTTAGCCTT AACGCTACCA AGTATTGTGC CGCAACAGGC CTATGCCGAA
ATCACCGTTT CTAAGGCGGC CACCGAGATC CAAGTGACAC AAGTCAGCTT TGCCGAGCGG
AGTTGGCAAT GGTGGTTGGA TCTGTGGCTA ACGCCGGATC AACAAGGCGC ATTCTGGTTT
AATCAGGGCG AATTTGCTAA GGCGGCAGCG GCTTACCATT CGGTGCTCAA CAAGGGCATC
GCCTACTACT ATGGCGGTGA GTATAAGCTC GCCCATTCGG CCTTTATGCA GGTCCAAACC
GATCTTGGCG CCTATTACGC CGCCAGCGCA TTAGCGCGGC AGCGGGAATA TATTGCGGCG
CGTAAGCTCT TGAGGACGCT GGCGAAAAAG CAGGATATTG CTCCCGAGCT AAAAGCCGAT
ATCGAACATA ATCTTAAGGT TATTGAGGGG CTTATAGATG AAATCAATCA AGCGAGTGCC
TCCCAAGCCA ACAGTATGGA CGATCAGGAA ACCTCCATCG AGTTGCCGGA CGATCAGCCG
CAGACCGCCG AAGGCGCCGA TGAACAAACC TCACAGGATA AAATGCAGTC GCAAAACCTG
ACGGCGGAGC AGATGTTGGG CGATCCTAAA TTGGCTGAAG TTTGGCTTAA GCGAGTCGAG
GCCAATCCCG AACAATTTTT GCGGGCGAAG TTTCAGCTGC AAAATCTGCA ACCTAAGGGC
GAACAAGGTA CCGATAACGC CAAGGGAGGA TTGCAGCCAT GA
 
Protein sequence
MSDWHLELQM LAQFHFIRPL WLLTLIPLAI VLMLRWRRDD VQQRLVFFPN HLRSALTLNQ 
GGWRSQLPLK ILMLLLLLAV IICAGPTWER EASPFGEDDA ALMVLLDSSE SMKQQDVAPD
RLSRAKHKIL DLIAARSGGK TGLMVFAGSA HVAMPVTSDA KVLQPYLEAI SPEVMPLSGK
AAQTALSQLA EQLPANAGNS VLLLTDGVDQ LTIDAFERYF TEQFEQPPYQ LLILAIGDPD
VQSQVPVDFD SLANLADSTG GSLYRMTIDD ADIQALERKI ERFSMLNNDS SMPWLDEGYW
LLWPLALLSL LWFRRGWLVK WSLVLALTLP SIVPQQAYAE ITVSKAATEI QVTQVSFAER
SWQWWLDLWL TPDQQGAFWF NQGEFAKAAA AYHSVLNKGI AYYYGGEYKL AHSAFMQVQT
DLGAYYAASA LARQREYIAA RKLLRTLAKK QDIAPELKAD IEHNLKVIEG LIDEINQASA
SQANSMDDQE TSIELPDDQP QTAEGADEQT SQDKMQSQNL TAEQMLGDPK LAEVWLKRVE
ANPEQFLRAK FQLQNLQPKG EQGTDNAKGG LQP
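
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own tooling. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), so one lookup table covers both. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAGTGATT GGCATTTAGA ACTGCAAATG"))  # MSDWHLELQM
print(translate("GAACAAGGTA CCGATAACGC CAAGGGAGGA TTGCAGCCAT GA"))  # EQGTDNAKGGLQP
```

The two fragments reproduce the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.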