Gene Shewmr4_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1703 
Symbol 
ID4252278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2014822 
End bp2016405 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content44% 
IMG OID638118315 
Productvon Willebrand factor, type A 
Protein accessionYP_733835 
Protein GI113970042 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0500569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGAAC AAAAACCAAC ACAGCAGGCT CCTGCACTTA ATATTTTAGC TGGTTTGGAG 
CGGCAGTTAG TTGATATTGA TTTATTACCT GATTCGTTGG TGCCAGCTGT AGTTACCCTT
CCAGGAGGCG ATTTATCTAC TCGCTGTATG ACTATCGCAT TATTGCGAAA TCAGTTGTTG
GCGGGAGAGT CATTGTCACA ACCTTGCCCT TGGTTACCTG AGGCCGTTCA GCAAAACATT
GTTAGTGTAA TTAGCCGTTC AGGTGTTGCA ACTTATTGTC AAAATAATCC GCAAGTCACC
GATGCCTTGA TACTGGATTT ACTATCTGCT CTTGAAGGTG CTCAGACAAG AGCATTAATC
CTTGCTCGGC AATTAGCTCA AATCAAATAT AAGCAAGCGT TAGAAGTGCT GAAAGAAGAA
TTCGCTGTTA CCCATATCAA AAGCAATTCA GCCAAACTTA AAAATTCACT TGTTCTTTCA
GATGCAAAAA AACTTGAATG TCAGCTCCAG GCTGAATTAG AGGCCTGGTT GCAGTTAATT
AATGCTAATG GACAACATCC ATTTGCATTG CCATCAGTTT GGGCCGAAAG GTTGGAAGTC
TGGCAAATAC TTTCAGAGAT TTTTGATGAT TTAGGTGTGG TCACTGGTCT TGGTTGGGGC
CTATCGAAAG GAATGCTGCA AAGCCATGGT TGGATGAACA TGGTGCGATT ACAGAAGCTC
GTCGAGCAAA TCCCACAATT GCGTGAAGTG ATTGAAACCC TGGGGCGCAT GAAAGATTGC
GACGGTGAGC CCGTGATTGA AGAAATTATC TCTAAGATGA GCGTAACGAA AAAACGCGAT
AAAGAGATTT CTATGCCGCT TGTGCCAATG GAAACCAAGG GGATCACTCG TTCTGACTCA
ATTAGTCGAA TGTTGCCTCA AGAAGCGGCT TTTTTGGGGC ATCCAGTCCT TAAAAAACTT
TGGCATGCCC GACGAGCTGA GCATGCATTG CTCAGTTATG CTGTTGAAGG CACGGACGTC
ATCACTGAAG AAGTTACATT TGAGCAAGAG GTTAAGGAAG AAAAAGCTGG CTTCAAAATA
AATCGTAACC GTGGTCCCAT GATTGTTTGT CTAGATACTT CTGGTTCCAT GCAAGGAACA
CCAGAGAATG TTGCCAAGGC GCTGGTGTTG CAATGTATCA GTGTGGCTAA AACCGAAAAA
CGTGCTTGTT ATGTTTACCT ATTTGGTAGC AGGGGAGAAG TGACAGAAAT GGAGCTGACA
CCCGATGCTG AGGGTTTAGA GCGGATGATA CTTTTTCTGT CCATGTCATT CGGCGGCGGT
ACAGATGCAG AGGGACCACT TAATTTGGCT CTTGATCAAA GTGATAAAAA TAAATGGCAT
CAAGCAGATA TATTGCTCGT TAGCGATGGT GAATTTGCGG TGTCTTCTGG ACTAACCCGT
AAAATTAGTC ATCGTAAGGA TAATAGCGGA TTGAGTATTC ATGGGATTGT TATTGGGTCG
AGTTTGTCGC CGATGGACAA GATCTGTGAC CCGTTGCATC AGTTTTCAAG TTGGTTAGAC
CTGCAAAATA ACGTATATCA GTGA
 
Protein sequence
MNEQKPTQQA PALNILAGLE RQLVDIDLLP DSLVPAVVTL PGGDLSTRCM TIALLRNQLL 
AGESLSQPCP WLPEAVQQNI VSVISRSGVA TYCQNNPQVT DALILDLLSA LEGAQTRALI
LARQLAQIKY KQALEVLKEE FAVTHIKSNS AKLKNSLVLS DAKKLECQLQ AELEAWLQLI
NANGQHPFAL PSVWAERLEV WQILSEIFDD LGVVTGLGWG LSKGMLQSHG WMNMVRLQKL
VEQIPQLREV IETLGRMKDC DGEPVIEEII SKMSVTKKRD KEISMPLVPM ETKGITRSDS
ISRMLPQEAA FLGHPVLKKL WHARRAEHAL LSYAVEGTDV ITEEVTFEQE VKEEKAGFKI
NRNRGPMIVC LDTSGSMQGT PENVAKALVL QCISVAKTEK RACYVYLFGS RGEVTEMELT
PDAEGLERMI LFLSMSFGGG TDAEGPLNLA LDQSDKNKWH QADILLVSDG EFAVSSGLTR
KISHRKDNSG LSIHGIVIGS SLSPMDKICD PLHQFSSWLD LQNNVYQ