Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1703 |
Symbol | |
ID | 4252278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2014822 |
End bp | 2016405 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638118315 |
Product | von Willebrand factor, type A |
Protein accession | YP_733835 |
Protein GI | 113970042 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0500569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATGAAC AAAAACCAAC ACAGCAGGCT CCTGCACTTA ATATTTTAGC TGGTTTGGAG CGGCAGTTAG TTGATATTGA TTTATTACCT GATTCGTTGG TGCCAGCTGT AGTTACCCTT CCAGGAGGCG ATTTATCTAC TCGCTGTATG ACTATCGCAT TATTGCGAAA TCAGTTGTTG GCGGGAGAGT CATTGTCACA ACCTTGCCCT TGGTTACCTG AGGCCGTTCA GCAAAACATT GTTAGTGTAA TTAGCCGTTC AGGTGTTGCA ACTTATTGTC AAAATAATCC GCAAGTCACC GATGCCTTGA TACTGGATTT ACTATCTGCT CTTGAAGGTG CTCAGACAAG AGCATTAATC CTTGCTCGGC AATTAGCTCA AATCAAATAT AAGCAAGCGT TAGAAGTGCT GAAAGAAGAA TTCGCTGTTA CCCATATCAA AAGCAATTCA GCCAAACTTA AAAATTCACT TGTTCTTTCA GATGCAAAAA AACTTGAATG TCAGCTCCAG GCTGAATTAG AGGCCTGGTT GCAGTTAATT AATGCTAATG GACAACATCC ATTTGCATTG CCATCAGTTT GGGCCGAAAG GTTGGAAGTC TGGCAAATAC TTTCAGAGAT TTTTGATGAT TTAGGTGTGG TCACTGGTCT TGGTTGGGGC CTATCGAAAG GAATGCTGCA AAGCCATGGT TGGATGAACA TGGTGCGATT ACAGAAGCTC GTCGAGCAAA TCCCACAATT GCGTGAAGTG ATTGAAACCC TGGGGCGCAT GAAAGATTGC GACGGTGAGC CCGTGATTGA AGAAATTATC TCTAAGATGA GCGTAACGAA AAAACGCGAT AAAGAGATTT CTATGCCGCT TGTGCCAATG GAAACCAAGG GGATCACTCG TTCTGACTCA ATTAGTCGAA TGTTGCCTCA AGAAGCGGCT TTTTTGGGGC ATCCAGTCCT TAAAAAACTT TGGCATGCCC GACGAGCTGA GCATGCATTG CTCAGTTATG CTGTTGAAGG CACGGACGTC ATCACTGAAG AAGTTACATT TGAGCAAGAG GTTAAGGAAG AAAAAGCTGG CTTCAAAATA AATCGTAACC GTGGTCCCAT GATTGTTTGT CTAGATACTT CTGGTTCCAT GCAAGGAACA CCAGAGAATG TTGCCAAGGC GCTGGTGTTG CAATGTATCA GTGTGGCTAA AACCGAAAAA CGTGCTTGTT ATGTTTACCT ATTTGGTAGC AGGGGAGAAG TGACAGAAAT GGAGCTGACA CCCGATGCTG AGGGTTTAGA GCGGATGATA CTTTTTCTGT CCATGTCATT CGGCGGCGGT ACAGATGCAG AGGGACCACT TAATTTGGCT CTTGATCAAA GTGATAAAAA TAAATGGCAT CAAGCAGATA TATTGCTCGT TAGCGATGGT GAATTTGCGG TGTCTTCTGG ACTAACCCGT AAAATTAGTC ATCGTAAGGA TAATAGCGGA TTGAGTATTC ATGGGATTGT TATTGGGTCG AGTTTGTCGC CGATGGACAA GATCTGTGAC CCGTTGCATC AGTTTTCAAG TTGGTTAGAC CTGCAAAATA ACGTATATCA GTGA
|
Protein sequence | MNEQKPTQQA PALNILAGLE RQLVDIDLLP DSLVPAVVTL PGGDLSTRCM TIALLRNQLL AGESLSQPCP WLPEAVQQNI VSVISRSGVA TYCQNNPQVT DALILDLLSA LEGAQTRALI LARQLAQIKY KQALEVLKEE FAVTHIKSNS AKLKNSLVLS DAKKLECQLQ AELEAWLQLI NANGQHPFAL PSVWAERLEV WQILSEIFDD LGVVTGLGWG LSKGMLQSHG WMNMVRLQKL VEQIPQLREV IETLGRMKDC DGEPVIEEII SKMSVTKKRD KEISMPLVPM ETKGITRSDS ISRMLPQEAA FLGHPVLKKL WHARRAEHAL LSYAVEGTDV ITEEVTFEQE VKEEKAGFKI NRNRGPMIVC LDTSGSMQGT PENVAKALVL QCISVAKTEK RACYVYLFGS RGEVTEMELT PDAEGLERMI LFLSMSFGGG TDAEGPLNLA LDQSDKNKWH QADILLVSDG EFAVSSGLTR KISHRKDNSG LSIHGIVIGS SLSPMDKICD PLHQFSSWLD LQNNVYQ
|
| |