Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1403 |
Symbol | |
ID | 4251422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1634481 |
End bp | 1635497 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638118002 |
Product | von Willebrand factor, type A |
Protein accession | YP_733538 |
Protein GI | 113969745 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.520734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000232326 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAACAG TCGCATGGCC TTTGGCATTG ATTCTACTGC CTTTACCTTT TATTTTTTGG CGTCGTCAAA CGGTGCAAGC CGAAGGTGGT CGCCTGCAAT TGCCCGGCAT TAGTCAAACG GGCAAGGCCA ATATCAGTAG CCACAGCCGA CAAAGCCGCA AGCGTTATTG GTTGATGTGG AGCTTGTTAG TGCTTGCTAT CGCCCGCCCA CAATGGCTTG GCGATCCAAT TGAACTGCCA AGCCAAGGGC GGGATCTGAT GCTGGCGGTG GACTTATCCG GCAGTATGCA AATTGAAGAT ATGGTGATTA ACGGTAAAGT CGTCGACCGT TTTACCTTGA TCCAACACGT GGTCAGTGAG TTTATAGAGC GTCGCAAAGG CGATCGTATC GGTTTGATTC TATTTGCCGA TCACGCTTAT CTACAAGCGC CACTCACCCA AGATAGACGC TCTGTCGCTC AGTTTTTAAA GGAGGCGCAG ATTGGCCTTG TCGGTAAGCA AACGGCGATT GGCGAGTCGA TTGCGCTCGC GGTCAAGCGC TTCGATAAAA TGGATGAGAG TAACAGAGTC TTGATTTTAT TGACTGACGG TTCAAATAAC GCCGGTAATA TCGATCCTGA TCAAGCGGCC CAAATCGCCG CTAATCGCAA AGTGACGATT TATACCGTGG GCGTGGGTGC CGATGTCATG GAAAGACGCA CCCTGTTTGG TCGTGAGCGC GTCAATCCGT CGATGGATCT CGATGAAAAC CAACTCAAGC ATATTGCCGA AGTGACCCAT GGCCGCTATT TCCGCGCCCG TAACAGCCAA GAGCTGGAAC AGATTTATCA AGAAATTGAC AAACTGGAAC CCGTGAGTCG CGATCAACTC AGCTATCGTC CTCAAGCGGA GCTATTCTAT TGGCCGCTCG CACTCGCGCT GCTCACCAGC ATTTGGATAG CCTTAGGCCA ATTAGCGCTG TTTTCGCGGA CTAAAAACCC TGTTCCATCA GCAACACCAC GGGGGGATGT TCGTTAA
|
Protein sequence | MLTVAWPLAL ILLPLPFIFW RRQTVQAEGG RLQLPGISQT GKANISSHSR QSRKRYWLMW SLLVLAIARP QWLGDPIELP SQGRDLMLAV DLSGSMQIED MVINGKVVDR FTLIQHVVSE FIERRKGDRI GLILFADHAY LQAPLTQDRR SVAQFLKEAQ IGLVGKQTAI GESIALAVKR FDKMDESNRV LILLTDGSNN AGNIDPDQAA QIAANRKVTI YTVGVGADVM ERRTLFGRER VNPSMDLDEN QLKHIAEVTH GRYFRARNSQ ELEQIYQEID KLEPVSRDQL SYRPQAELFY WPLALALLTS IWIALGQLAL FSRTKNPVPS ATPRGDVR
|
| |