Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2659 |
Symbol | |
ID | 6871247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2542837 |
End bp | 2544618 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642785726 |
Product | von Willebrand factor, type A |
Protein accession | YP_002216383 |
Protein GI | 198245970 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.101316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAACG GAAAAACATT AATGCTGTTG TTGGGTGGGG TTATTCTCTC GGGCTGTGGG CCGGAGCCCT CCGACCCGCA GGGAAATAAT CCGGCTGAGT TAAAACAGAA GCGGACTATC CAGAAAGAAA ATAGCGCTCA GGCCGGGGAT GACACTGTCC AGAAAAGACA AGCCGAGGCC GCCCAACAGG CTGCGAAAAA AGCAGCCGAA TATGCTGAAG CCAAAGCCCT GGCAGATGCC AAAGCGGCAT CTCTGGCAAC AGCCGAAGCA CCGCAATATG TAATGCAGAC ACGTGCCGCG GCACCAAAAG CGTTCGCTGC ACAAGGCGGT AATGTAATGG GGACCGCGCG TTACGAACAC TACGATGAAA ATCCGATTAA ACAGGTAAGT CGGGCGCCGC TCGCCACGTT TAGTCTGGAT GTAGACACTG GCAGCTACGC TAACGTGCGG CGCTTTCTGA ATCAGGGACA ACTGCCGCCG CCGGAAGCCG TGCGGGTAGA AGAGATGCTC AATTATTTTC CCGCGCCTCA GCCCGTTGCG GATAAGCAGG ATAACACTAA ACCCATTGCG GCCTGTATAC CGATGCCATT TGCGGTTAAA TACGAACTGG CTCCCTCGCC GTGGAACGCG CAGCGTACGC TATTAAAAGT TGATGTTCAG GCCCGGGACA TGCAGACCAG AGATCTGCCA CCTGCCAACC TGGTTTTTCT CATTGATACT TCTGGTTCTA TGCAGCCAGC GGAACGCCTG CCGTTGATCC GGTCGGCGCT AAAACTGTTG GTGAACGATC TGCGTGCGCA GGATAACATC ACTATTGTGA CCTACGCGGG CGGCACTCAC GTCGCGCTGG CGTCTACAGC GGGAAATAAC ACAACCGCGA TTAAAGCGGC AATTGATAAT CTGGATACTT ACGGGAGTAC CGGTGGTGAA GCGGGATTAC GGCTAGCCTA CGAGCAGGCC GAAAAAGGGT TTATCAAAGG CGGCGTTAAC CGCATTTTGT TGACCACCGA CGGTGATTTT AACCTCGGTA TTACCGATCC CAAAGACATC GAAGCGCTGG TAAAAAAAGA GCGTGAGAAA GGTATTACCT TATCTACGCT GGGCGTCGGC GATGACAATT TCAACGAAGC CATGATGGTG AGGATTGCTG ATGTGGGTAA CGGCAATTAC AGCTACATCG ACTCCCTCTC CGAGGCGCAA AAAGTCCTCA AGGATGAGAT GCATCAAACG CTGGTCACCG TTGCCAAAGA TGTAAAATCG CAAATCGAAT TTAATCCGCA GTGGGTGACT GAGTACCGGC AGATTGGTTA TGAAAAACGC CAACTGCGCG ACGAGGATTT CAATAACGAT AAGGTTGATG CCGGTGATAT CGGCGCGGGT AAACACGTCA CGCTATTCTT TGAACTGACG CTTAACGGGC AGAAAGCCTC GGTGGATAAA CTGCGCTACG CTCAGGACAA AGCCGCCTCA AAGCCAACAA AATCAAGCGA GCTGGCGTGG ATCAAATTGC GCTGGAAAGC GCCGCAGGGC AGCGAAAGTA CATTAGCCGA GCTCCCGGTC GTTATGGGAA AGATGCCGAT CTTTGCTGAC GCCTCTGAAG ATTTTCGTTT CCGCGCGGCG GTAGCGGCTT TCGGGCAAAA ACTGCGTGGC TCAGAAACGC TGGCAGATAC GACCTGGCCG CAAATTATTA AATGGGGTGA ACAGGCGCGC GGGGAAGATA AACAAGGCTA TCGCGCGGAG TTTATTAAAC TGGTAAAACT GGCGGAAGGC TTGTCTCACT AA
|
Protein sequence | MLNGKTLMLL LGGVILSGCG PEPSDPQGNN PAELKQKRTI QKENSAQAGD DTVQKRQAEA AQQAAKKAAE YAEAKALADA KAASLATAEA PQYVMQTRAA APKAFAAQGG NVMGTARYEH YDENPIKQVS RAPLATFSLD VDTGSYANVR RFLNQGQLPP PEAVRVEEML NYFPAPQPVA DKQDNTKPIA ACIPMPFAVK YELAPSPWNA QRTLLKVDVQ ARDMQTRDLP PANLVFLIDT SGSMQPAERL PLIRSALKLL VNDLRAQDNI TIVTYAGGTH VALASTAGNN TTAIKAAIDN LDTYGSTGGE AGLRLAYEQA EKGFIKGGVN RILLTTDGDF NLGITDPKDI EALVKKEREK GITLSTLGVG DDNFNEAMMV RIADVGNGNY SYIDSLSEAQ KVLKDEMHQT LVTVAKDVKS QIEFNPQWVT EYRQIGYEKR QLRDEDFNND KVDAGDIGAG KHVTLFFELT LNGQKASVDK LRYAQDKAAS KPTKSSELAW IKLRWKAPQG SESTLAELPV VMGKMPIFAD ASEDFRFRAA VAAFGQKLRG SETLADTTWP QIIKWGEQAR GEDKQGYRAE FIKLVKLAEG LSH
|
| |