Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0868 |
Symbol | |
ID | 8251962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1024028 |
End bp | 1024996 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644934523 |
Product | von Willebrand factor type A |
Protein accession | YP_003091152 |
Protein GI | 255530780 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.721273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00379424 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAGCG TTCATAATTA TTATGAACGC TTTTTTTATA TTTACAGCAT GCAGTCGCCA ATTGATGAAA CAACCTTACA TTTTAATGGC AACCTTGAAT TACTGGCCAG GCAGGTAGTG GAAGGTTTTA TTACCGGATT GCACAAAAGC CCTTTTCACG GTTTTTCTGT TGAATTTGCA GAACACCGGC TTTATAATGT TGGCGATAAT GTTAAAAACA TCGACTGGAA ACTTTATGCC AGGACCGACA AGCTTTTTAG CAAACGCTAT GAAGAAGAAA CCAATCTGCG CTGTCAGTTC ATCATCGATG TTTCTTCGTC CATGTATTTC CCTTTAAAAG CATACAATAA GCTTAATTTT TCTGTACAGG CGGTTGCTGC ATTGGTTTAT TTGCTAAAAA GGCAGCGCGA TGCCTTTGGC CTAAGTCTTT TTACCGATCA GCTGGTATTG AATACCCCGG CTAAATCTAC CACCACCCAT CAAAAATACC TGTTCGCAAG ACTGGAAGAA ATTTTAAAAG CCGAGCAGAT GAACGTAAAG ACCAATCTTG ACCAGGCTTT GCACCAAATT GCAGAGCTGA TCCATAAACG TTCCCTGGTG GTTGTGTTTA GTGATCTGCT CAGTACTGCT CAGGATGAAC ATCAGATCGA AGGTTTATTC TCGGCCCTTC AGCACCTGAA GTTCAATAAA CATGAAGTCA TCATTTTTAA TGTGACAGAC AAAGCAAAAG AAGTAGATTT TAAATTTGAG AACCGTCCTT ATCAATTTGT TGATATGGAA ACGGGTGCTA TACTAAAAGC ACATACCTCA AAAGTTAAAG ATGCCTATCT GTTAAAGATG CAGGCCTACA GGCAGGCCAT TCAGCTTAAA TGTGCACAGT ACAAAATTGA TATGGTTGAT GCAGATATTG CCAAAGGATT TTACCCCATA TTACAGGCTT ATCTAATCAA GCGTCAAAAA ATGAGTTAA
|
Protein sequence | MKSVHNYYER FFYIYSMQSP IDETTLHFNG NLELLARQVV EGFITGLHKS PFHGFSVEFA EHRLYNVGDN VKNIDWKLYA RTDKLFSKRY EEETNLRCQF IIDVSSSMYF PLKAYNKLNF SVQAVAALVY LLKRQRDAFG LSLFTDQLVL NTPAKSTTTH QKYLFARLEE ILKAEQMNVK TNLDQALHQI AELIHKRSLV VVFSDLLSTA QDEHQIEGLF SALQHLKFNK HEVIIFNVTD KAKEVDFKFE NRPYQFVDME TGAILKAHTS KVKDAYLLKM QAYRQAIQLK CAQYKIDMVD ADIAKGFYPI LQAYLIKRQK MS
|
| |