Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3199 |
Symbol | |
ID | 8138551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3712061 |
End bp | 3713167 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870804 |
Product | von Willebrand factor type A |
Protein accession | YP_003022984 |
Protein GI | 253701795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 0.930737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC TTCAGGGGCA ACCCAAGGGG CAGATACTGC TAGTCGTCGC CGCGGTGATG TTCTTCGGCA TCTTCCTCGC CGCCCTGGCG GTCGATGCCG GCAGGGCATA TGGGGTGAAG GCGAAGCTCC ATGCCGCGGT GGACGCTGCG AGCTATGAGG CGGCCAAGGC CCTGGCGCAT GGGGAAGATG AGGACGACAT GGAGGAAAAG GCGAGCGAGG CAGCGCGCGA CTACTTCAGG GCGAACTTCC CTGCTGCCTA TTTCGGCGCG CAGTGCAGCG GACCGGAACT GGAGCTGAGC GAGCGGAAGT CGGGAAAAAA GATGCGTGCC CTCACCGTTT CCGCCACAGC GACGCTCCCC AACGTCTTTG CCGGGATCCT CGGCTGGGAC AGCATCGATC TTCCTGCCCA GTCGCGCGCG GTCAGGACGG ATGCCGACGT GGTGCTGGTG CTGGAGTCCT CGGACGTGCT CAGGGAGTCT TTTCCTCAGG TGAAGCAACG GGTGGCTAAC TTCAGCGACC GCTTCAGCCA ACACTATGAC CGGATGTCGC TGGTGACCTT TGCCGCGGGA GCCGACCCGG TCATTTCCAT CTGTGGCGTC TATAAGCCCG CAAAGGACCG CCCCGGAGCG GGGACCTTCA ACTGCGGCAG TGGGTATCAG AAAAGCAATT TCGCCAAAGC CCTTCTGGAG TTGAGCCCGC AGACGACCGG CGCAGCAGCG GCTCCCGAGG AGGCGATGAA GCAGGCACAG GCCCAGTTGG ACGGCCTGAG TAGCGACCTG CGGGCGGAGA AGAGGGCGAT CGTGCTTTTG GCAAGCGATG TAGCCGCAAG CAACATTAAA GAAGAGACCG CCGCTGCGGC TCGCAAGGAG CAGGTGTTCA TTTATGCGGT GGAGATAGCG GGGTCCCTTA AGGCAAGCAC ACCTGCGGGG GGCGCTAACG GCCGGAGCGG GAGCGAAAAC ATGAAGCTGT TCGCCAACAC CAAGGACTCC GGAGGCCACG AAAAGGGACA ACCGACCGGC TCGTACTGCG CAGCGACCGA CCTGCAGCAG TTGGAGCTGT GCCTGGAAAA TATCGCCAAC GGCATGACCG TGAGCATTGA GCAGTAA
|
Protein sequence | MKILQGQPKG QILLVVAAVM FFGIFLAALA VDAGRAYGVK AKLHAAVDAA SYEAAKALAH GEDEDDMEEK ASEAARDYFR ANFPAAYFGA QCSGPELELS ERKSGKKMRA LTVSATATLP NVFAGILGWD SIDLPAQSRA VRTDADVVLV LESSDVLRES FPQVKQRVAN FSDRFSQHYD RMSLVTFAAG ADPVISICGV YKPAKDRPGA GTFNCGSGYQ KSNFAKALLE LSPQTTGAAA APEEAMKQAQ AQLDGLSSDL RAEKRAIVLL ASDVAASNIK EETAAAARKE QVFIYAVEIA GSLKASTPAG GANGRSGSEN MKLFANTKDS GGHEKGQPTG SYCAATDLQQ LELCLENIAN GMTVSIEQ
|
| |