Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0463 |
Symbol | |
ID | 8752115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 509018 |
End bp | 510661 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003407616 |
Protein GI | 284989062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCGCC ACGCCGCCAC CCCAGCCGCG CGCCGACCGG CGCCGCTGAC GCCGCAGGTC CTCATCGCGC TGGCGGTGGC CGCGATGGTG CTGCTGGCCG GCGGCATCAC GTGGTGGGCC GTCGGCGCCG GGAGCGGGAA CTGCGACGGC ACCGCGGTCG TGCGCGTCGC CGTCGCCCCC GAGCTGGCGC CCGTCGCCGA GCAGGTGCTC ACCGACGCCG AGGGCCTGCG CCCCGAGGAC TGCGCCTCCG CCCAGGTCAC CGCGCAGGAG CCGGTGCAGA CCGTCGGCGA CCTGGGTGCG CTGGACGCCG GGGACCTGCC GCACCTGTGG GTGCCCGACT CCTCGCTGTG GCCGGCCCGC GCCGGGCACG CCGCCCTGGA GACCGCCGGA TCGGTCGCCA CCTCCCCCGT GGTGCTGGCC ACCAGCCGCG CGGCCGTGGA CGCGCTGGGC TGGACCGCCG AGGCGCCGGG GTGGGGTGCG GCGCTGGTCA GCGAGCAGGG CATCGCCGTC CCGGACCTGG CGACCAGCGC CGAGGCCCTC GCCGCGCTGG GCGCCGTGCG GACGTCGCTG GGCGGCGGCG AGGACGCCGA CAACGCCGTG GTGCAGGCGG TGCTCGCCGC CGAGCGCGGA CCGTCGGTCT CCCCGGCCGA CGCGCTGGCC GCCGGCGGCG AGGGCGCGGC CGACGCGCCG CTGGTGCCGG TCAGCGAGCA GGAGGTGTGG GCGGCCAACG CCGACGCGGA GGACCCGCAG CTGGTCGCCG TCTACCCGGA GGAGGGCTCA CCGGGGCTGG ACTACCCGCT GGTCCGGGTC GGCACCGCCT CCGCCACCGA CGCCGCCGTG GTGGACGCCG CCGTGCGGGC GCTCACCTCG GACGCCGCGC GCTCCGCCGT CACCGAGGCG GGTTTCCGTG ACGCCGACGG CACCGCGCCG CCCGGCGCCG AGGCGGCGGG GATCCGCGAG GCCGCTCCGC GGTCCCTGCA GCTCGACCCC GCCGAGGTGC AGGGGCTGCT GGCGCAGCTG TCGGAGCTGG CCGCGCCGTC CCGGATCCTC ACCGTCTTCG ACATCTCCAC CTCGATGGAG GCCCCGGCCG GCGACGGCAC CCGGGCCACG CTCGCCCGGG ACGCCGCCAA GAGCACGCTC ACGCTGGTGC CGGGCAACTT CGCGCTCGGG CTGTGGTTCT TCGCCGCCGA GCTGGATGGC GAGCGCGACT GGACCGAGGT CGTGCCCACG CGTCAGCTCG AGGCGGAGGT CGAGGGCACC GTCCAGCGCG ACCTGCTCGA CGAGGAGCTC GACACCATCC CCGACCGGCT CAGCCCCGGC GGCACCGGCC TGTACGACAC CACGCTGGAC GCGGTGCGGG CGGCGCGCTC GGACTTCGAC CCGCGCGCGG TCAACAGCGT GCTGGTGGTC ACCGACGGGA CGAACGAGGA CAGCGGGGGC GTGGACCTCG ACGAGCTGCT GGCCACGCTG CGCAGCGAGG CCGACCCCGA TCGGCCGATC AAGGTCATCG GTGTAGCGCT CGGCCCGGAC GCCGACCTGG GTGCCCTCGA GCGGATCGCG GACGTGACCG GCGGCGCGGC CTACTCCGCG GTCGACCCGA CGGACCTGCA GACCGTCCTG TTCGACGCGC TGCGGCAGCG GTGA
|
Protein sequence | MGRHAATPAA RRPAPLTPQV LIALAVAAMV LLAGGITWWA VGAGSGNCDG TAVVRVAVAP ELAPVAEQVL TDAEGLRPED CASAQVTAQE PVQTVGDLGA LDAGDLPHLW VPDSSLWPAR AGHAALETAG SVATSPVVLA TSRAAVDALG WTAEAPGWGA ALVSEQGIAV PDLATSAEAL AALGAVRTSL GGGEDADNAV VQAVLAAERG PSVSPADALA AGGEGAADAP LVPVSEQEVW AANADAEDPQ LVAVYPEEGS PGLDYPLVRV GTASATDAAV VDAAVRALTS DAARSAVTEA GFRDADGTAP PGAEAAGIRE AAPRSLQLDP AEVQGLLAQL SELAAPSRIL TVFDISTSME APAGDGTRAT LARDAAKSTL TLVPGNFALG LWFFAAELDG ERDWTEVVPT RQLEAEVEGT VQRDLLDEEL DTIPDRLSPG GTGLYDTTLD AVRAARSDFD PRAVNSVLVV TDGTNEDSGG VDLDELLATL RSEADPDRPI KVIGVALGPD ADLGALERIA DVTGGAAYSA VDPTDLQTVL FDALRQR
|
| |