Gene Information |
Locus tag | TM1040_3225 |
Symbol | |
ID | 4075367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 222584 |
End bp | 224017 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638004734 |
Product | von Willebrand factor, type A |
Protein accession | YP_611461 |
Protein GI | 99078203 |
COG category | |
COG ID | |
TIGRFAM ID | |
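The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here, not part of any database API.

```python
# Values copied from the record above.
START_BP = 222584
END_BP = 224017


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Under these assumptions the results agree with the Gene Length (1434 bp) and Protein Length (477 aa) rows.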
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAACATA AAATCATGAC AGCCGCCATC TGTGCCGCAC TCGCCGCGCC TGTTGCTGCG CAAGACCTGA CCCGCAGTAC TTTGGTGTTG GATGCCTCTG GGTCGATGTG GGGACAGATT GACGGTGTTG CCAAGATCAC CATCGCTCAG GACGTGATGC AGCACCTTCT AAAGACACTC CCAGAGAACC AAGAGCTCGG CCTGATGGCC TATGGACATC GGCGCAAAGG CGATTGCAAC GATATCGAAC AGTTGATCGC CCCTGCCGCA GGATCTCGGC AGGCCATTTC ACAGGCGGTC ACGCAGATCA GTCCAAAAGG CAAGACACCG CTCTCTGCGG CGGTGATGCA AGCCGCTGAC GCTTTGCGAT CCAGTGAGGA GAAAGCAACC GTCATCCTCA TCTCGGATGG CGAGGAAACC TGTGGGCTTG ATCCTTGCGC AGTTGGCGCA GAACTCGAAG CGCGCGGTGT GGACTTTACC CTGCATGCGA TTGGCTTCGG CATCGCGGAT GACGCAGCGC GCGCGCAGTT GCAGTGCTTG GCTGAGAACA CCGGAGGTTT CTACCGGGAC GCGTCGAGCG CATCTGAATT GACGGCGGCT TTGGCACAGG TAGCGGTGAC CACATCCACG CCCGACCCGG CATCGGCTAC TTTGGTAGGG CCTGCTACGG CACAGGCGGG TCGGACCATC GACATCATCT GGCAGGGCCC TGGCGAGGAA GGCGACTGGA TCGGGACCTT GGCCCCCGGC GCCGATGTCG GTGCATTTTC CTCACGCATT GGGGTCGAGC ACGGCAACCC TGCGCCGATG CCAACCCCGC CGCAGGCGGG CACCTACGAG ATCGTCTATG TGCGCGCCGA AACCGGAGCC GTCCTTGCGC GCGCGCCGCT TGAAGTAACG CCCATGGTCG CGTCGGTCAC AGCGCCATCT ACGGGTCTCA CGGGGGGGAC TGTGACGGTC ACATGGCAAG GGCGCGGATC AGAGAAAGAC TTTATCGGCA TCGCCCCAAA GGCCGAGGGT TTCCCGGCGA CCACCTTCTA TTTCGAGACC ACTGGACCAG AACAGCAGGC TTTGCGCCTG CCATCGCAGC CGGGCGACTA CGAGATCGTT TTTGTTGCCA CGACCGATCC ATGGACCGTG CTTGATCGCA CCTCGATCAC ATTGTCAGAT CCGGATTTGT CGCTGAAAGC GCCAAGTCAA GTGGCTGCAG GCCAGGACTT TGCCGTGTTG TGGAACGGGA TCGCGCCAAA CCCTGCGGAT TATATCGCGC TTGCCAGGGC CGGAGACGAC CTGCCGCATC TCGTCAGCTA TGCGCATACC GAAAGCCACC TGACCCGCCT GACCGCGCCA GAGGAGGCCG GTGCCTATGA GTTGCGCTTC TTCTATGCCG AAGGCGACCG CATCGTGACC GCTCAGCCGA TCATCGTCGA CTGA
Protein sequence | MQHKIMTAAI CAALAAPVAA QDLTRSTLVL DASGSMWGQI DGVAKITIAQ DVMQHLLKTL PENQELGLMA YGHRRKGDCN DIEQLIAPAA GSRQAISQAV TQISPKGKTP LSAAVMQAAD ALRSSEEKAT VILISDGEET CGLDPCAVGA ELEARGVDFT LHAIGFGIAD DAARAQLQCL AENTGGFYRD ASSASELTAA LAQVAVTTST PDPASATLVG PATAQAGRTI DIIWQGPGEE GDWIGTLAPG ADVGAFSSRI GVEHGNPAPM PTPPQAGTYE IVYVRAETGA VLARAPLEVT PMVASVTAPS TGLTGGTVTV TWQGRGSEKD FIGIAPKAEG FPATTFYFET TGPEQQALRL PSQPGDYEIV FVATTDPWTV LDRTSITLSD PDLSLKAPSQ VAAGQDFAVL WNGIAPNPAD YIALARAGDD LPHLVSYAHT ESHLTRLTAP EEAGAYELRF FYAEGDRIVT AQPIIVD
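The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares; table 11 differs mainly in the start codons it permits, a detail this sketch does not handle. The names `translate` and `gc_fraction` are hypothetical helpers, and the whitespace stripping is there because the record prints bases in blocks of ten.

```python
from itertools import product

# Codon table built in the conventional TCAG order. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in the record.
    print(translate("ATGCAACATA AAATCATGAC AGCCGCCATC"))
    # Last 9 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GTCGACTGA"))
```

The first call yields the opening ten residues of the protein sequence (MQHKIMTAAI) and the second yields its final two (VD). Pasting the full gene sequence into `translate` and `gc_fraction` is one way to cross-check the protein sequence and the GC content row.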