Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3344 |
Symbol | |
ID | 4071262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3966515 |
End bp | 3967585 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985366 |
Product | von Willebrand factor, type A |
Protein accession | YP_592419 |
Protein GI | 94970371 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCGTCG CGCAACAACC GGCGGAACTT CCTTCGGCGC CTTCGGCTAC GCTGCAACAG CAGCAGCAAA AGGACGCGGC TGCACAGGCG GCAGCGAATC CTGCACCGCA GCAGTCCTCG TCGCCTGCGT TACCGCTTCC GGCGATCCCG AAAGACGCTC CGTCAGCCCC GGCCCCCAAG CCCCCTACCC CACAGAACGA TGCCAAGGCC GACACGCCTT CCGACGACCA GGCCATCACC AAGATCGTTG TCGGTGTGAA CGAAGTGAAC GTCATCTTTA CGGTGACCGA CAAGCGCAAT CGCTTCGTGA AGGACCTCAG CCAGCCCGAC TTCAAGTTTG TTGACGATGG CAAGCCAGTC GCCTCCATCC GCGACTTCCG CAAGGAAACG AACCTGCCGC TGCGCGTCGG CTTGCTGATT GACTCCAGCA ACTCGATTCG CGACCGTTTC AAGTTCGAGC AGGAATCGGC GATCGAATTC CTGAACCAGA TCATTCGTCC CAAGTTCGAC AAGGCGTTCG TCATCGGCTT CGATACCACA GCCGAAGTAA CCCAGGATTT CACCGACGAC ACCGACCTTC TCGGTAAAGG TGTACGCATG CTGCGTCCGG GCGGCGGTAC CGCCATGTAC GACGCGATCT ACTACGCCTG CCGCGACAAG TTGCTGAAGG AGAATGGCAA CACCGCCATG CGCAAGGCGA TGATCCTGCT CAGCGACGGT GAGGACAACC AGAGCCGCGT CACCCGGGAA GAAGCGGTCG AGATGGCACA GCGCGCGGAA GTCATCATTT ACGCGATTTC CACCAACACC AGTGGCCTGA AGCTGCGCGG CGACAAAGTG CTCGAGCGCT TCGCTGAGGC AACGGGCGGA CGCGCCTTCT TCCCCTTCAA GATTTCCGAC GTAGCGAACG CCTTTTCGGA AATTCAGGAC GAATTGCGCA GCCAGTATGC GGTGTCGTAC GTGCCCGCCG ACTTCAAGAA AGACGGTCAC TATCGCGCGA TCGAGATTGC GGCCGACAAC AAAAAGTACA AGGTCCGCGC GCGTAAAGGC TACTACGCTC CGAAAAACTA G
|
Protein sequence | MAVAQQPAEL PSAPSATLQQ QQQKDAAAQA AANPAPQQSS SPALPLPAIP KDAPSAPAPK PPTPQNDAKA DTPSDDQAIT KIVVGVNEVN VIFTVTDKRN RFVKDLSQPD FKFVDDGKPV ASIRDFRKET NLPLRVGLLI DSSNSIRDRF KFEQESAIEF LNQIIRPKFD KAFVIGFDTT AEVTQDFTDD TDLLGKGVRM LRPGGGTAMY DAIYYACRDK LLKENGNTAM RKAMILLSDG EDNQSRVTRE EAVEMAQRAE VIIYAISTNT SGLKLRGDKV LERFAEATGG RAFFPFKISD VANAFSEIQD ELRSQYAVSY VPADFKKDGH YRAIEIAADN KKYKVRARKG YYAPKN
|
| |