Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3993 |
Symbol | |
ID | 4071129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4722858 |
End bp | 4724033 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986020 |
Product | von Willebrand factor, type A |
Protein accession | YP_593067 |
Protein GI | 94971019 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.889697 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG TCCGCATCCT TCTCTGCCCC TTGGTGGGCT TTTGCCTGCT TGCCTCCATC GCGACGGCAC AGGACAAACC GACGCTACGC ACTTCGTCTC CACCGGCAGA AGAGCGCCAG CCAGAAGCAC AGCCTGACAC CCCGTCGTTT CACGTTGAGG TAAAGGAAGT CACCCTGCCG GTGACGGTGC GCGACAAGCA CGGCAAGATC GTCCAGACGC TCAATAAAGA AGACTTCAGC CTGGTGCAGG ACGGTAAGAC GCAGACGATC ACCCAGTTCC GTCGCGACAC CAAACTGCCG CTGACGCTTG GCCTTTTAGT GGATACGAGC TACAGCGTGC GCGACGAGTT GCCGGCCGAA AAAACGGCAA GCGAGAAGTT TCTCGACGAT ATGCTGGCGC AACCGAAAGA CCAGGCGTTC CTTATCCACT TCGACCGCGA AGTGGAGTTG ATGACGGACC TGACCTCGTC GAAAGACAAG CTGCACAGGG GCATTGGCGA GTTGGAGACT TCCGGCCCTC CATCGCAAAG CAGTAGCGAT GACGGCCAAC GCCATCGGCG GGGAGGAACG CAGCTCTACG ACGCCATCTA CCTGGCGGCA TCCGAGATCT TGCAGAAGCA GCAAGGGCGG AAGGCAATCG TCGTTCTTAC TGATGGCGAA GATCGCGGCA GCAAGGAAAC TCTGACCGAC GCAGTGGAGG CAGCACAGCG CGCGGACGCG ATTGTCTATG CGATTTACTT CAAGGGAGAG CAGGAGCAAA GCCGGTGGGG CAACGGAGAT CACGGTAACC GTGGCGGCAT GGGTGGCCCG CGGATCGGCT ACCCAGGTGG AGGTGGCGGT TATCCGGGTG GTGGCGGCGG ACGATACCCC GGCGGCGGTG GTGGTCGCGG TGGCGAGCAA CGCGAGGCTC GGTTAGATGG GAAGAAGATC CTGACCGAAA TCGCGAGCAA GACCGGCGGA CGGATGTTCG AAGCCAGCAA GAAGGAGAAC GTCGAAGCAA TCTATGCGCA GATCGCCGAA GAACTGCGTA GCCAGTACGT GTTGGCGTAC ACGCCGGACC ATTCCAGTGC CGATGCCGGC TATCACCGTG TAACCGTTGC GGCGAAGGAC AAAGAACTGA AGATCCAAAC CCGCGAAGGG TTCTACATCC CCGAACAAAC CACGGCAACG AAGTAG
|
Protein sequence | MSNVRILLCP LVGFCLLASI ATAQDKPTLR TSSPPAEERQ PEAQPDTPSF HVEVKEVTLP VTVRDKHGKI VQTLNKEDFS LVQDGKTQTI TQFRRDTKLP LTLGLLVDTS YSVRDELPAE KTASEKFLDD MLAQPKDQAF LIHFDREVEL MTDLTSSKDK LHRGIGELET SGPPSQSSSD DGQRHRRGGT QLYDAIYLAA SEILQKQQGR KAIVVLTDGE DRGSKETLTD AVEAAQRADA IVYAIYFKGE QEQSRWGNGD HGNRGGMGGP RIGYPGGGGG YPGGGGGRYP GGGGGRGGEQ REARLDGKKI LTEIASKTGG RMFEASKKEN VEAIYAQIAE ELRSQYVLAY TPDHSSADAG YHRVTVAAKD KELKIQTREG FYIPEQTTAT K
|
| |