Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2506 |
Symbol | |
ID | 4069875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2962825 |
End bp | 2963913 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637984523 |
Product | von Willebrand factor, type A |
Protein accession | YP_591581 |
Protein GI | 94969533 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.179837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGCA GGAATTGTAT GGTGCACGTG CCAGTTGCCG CGTGGATCCT GCAAGTCTCG CGCGCCGGAG TGTCGGCGCC ATCTCGCTTT CGCCTCACCC TTCTCGGATG GATGATGTTA CTGGCGGTGA TCGTCGGTTT TGCAGTTCCC GGGCTCGCGC AGGTGGACAG TAACGAGGTC CACGTGCAGC CTCGCGAGGC GCCAAAGCCT CCTACCCCGC CGCAAGGCGA TCCTGCTGAC GTGAATACCC ATACCCGTCC GATGCGGGTG GATGTGAATA TCGTGCTGGT GCCGGTAACC GTGACCGATC CGGACAACCG GCTGGTAACC GGGCTTGAGA AAGAGAATTT CGAAGTTCTG GACCAGAACA TTCCGCAACA GATCCGGCAC TTCTCGAGCG AAGACGCTCC GGTGTCCATC GGCGTGATTT TCGACATGAG CGGGTCGATG TCGAACAAGA TTGATAAATC GCGCGAGGCG ATTGTCGAGT TCTTTAAGAC CGCCAATCCG GATGACGAGT TCTTTGTGGT CGCCTTCAAC GACAAGCCGG AAGTGTTGCA GGACTTCACC AACAGGATTG AGGATATCCA GGAGAAGTTG ACGATTCTTC AGCCGAAAGA CCGGACGTCG CTGCTCGATG CCATTTACCT GGGCATGAAC AAGATGCGGC AGGCGAAGTA CGAACGGAAG GCGCTGCTGA TCATCTCCGA TGGCGGCGAC AACCATAGCC GGTATACGGA AAACGAGATT AAAAGCATGG TGCGCGAGGC CGATGTGCAG ATTTATGCCA TCGGAATTTA TGACCTGGCG CCGACGACAA CGGAAGAGAT GGCGGGGCCA GCACTGCTCG GGGAAATCTC TGATTGGACC GGCGGACGTA TGTTCCCGAT TGATAACGTC AATGAATTGG CGGACGTAGC CACAAAGATA GGAGTAGAGC TGCGCAACCA ATATGTACTC GGATACCGTC CAAGTAAACC AGCGAAGGAT GGCAAATGGC GGAAAATCAA GGTCCGTCTG AACCCGCCTA AGGGCTTGCC TCCGCTCCAT GTTTTCGCGA AGACTGGTTA CTATGCACCT TCGGAATAG
|
Protein sequence | MGSRNCMVHV PVAAWILQVS RAGVSAPSRF RLTLLGWMML LAVIVGFAVP GLAQVDSNEV HVQPREAPKP PTPPQGDPAD VNTHTRPMRV DVNIVLVPVT VTDPDNRLVT GLEKENFEVL DQNIPQQIRH FSSEDAPVSI GVIFDMSGSM SNKIDKSREA IVEFFKTANP DDEFFVVAFN DKPEVLQDFT NRIEDIQEKL TILQPKDRTS LLDAIYLGMN KMRQAKYERK ALLIISDGGD NHSRYTENEI KSMVREADVQ IYAIGIYDLA PTTTEEMAGP ALLGEISDWT GGRMFPIDNV NELADVATKI GVELRNQYVL GYRPSKPAKD GKWRKIKVRL NPPKGLPPLH VFAKTGYYAP SE
|
| |