Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0572 |
Symbol | |
ID | 6408222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 621032 |
End bp | 622150 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642710485 |
Product | von Willebrand factor type A |
Protein accession | YP_001989607 |
Protein GI | 192289002 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA TCACCTTCCA AACTGCGCTG ACCTTGGCGG CGCTGGCGCT TCCACTCTCC TTCGCGACGG CCGCGCAGGC GCGTCCGTCG GTCGAAGTCG CATTCGTGCT CGACACCACC GGCTCGATGA GCGGCCTGAT CGAAGGCGCC AAGCGCAAGA TCTGGTCGAT CGCCACCACG ATCCTCGACG ACAATCCCGA CGCCGACATC CGGATGGGGC TGGTCGCCTA TCGGGACATC GGCGACGACT ACGTGGTCCG CAGCGTCGAT CTCACGACCG ATATCCAGGA CCTCTACGGC CAACTCCTGC AACTGCAGGC GCGCGGCGGC GGCGACTGGC CCGAGAGCGT CAACGAGGCA CTCGATACGG CGATCAACAA GCTGCATTGG CGGCAGGGCG GCGACACCCG CCGCATCGTA TTCCTGGTCG GCGATGCTCC GCCGCATATG GACTACGCGC AGGATACCAA ATACCCGGAG ACCCTGGCGG TCGCCCGGCA GAAAGACATC ATCGTCAACG CGGTGCAGGC GGGCGACGCA CGCGACACCG CGCGGGTGTG GCATGAAATC GCTGACGGCG GCCGTGGCCG CTATATTCCG ATCCCGCAGG ATGGCGGGCA GATCGTGGTG ATCCAGACGC CGTACGACGA CGACATCATC ATCCTGCAGA AACAGATCAA CGGCACCGTG ATCCCCTACG GCCCGGCGCC GATGCAGCGG CGGACCGAAG AGAAGACCGG ACAACTCGCC AAGGTCGCAG CCTCGGCACC GGCGTCGGCG TCAGACATGG CCAGCTACAT CAACAAGCGC GCCCGCAGTT CGTCCGAGGC CGTCACCGGC GGAGGCGACC TGGTCAGCGA CGTGCAGGCC GGTCGACAGA AGCTCGACCA GGTCAAGGAG GAGGATCTGC CGCCCGAGCT GCGCGCGCTG CCGGCCGAAC AACGTGCCGC CAAGCTCGAC GCGCAGATGA CGGCCCGCAA GGCGCTCAAC GACAAACTCG CGGCCCTGGT CAAGCAGCGC GATGCCTATC TGCTGGCCCA GCGCGACAAG CAGCCGAAGC AGGCCTCGTC GTTCGACCGC GAGGTCGAGG CGACCCTGAA GGCGCAGCTG AAGCGATAA
|
Protein sequence | MKRITFQTAL TLAALALPLS FATAAQARPS VEVAFVLDTT GSMSGLIEGA KRKIWSIATT ILDDNPDADI RMGLVAYRDI GDDYVVRSVD LTTDIQDLYG QLLQLQARGG GDWPESVNEA LDTAINKLHW RQGGDTRRIV FLVGDAPPHM DYAQDTKYPE TLAVARQKDI IVNAVQAGDA RDTARVWHEI ADGGRGRYIP IPQDGGQIVV IQTPYDDDII ILQKQINGTV IPYGPAPMQR RTEEKTGQLA KVAASAPASA SDMASYINKR ARSSSEAVTG GGDLVSDVQA GRQKLDQVKE EDLPPELRAL PAEQRAAKLD AQMTARKALN DKLAALVKQR DAYLLAQRDK QPKQASSFDR EVEATLKAQL KR
|
| |