Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3349 |
Symbol | |
ID | 5210326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4199767 |
End bp | 4201056 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596947 |
Product | von Willebrand factor, type A |
Protein accession | YP_001277660 |
Protein GI | 148657455 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAG AACGAACACC GGAAACCGAT CGGTTGTATG AACAGGGTAT GGCGTGGATG CGCGAAGCGC GCTGGGAAGA GGCAATCGCC GCTCTATCGC AGGTGCGCGC GCTGTCGCGC GCCTACCCGG ATGTTGATGC GCTTATCGCC GATGCGCAGT TGAAACTGGA GATCGAGCAG GTGGGCGTAC CTCCAGCATT GGCGCCGCCA CGTCCACATC CGCCGCGTGC AGCGCTGATC GGCGGGTTGA CCGCGCTGAC ATTGATCGTT GCGGCGGTCA TTATCGCGCT GATCGGTCGA ATCGACCCAT CGAGCGTCGG CAGTGCGGCG CAACCGATCA TGCTCAGCAT CGGCTTTCCA ACCATTGCCC CTACCAGAAC GCCGACCCCT GCGCCGACCG CCACGCCAAT CCCGGTGCGC CCAACTGCAA CGCCAGCGGC ATCTGTGGTC CTGCCTGGAA CGCTGGGAGT GCGCATGGCG TCGGGAGAGC GATTACCCGG CGCCACCCGA AACCTGGCAA TCATTCTGGA CGCTTCCGGC AGTATGCTGG CGCGCATCGA TGGTGCGCCC AAGACGGTGA TCGCTCGCCA GGCGCTGATT GCGCTGGTTG AACGTCTGCC GGCAACCACG AATGTTGCAT TGCGCACCTA CGGTCATCGG CGCGCCGACG ATTGCAGCGA TACCGAACTC GTTCAGGCGC CTGCCCCCAT CCAGCGCGCC GATCTGATCA ACCGCATCAA CGCTATTCGA CCGGTCAACG GTGGACGCAC TCCTATAGCG CAGTCGCTGG AAGATATGGC GCGAGACCTG GCTGGCGTCG ATGGCGAGGT GCTGATCGTG CTGGTCAGCG ACGGTGATGA AACCTGTGGC GGCGACCCGG TTGCAACGGC GGCAGCGCTG CACACCGCCA ATCCCCGTTT GCGGGTGAGT GTGATTGGGT TCAATATCGA ACAGGAAGAG TGGCGCCGGC GCCTGGAAGG AATAGCCGCG TATGGCGGAG GGGCGTACTT CGATGCTGCG AATGCCGTGC AACTCGCCGA TGCCCTCGAA CAGGCGGTTG CGCTGACTTA CCGTGTGATC GACAGTCAGG GAAACCAGGT CTACCAGGGA CGGATCGGGA ACACGGTTAC CCTGCCACCC GGCGCCTATC GTGTTGAAAT CAGCGGTGAT GCTGCGATAA CCTTTGAGAC AGTCTTTGTT GAAAGTGGAC ACACCACATT TGTCGAACTG CGTGATGAAC AGGGGGCGTT GCGCGCCAGC ATCATCGCGG GTGATGACGT AGCGCCGTGA
|
Protein sequence | MNTERTPETD RLYEQGMAWM REARWEEAIA ALSQVRALSR AYPDVDALIA DAQLKLEIEQ VGVPPALAPP RPHPPRAALI GGLTALTLIV AAVIIALIGR IDPSSVGSAA QPIMLSIGFP TIAPTRTPTP APTATPIPVR PTATPAASVV LPGTLGVRMA SGERLPGATR NLAIILDASG SMLARIDGAP KTVIARQALI ALVERLPATT NVALRTYGHR RADDCSDTEL VQAPAPIQRA DLINRINAIR PVNGGRTPIA QSLEDMARDL AGVDGEVLIV LVSDGDETCG GDPVATAAAL HTANPRLRVS VIGFNIEQEE WRRRLEGIAA YGGGAYFDAA NAVQLADALE QAVALTYRVI DSQGNQVYQG RIGNTVTLPP GAYRVEISGD AAITFETVFV ESGHTTFVEL RDEQGALRAS IIAGDDVAP
|
| |