Gene Information |
Locus tag | Rcas_4158 |
Symbol | |
ID | 5541669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5380656 |
End bp | 5381945 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640896269 |
Product | von Willebrand factor type A |
Protein accession | YP_001434207 |
Protein GI | 156744078 |
COG category | |
COG ID | |
TIGRFAM ID | |
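The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5380656
end_bp = 5381945

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1290 429
```

Under these assumptions the computed values agree with the reported Gene Length (1290 bp) and Protein Length (429 aa).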
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.590263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACACAC AACGAACCCC CGAAACCGAT CGCCTGTATG ATCAGGGAGT CGCCTGCATG CGCGCGGCGC GTTGGGAAGA GGCGATTGTC ATCCTGAGCC GGTTGCGTGA TTTGACCGGC GCCTACCCTG ATGTCGAGGC ATTGATTGCC GATGCGCAAC TGAAAATGGA AATCGAGCAG GCAGGCGTGC CTGCGGCTGC GCCGCCGCCG AAATCGCGCC CGCCGCGCGC AGTCGTCTTC GGCGGGATAG CAGCTACCGC GCTGATCAGC CTGATCGTCG CCGCTATCCT GTTCGTTCGT CTGGATGCGT CGAGTGTGCG CGTTGCGGCG CAACCGCTTG TGCTCAACCT GACATTTCCC ACCATTGCGC CAACCAGAAC CTCAACACCG GCGCCGACGA AGACACCACA GCCTTTGCCG CCGACTGCCA CATCGCTGCC GGATGCGGTC CTTCCGGGAA CGCTCGGTGT GCGACTGGCG CCGGGAGAGC GCACAACTCG CATTACCGAA AACATAGCGG TGATCCTGGA TGCGTCCGGC AGCATGCTGG CGCGCCTCGA CGGGACGCCA AAGACGGTCA TCGCCCGCCA GGCGCTCATC GCACTGATAA ACCGTCTCCC CGAAACCACA AACGTCGCAC TGCGCACCTA TGGGCACCGG CGCGCCGATG ATTGCAGTGA CACCGAACTC ATCCAGGCGC TCGCTCCTTT GCAGCGCGAT GCTCTTATTG CGCGCATCAA CGCCATCCGT CCGGTCAACG GCGGGCGCAC ACCAATCGCC CAATCACTGG CAGATATGGC ACAGGACCTT GCAGGCATCG AGGGAAATGT GCTGATTGTG CTGGTGAGCG ACGGCGACGA AACCTGCGGC GGAGATCCGG TCGCAACTGC CTCGATGTTA CGCGCCGCCA ATTCGCAATT GCGGATCAGC GTCATCGGGT TCGACGTTGA GCAGGAAGAG TGGCGCCGCC GGTTGGAGGG CATTGCCGTA GCCGGCGGCG GAGCATACTT CGATGCCTCG AATGCCGAAC AACTCGCCGA TGCGCTCGAT CAGGCAATTG CGCTGACCTA CCGTGTTTTC GACGCACAGG GCAAAGAAGT CTACCAGGGT CGCATCGGCA GCGATGTCCG GTTGGCGCCG GGGGTGTATC GCATCGAAAT CGGCGGTGAT GCCGAACTGA CGATCGAGAC GGTGATCGTC GAAAGCAACA CAACGACATC TGTGGAACTA CGCGAAGATC AGGGCGAGTT GCGCGCCAGC ACCATAACAG ACGATGGTGT GCAGCCGTGA
Protein sequence | MNTQRTPETD RLYDQGVACM RAARWEEAIV ILSRLRDLTG AYPDVEALIA DAQLKMEIEQ AGVPAAAPPP KSRPPRAVVF GGIAATALIS LIVAAILFVR LDASSVRVAA QPLVLNLTFP TIAPTRTSTP APTKTPQPLP PTATSLPDAV LPGTLGVRLA PGERTTRITE NIAVILDASG SMLARLDGTP KTVIARQALI ALINRLPETT NVALRTYGHR RADDCSDTEL IQALAPLQRD ALIARINAIR PVNGGRTPIA QSLADMAQDL AGIEGNVLIV LVSDGDETCG GDPVATASML RAANSQLRIS VIGFDVEQEE WRRRLEGIAV AGGGAYFDAS NAEQLADALD QAIALTYRVF DAQGKEVYQG RIGSDVRLAP GVYRIEIGGD AELTIETVIV ESNTTTSVEL REDQGELRAS TITDDGVQP
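The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the pipeline used to produce this record. It assumes the whitespace every ten characters is display formatting only. It relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAACACAC AACGAACCCC CGAAACCGAT"
print(translate(prefix))  # MNTQRTPETD
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.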