Gene Information |
Locus tag | Rcas_3821 |
Symbol | |
ID | 5541324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4992984 |
End bp | 4995794 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895931 |
Product | von Willebrand factor type A |
Protein accession | YP_001433877 |
Protein GI | 156743748 |
COG category | [S] Function unknown |
COG ID | [COG5426] Uncharacterized membrane protein |
TIGRFAM ID | |
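The coordinate and length fields in this table are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4992984
END_BP = 4995794


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2811 936
```

Under these assumptions the computed values match the Gene Length (2811 bp) and Protein Length (936 aa) rows.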
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.608594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTTT CGTTTCTTTA TCCCGAAGCC CTCTGGCTTG GTTGCGTTCT GCCACTCGTT TGGGGTGTCG CGCTCCTCGC TCCCGCGCAC ATTGCCGCCT GGCGACGATG GTTGAGCCTG GTTGTCCGCA CCATCATCGT GCTGGCATTG ATCGGCGCGC TTGCAGGCGC GCAACTGGTT CAACCTCCTG GAATCACCAC AACGATTTTC CTGCTCGATG GGTCGGACTC GGTTGCCGCA TCGCAACGTG CCCGTGCAGA GGCATTTATC GCGCGGGCGC TGGCGGTAAT GCCGCCCGAT GACCGGGCAG GGATCATCGT CTTTGGGCGT GAGGCGCTCG TCGAGCGGTT CCCTGCCCCC GAACGCACCT TTGGCGCGCC TGTAACGCGA CCCTTCGGCA GTGCAACCAA TATCGCGGAT GCGCTCCAAC TCGGTCTGAC GCTGCTTCCT GCAGAGGGGC ATCGGCGGTT GGTGCTGCTC TCCGACGGCG GCGAAAACCG CGGGACCGCC CGCCTGATTG CGCAGGACGC TGCTGCGCAG GGCATCCCCA TCGATGTGGC GCCGCTCACC GGCAGCGCCG ATGGATTGGA TGCGCAGATC ATCGGCGTGA CGTTGCCATC GACGGCGCGC GAAGGGCAGC GCCTGCCACT GCGCATTGAC CTGGAAAGCA ATACGTCCGT TGCAGGTCGC CTGATGGTCA CAGGTCCCGA TAGGACGACC GTTGCGACGA TACCGGTCGA GATCGGAACT GATCGGCAAA CTGTCGAAAT CCTGCTTCCC GAAGCGCCTG CCGGCTTCAA CCGCTACACA GCGTATCTTG AGGTTCCTAA CGATGCACGC ACCCAGAATA ACGCTATCGA GACCTTCAGC TATGTGCGGG GCACACCGCG TGTACTACTG GTTGCGCAAG CCCCCGATGA TGCCATTAGT CTGGAACGCG CGCTGCGCGC CGCTCGGATA GAAGTCACAG TCGTCAGCCC GGCATCCATA CCCGCCACAT TCGGCGAACT GATCCGGTAC GACGCGATTG CGCTGATCAA TGTGCCCCGC GCTCTGTTTT CCAACGAGAC GGTGCAACGC ATTGCGGCGT ATGTCCGCGA TTTCGGCGGC GGTCTGCTCA TGGTCGGCGG TCCGCAGTCG TTTGGTCCGG GAGGCTGGCG CGGTACGCCG GTCGAGGCTG CATTGCCGGT GACGATGGAT ATTCCTGAGC GGCAGCGCCA GCCGCCGGTC AGCATCGTGG TGGTGATCGA TATTTCAGGA AGTATGGCTG CGACGGAGGA TGGCATTCCC AAACTCTCGC TGGCGCTCGA AGGTGCGCGA CGGATCGCGG CGCTGTTGCG CGATGAAGAT GAGTTGACGG TCATTCCTTT CGATGACCGT CCCGGCGTCA TTGTCGGTCC GCTGCCGGGA TCACGGCGCG ATGTCGCTAT CGAGCAACTG AACCAGGTGC GTCTTGGCGG CAGCGGCATC AACATCCACG ATGCGCTTCG AGTTGCGGCG CGGTATACCC GCGCCAGCGA GCGCCCCGTG CGCCACATTA TCACGATTAC TGATGGAAAT GATACGACTC AGCAGGAAGG TGCGCTGGAC ATTGTGCGCT CACTGCACGA CGAAGGCGTG ACCCTGACCT CCGTTGCCAT TGGGCAGGGC GATCATGTGC CATTCATCCG TGATATGGCC GCCGTGGGCG GCGGGCGTAC CTTCCTGACG GAGCGCGCCG CCGATGTTCC CGATCTGTTG ACCGGCGAGA CGCAGACGAT CATGACACCA TATATCGTCG AAGGTTCATT GACACCGCAG 
CAGGCAATGC CGCACCCCTC CCTGCGCGGG ATCGACGCCC TGCCCAAACT CTACGGTTAT GTTCTGACGA CGCCACGCGA GACGGCGCAG ACAACGCTCG TTGCTCCCGA TGGTGATGTG TTGCTCGCGG GCTGGCAGTA CGGTCTGGGA CGTTCCCTCG CCTGGACGAG CGATCTCAGC GGGCGATGGG CGAAGGAATG GGTTGCATGG GACGCATTCC CGCACTTCAG CGTTCAACTC GTCACCTGGC TCCTGCCGCA GCAAACCGGC GATACCCTGA CCATCGCAAC ACACGCTGTT GGAGATGCCC TGGTCGTCGA GGCTATCACA CGCACACCCG ATGGCGCACC ACACATTGGA TTGAACGTTG AGTCCCGATT GATCGCGGCT GATGGCGCTA CTGTCGAAAC CACGCTGCGT GAAGTCGGTC CGGGACAGTA TCGTATATCA CTCGACAATT TGCCGGCCGG CGCCTATCTG GTGCAGATCG TTGCGCGTGA CGGGCAGGGG AGAGCGGTTG CCGGCGCGAC CGGCGGCGCC GCAATGCCGC TCAGCGCCGA GTATCGCCGT CAGGCGGGAG ATCGCTACCT GCTCGAAGAG ATCGCGCACA TCACCGGCGG ACGGATCGAC CCGCAGCCTC ATCAGGTTTT CGAGCCAGGA CGCGCGGCAC GCGGCATGGC GCGCGAAATT GGATTGCCGT TGCTCTGGCT GGCGCTCGTG TTGCTGCCGT TGGACATTGC GCTCCGGCGA GTCTTTATTC GACACGCGTC AATTGCCGCT GCACTGCGGC ACATTGGTCT GCGCGCGCTA GCCCGGCGCC TCGAACCGCA AGAAGCGTTG GACACCGGGG TTGCCTCATC GCTACCATCT CCACCTACCA CGCCGCCTCG TACAATCACT GTCAACCGCT CGCCTGCCAG CCCGCTCGTC CCATCCGCCA ATGAACTGGA GCGCCTCCGT GCTGCGCAGG AAGCAGCACG GCGACGATTG CGCGGCGAGG ATGAGGAATA G
|
Protein sequence | MGVSFLYPEA LWLGCVLPLV WGVALLAPAH IAAWRRWLSL VVRTIIVLAL IGALAGAQLV QPPGITTTIF LLDGSDSVAA SQRARAEAFI ARALAVMPPD DRAGIIVFGR EALVERFPAP ERTFGAPVTR PFGSATNIAD ALQLGLTLLP AEGHRRLVLL SDGGENRGTA RLIAQDAAAQ GIPIDVAPLT GSADGLDAQI IGVTLPSTAR EGQRLPLRID LESNTSVAGR LMVTGPDRTT VATIPVEIGT DRQTVEILLP EAPAGFNRYT AYLEVPNDAR TQNNAIETFS YVRGTPRVLL VAQAPDDAIS LERALRAARI EVTVVSPASI PATFGELIRY DAIALINVPR ALFSNETVQR IAAYVRDFGG GLLMVGGPQS FGPGGWRGTP VEAALPVTMD IPERQRQPPV SIVVVIDISG SMAATEDGIP KLSLALEGAR RIAALLRDED ELTVIPFDDR PGVIVGPLPG SRRDVAIEQL NQVRLGGSGI NIHDALRVAA RYTRASERPV RHIITITDGN DTTQQEGALD IVRSLHDEGV TLTSVAIGQG DHVPFIRDMA AVGGGRTFLT ERAADVPDLL TGETQTIMTP YIVEGSLTPQ QAMPHPSLRG IDALPKLYGY VLTTPRETAQ TTLVAPDGDV LLAGWQYGLG RSLAWTSDLS GRWAKEWVAW DAFPHFSVQL VTWLLPQQTG DTLTIATHAV GDALVVEAIT RTPDGAPHIG LNVESRLIAA DGATVETTLR EVGPGQYRIS LDNLPAGAYL VQIVARDGQG RAVAGATGGA AMPLSAEYRR QAGDRYLLEE IAHITGGRID PQPHQVFEPG RAARGMAREI GLPLLWLALV LLPLDIALRR VFIRHASIAA ALRHIGLRAL ARRLEPQEAL DTGVASSLPS PPTTPPRTIT VNRSPASPLV PSANELERLR AAQEAARRRL RGEDEE
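The relationship between the gene sequence, the translation table (11) and the protein sequence can be illustrated with a minimal sketch. It assumes the sequences are pasted as plain text in space-separated blocks of ten, as shown in this record. The helper names `clean`, `gc_fraction` and `translate` are hypothetical and not part of any database tool. The amino acid string is the standard assignment used by NCBI translation table 11, listed in TCAG codon order.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base and stop at the first stop codon.

    Alternative start codons are not handled specially; this gene begins
    with ATG, so that does not matter here. Ambiguous bases raise KeyError.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGGCGTTT CGTTTCTTTA TCCCGAAGCC"
print(translate(first_blocks))  # MGVSFLYPEA
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content row, although that figure has not been recomputed here.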