Gene Rcas_3821 details


Gene Information

Locus tag: Rcas_3821
Symbol:
ID: 5541324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4992984
End bp: 4995794
Gene length: 2811 bp
Protein length: 936 aa
Translation table: 11
GC content: 63%
IMG OID: 640895931
Product: von Willebrand factor type A
Protein accession: YP_001433877
Protein GI: 156743748
COG category: [S] Function unknown
COG ID: [COG5426] Uncharacterized membrane protein
TIGRFAM ID:
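
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
# Illustrative consistency check for the length fields of this record.
# Assumption: coordinates are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4992984, 4995794)
print(length, protein_length(length))  # 2811 936
```

Both values agree with the gene length (2811 bp) and protein length (936 aa) listed in the record.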


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.608594
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTTT CGTTTCTTTA TCCCGAAGCC CTCTGGCTTG GTTGCGTTCT GCCACTCGTT 
TGGGGTGTCG CGCTCCTCGC TCCCGCGCAC ATTGCCGCCT GGCGACGATG GTTGAGCCTG
GTTGTCCGCA CCATCATCGT GCTGGCATTG ATCGGCGCGC TTGCAGGCGC GCAACTGGTT
CAACCTCCTG GAATCACCAC AACGATTTTC CTGCTCGATG GGTCGGACTC GGTTGCCGCA
TCGCAACGTG CCCGTGCAGA GGCATTTATC GCGCGGGCGC TGGCGGTAAT GCCGCCCGAT
GACCGGGCAG GGATCATCGT CTTTGGGCGT GAGGCGCTCG TCGAGCGGTT CCCTGCCCCC
GAACGCACCT TTGGCGCGCC TGTAACGCGA CCCTTCGGCA GTGCAACCAA TATCGCGGAT
GCGCTCCAAC TCGGTCTGAC GCTGCTTCCT GCAGAGGGGC ATCGGCGGTT GGTGCTGCTC
TCCGACGGCG GCGAAAACCG CGGGACCGCC CGCCTGATTG CGCAGGACGC TGCTGCGCAG
GGCATCCCCA TCGATGTGGC GCCGCTCACC GGCAGCGCCG ATGGATTGGA TGCGCAGATC
ATCGGCGTGA CGTTGCCATC GACGGCGCGC GAAGGGCAGC GCCTGCCACT GCGCATTGAC
CTGGAAAGCA ATACGTCCGT TGCAGGTCGC CTGATGGTCA CAGGTCCCGA TAGGACGACC
GTTGCGACGA TACCGGTCGA GATCGGAACT GATCGGCAAA CTGTCGAAAT CCTGCTTCCC
GAAGCGCCTG CCGGCTTCAA CCGCTACACA GCGTATCTTG AGGTTCCTAA CGATGCACGC
ACCCAGAATA ACGCTATCGA GACCTTCAGC TATGTGCGGG GCACACCGCG TGTACTACTG
GTTGCGCAAG CCCCCGATGA TGCCATTAGT CTGGAACGCG CGCTGCGCGC CGCTCGGATA
GAAGTCACAG TCGTCAGCCC GGCATCCATA CCCGCCACAT TCGGCGAACT GATCCGGTAC
GACGCGATTG CGCTGATCAA TGTGCCCCGC GCTCTGTTTT CCAACGAGAC GGTGCAACGC
ATTGCGGCGT ATGTCCGCGA TTTCGGCGGC GGTCTGCTCA TGGTCGGCGG TCCGCAGTCG
TTTGGTCCGG GAGGCTGGCG CGGTACGCCG GTCGAGGCTG CATTGCCGGT GACGATGGAT
ATTCCTGAGC GGCAGCGCCA GCCGCCGGTC AGCATCGTGG TGGTGATCGA TATTTCAGGA
AGTATGGCTG CGACGGAGGA TGGCATTCCC AAACTCTCGC TGGCGCTCGA AGGTGCGCGA
CGGATCGCGG CGCTGTTGCG CGATGAAGAT GAGTTGACGG TCATTCCTTT CGATGACCGT
CCCGGCGTCA TTGTCGGTCC GCTGCCGGGA TCACGGCGCG ATGTCGCTAT CGAGCAACTG
AACCAGGTGC GTCTTGGCGG CAGCGGCATC AACATCCACG ATGCGCTTCG AGTTGCGGCG
CGGTATACCC GCGCCAGCGA GCGCCCCGTG CGCCACATTA TCACGATTAC TGATGGAAAT
GATACGACTC AGCAGGAAGG TGCGCTGGAC ATTGTGCGCT CACTGCACGA CGAAGGCGTG
ACCCTGACCT CCGTTGCCAT TGGGCAGGGC GATCATGTGC CATTCATCCG TGATATGGCC
GCCGTGGGCG GCGGGCGTAC CTTCCTGACG GAGCGCGCCG CCGATGTTCC CGATCTGTTG
ACCGGCGAGA CGCAGACGAT CATGACACCA TATATCGTCG AAGGTTCATT GACACCGCAG
CAGGCAATGC CGCACCCCTC CCTGCGCGGG ATCGACGCCC TGCCCAAACT CTACGGTTAT
GTTCTGACGA CGCCACGCGA GACGGCGCAG ACAACGCTCG TTGCTCCCGA TGGTGATGTG
TTGCTCGCGG GCTGGCAGTA CGGTCTGGGA CGTTCCCTCG CCTGGACGAG CGATCTCAGC
GGGCGATGGG CGAAGGAATG GGTTGCATGG GACGCATTCC CGCACTTCAG CGTTCAACTC
GTCACCTGGC TCCTGCCGCA GCAAACCGGC GATACCCTGA CCATCGCAAC ACACGCTGTT
GGAGATGCCC TGGTCGTCGA GGCTATCACA CGCACACCCG ATGGCGCACC ACACATTGGA
TTGAACGTTG AGTCCCGATT GATCGCGGCT GATGGCGCTA CTGTCGAAAC CACGCTGCGT
GAAGTCGGTC CGGGACAGTA TCGTATATCA CTCGACAATT TGCCGGCCGG CGCCTATCTG
GTGCAGATCG TTGCGCGTGA CGGGCAGGGG AGAGCGGTTG CCGGCGCGAC CGGCGGCGCC
GCAATGCCGC TCAGCGCCGA GTATCGCCGT CAGGCGGGAG ATCGCTACCT GCTCGAAGAG
ATCGCGCACA TCACCGGCGG ACGGATCGAC CCGCAGCCTC ATCAGGTTTT CGAGCCAGGA
CGCGCGGCAC GCGGCATGGC GCGCGAAATT GGATTGCCGT TGCTCTGGCT GGCGCTCGTG
TTGCTGCCGT TGGACATTGC GCTCCGGCGA GTCTTTATTC GACACGCGTC AATTGCCGCT
GCACTGCGGC ACATTGGTCT GCGCGCGCTA GCCCGGCGCC TCGAACCGCA AGAAGCGTTG
GACACCGGGG TTGCCTCATC GCTACCATCT CCACCTACCA CGCCGCCTCG TACAATCACT
GTCAACCGCT CGCCTGCCAG CCCGCTCGTC CCATCCGCCA ATGAACTGGA GCGCCTCCGT
GCTGCGCAGG AAGCAGCACG GCGACGATTG CGCGGCGAGG ATGAGGAATA G
 
Protein sequence
MGVSFLYPEA LWLGCVLPLV WGVALLAPAH IAAWRRWLSL VVRTIIVLAL IGALAGAQLV 
QPPGITTTIF LLDGSDSVAA SQRARAEAFI ARALAVMPPD DRAGIIVFGR EALVERFPAP
ERTFGAPVTR PFGSATNIAD ALQLGLTLLP AEGHRRLVLL SDGGENRGTA RLIAQDAAAQ
GIPIDVAPLT GSADGLDAQI IGVTLPSTAR EGQRLPLRID LESNTSVAGR LMVTGPDRTT
VATIPVEIGT DRQTVEILLP EAPAGFNRYT AYLEVPNDAR TQNNAIETFS YVRGTPRVLL
VAQAPDDAIS LERALRAARI EVTVVSPASI PATFGELIRY DAIALINVPR ALFSNETVQR
IAAYVRDFGG GLLMVGGPQS FGPGGWRGTP VEAALPVTMD IPERQRQPPV SIVVVIDISG
SMAATEDGIP KLSLALEGAR RIAALLRDED ELTVIPFDDR PGVIVGPLPG SRRDVAIEQL
NQVRLGGSGI NIHDALRVAA RYTRASERPV RHIITITDGN DTTQQEGALD IVRSLHDEGV
TLTSVAIGQG DHVPFIRDMA AVGGGRTFLT ERAADVPDLL TGETQTIMTP YIVEGSLTPQ
QAMPHPSLRG IDALPKLYGY VLTTPRETAQ TTLVAPDGDV LLAGWQYGLG RSLAWTSDLS
GRWAKEWVAW DAFPHFSVQL VTWLLPQQTG DTLTIATHAV GDALVVEAIT RTPDGAPHIG
LNVESRLIAA DGATVETTLR EVGPGQYRIS LDNLPAGAYL VQIVARDGQG RAVAGATGGA
AMPLSAEYRR QAGDRYLLEE IAHITGGRID PQPHQVFEPG RAARGMAREI GLPLLWLALV
LLPLDIALRR VFIRHASIAA ALRHIGLRAL ARRLEPQEAL DTGVASSLPS PPTTPPRTIT
VNRSPASPLV PSANELERLR AAQEAARRRL RGEDEE
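
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline that produced the record. It assumes the gene sequence is given 5' to 3' on the coding strand, starting in frame at the first base. It uses the amino acid assignments of translation table 11, which match the standard genetic code for all 64 codons. Table 11 also allows alternative start codons, which are read as methionine at the first position. That case is not handled here because this gene starts with ATG. The names `CODON_TABLE`, `clean` and `translate` are hypothetical helpers.

```python
# Minimal CDS translation sketch (hypothetical helper names).
# Codon order follows the conventional T, C, A, G layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGCGTTT CGTTTCTTTA TCCCGAAGCC"))  # MGVSFLYPEA
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `translate` should give a protein to compare with the listed one.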