Gene Rcas_2577 details

Gene Information

Locus tag: Rcas_2577
Symbol:
ID: 5540059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3326375
End bp: 3328063
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 59%
IMG OID: 640894706
Product: von Willebrand factor type A
Protein accession: YP_001432673
Protein GI: 156742544
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
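
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS is complete, unspliced, and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3326375
END_BP = 3328063
GENE_LENGTH_BP = 1689
PROTEIN_LENGTH_AA = 562


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length matches:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1689 bp, and 1689 / 3 = 563 codons, one of which is the stop codon.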


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACATC CTTTCCGATG GCTTCTGATC CTGGTTCTGT TGACGCCGCT GCTAGCAGCG 
TGCGGCGCCG GCGGTGGCGA CGGAGGGTTC CTGGGAGGCA ATACCGTCGA AGTCAGCATT
GCCTACGGCA GCGAAAAGCG TGCATGGCTC GAAGAGGCGG TTCGACAATT CAACGCTGCC
GGGCGGAAAA CGGCGAGCGG AGCATCCATT CAAGTGGTGG CGACGCCAAT GGGTTCAACC
GACTCGATGA ACCAGATTCT GAGCGGCGCC ATTCAGCCGA CCGTCTGGAG TCCGGCGAGC
AGGATTCTGC TGCCGGTCGC CAACGATGAA TGGGGCAAAC GCAACAATGG CGCAACGCTC
GTCGATGAAA ATGCGCCGCT CCTGGTGCTC AGCCCGGTTG TCATTGCCAT GTGGAAGCCG
ATGGCGGAAG CGCTCGGCTG GCCCAACAAA CCGCTCGGCT GGTCCGACAT CGCCGAACTG
TCGGCAAGCG GCAAAACCTG GGCGGACTTC GGCAGACCGG AGTGGGGTCC GTTGCAGTTC
GGTCACACCC ATCCTGATTA TTCGAACAGC GGGGTAGCGA CGATTATTGC GATCAGTTAT
GCCGCCGCCG AAAAAACCCG TGGCTTGACC GTCGCCGATG TGCAGAATCC AAAAACGGCG
GAGTTCATGC GGAATATCGA GAGCGGCGTC ATTCACTACG GCGAAAGCAC CGGCTTTTTC
GCCGACCAGA TGTTCAACCG GGGACCGGGA TACCTCTCGG CGGCGGTGCT GTACGAAAAT
CTGGTGATCG AAGCCTACAA TCGTGATCGC TATCCCTCCG TCTCTCTCCC GGTCGTTGCC
ATTTATCCGA AGGAAGGCAC GTTCTGGACT GACCATCCCT ACGCGATCCT GAACGCGCCG
TGGGTGACTG ATGAGCAACG CGAGGCGGCG AATATCTTTC TCCGCTATCT GCTCGACCGT
CCGCAGCAGG AATTGGCGTT GCGCTACGGC TACCGGCCCA GCAACACCGA TGTAGCAGTC
GGCGCGCCGA TTACGCCGGA GAACGGCGTC GATCCACAGC AACCACAGAC GCTTCTCGAA
GTGCCGCGCC CGGATGTATT GAGCGCTATT CGTAGCATCT GGGAGCAGAA CAAAAAGCGG
GTCGACGTGA TGGCAGTGCT CGATGTTTCT GGCAGTATGG AGGACGAAGG GCGTTTGGAG
CAGGCAAAAG CGGCGCTGCG CATCTTCGTC GAGCAGTTGC AGGACGATGA TGGTTTCGGG
TTGACGATCT TCAGCGACCA GGCGACTGTG CTGACGCCGA TCTCGCCCAT CGGTTCCAGG
CGCACCGAGG TTCTCAACCG CATCGCCGGG TTGACGCCGC GTGGCGGGAC GCGCCTGCTC
GATACGGTGG TTGAGGCGTA TCAGGAATTG ACCGCAACAC CGCCCGGTCA GCGCATTCGC
GCGGTTGTGG TGCTGACCGA CGGGCTGGAC AATAGAAGCC AGCGTTCAGC GGAAGACGTG
CTCGATCTGC TCAGGCAGGA TAGAGAAGGG TACAGCATCA AAGTGTTCAC CATTGCGTTC
GGTGGTGATG CTGATGTACA CTTGCTGAAG GAGATTGCCA GTGCTACCGG GGCGAAGAGT
TACGTTGGCA AACCTGGCGA GCGTGGCGCA ATTGAGCGTA TCTATCAGGA TATTACGACA
TTCTTTTGA
 
Protein sequence
MRHPFRWLLI LVLLTPLLAA CGAGGGDGGF LGGNTVEVSI AYGSEKRAWL EEAVRQFNAA 
GRKTASGASI QVVATPMGST DSMNQILSGA IQPTVWSPAS RILLPVANDE WGKRNNGATL
VDENAPLLVL SPVVIAMWKP MAEALGWPNK PLGWSDIAEL SASGKTWADF GRPEWGPLQF
GHTHPDYSNS GVATIIAISY AAAEKTRGLT VADVQNPKTA EFMRNIESGV IHYGESTGFF
ADQMFNRGPG YLSAAVLYEN LVIEAYNRDR YPSVSLPVVA IYPKEGTFWT DHPYAILNAP
WVTDEQREAA NIFLRYLLDR PQQELALRYG YRPSNTDVAV GAPITPENGV DPQQPQTLLE
VPRPDVLSAI RSIWEQNKKR VDVMAVLDVS GSMEDEGRLE QAKAALRIFV EQLQDDDGFG
LTIFSDQATV LTPISPIGSR RTEVLNRIAG LTPRGGTRLL DTVVEAYQEL TATPPGQRIR
AVVVLTDGLD NRSQRSAEDV LDLLRQDREG YSIKVFTIAF GGDADVHLLK EIASATGAKS
YVGKPGERGA IERIYQDITT FF
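
The sequences above are printed in blocks of 10 characters, 60 per line. The following is a minimal Python sketch of how such a listing might be cleaned up, checked for GC content, and translated. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ only in permitted start codons); any codon containing a character other than A, C, G, or T is rendered as X, which is a choice made here for illustration.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and the final line of the gene sequence above.
    head = clean_sequence("ATGCGACATC CTTTCCGATG GCTTCTGATC")
    print(translate(head))
    print(translate(clean_sequence("TTCTTTTGA")))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` gives a way to compare the output against the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.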