Gene Rcas_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1686 
Symbol 
ID5539162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2170008 
End bp2172488 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content63% 
IMG OID640893823 
Productvon Willebrand factor type A 
Protein accessionYP_001431796 
Protein GI156741667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000433208 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAGAC GTTTGCTTCA TCACAGGACA GAAGGGCAGA ATATCGTCCT GCTCGCCGGC 
ATTCTGGCAT TGCTGGTCGG CATGGCGGCC CTGGCTGTCG ACCTCGGCGT CACCTACGCC
GAGCAGCGGA ACATCGTGCG CGGCACGAAC GCGGCGTCGC TGGCGGGCAT GAATCGCCTG
ATCAGCGGCG GTCGCGATGC TGATGTCGCC CTTGCGATCT ATGAATCGCT TCGCTCGAAC
GGCATCCAGG TCACCGTGCC GGGCGAGTCG CCGCAACCCG GCGACCGCGC GTTCGAGGCG
CTCTACCTGG GAAGCGATGG CGCGCCGATC CCCGGCGCGT GCAGTCGCGT CGGCGCCTGC
GGCTCGCAAC GTCCGCAGGG GGTGAAGTAT ATTCAGATCG ACCTCAAGGG CAATGTCGAT
ACCTACTTTG CGCGCCTCTT CGGTCAGAGC ACCCTGCCGG TCGGCGCTAC GGCATACGCC
AGCGTCGGCG CCTGCGCCAC CGGATACTAC CCCATCGGTG TGCGCACGAC AGTCGGCGGG
CAGCCGATGT TCGATGAGTA TGGGTTCGCC AATTACGACG GCTTCTATGA AGACGAGACC
TATGCTCAAT TGCGCTATCA GCGCCTCTAT CTGCGCACCG AAAACAACCC GAATGGCGGG
TTTAGCCTGC TGCGCTGGCG CAACGACATC CCTGCCGGCA ATGCCAATGC GCTTGCGGAG
ATGCTGACCG GCGACGGCAC CTATAGCCGG GGGTACGCCG AAGCGCCCTG GCCCGAAATC
GCCAGCGGAA CGGACGGTCC GACCCGCCCT GACTCCTATC CGTTCGAGCC GAATACGATC
AACACGGCGG ACTGGGTGTA TGGTAATGTC TTCGGCGGCA ATCCCTTCAA TGCGAATGTG
ATCAATCAAC TCGATACGCT GAAGCGCAAT CGCACCCTGA TGACTCTGCC GATGTATCGC
TTCGATAACG GCAGTCTTTC CAACCCGACG TACTACCTCG AGCGCTTTGG CGCCTTCCTG
CTGGTTGATT ACGGTGTGGA TACCGATGGG CGGGCGCCCG ACGGTCCTTG GATGGAACTG
GCTTACGTGC GCGAAGGCGG GCAGTGCGCC AACCTCGTCA CCGGTGCGCC GCCAACCAAC
AACTTCACCG TCGGCGGCAG CGTGACCTAT ACGCCACGCT ATCAGACCTA CCCGGACCTC
AATCGCCCGG TGCAGTTCCT ACTCATCCTC GACGTGTCCG GTTCGATGTC GTGGACGTTC
GATGGGCGCG GCGTGCAGAA TGGGCAAACC GTCTTCTGCA CCAACCCGTC TCAAGGGTGC
GTCTCGGTGC AAACGGCATG GCCCAACGCG CAGGAACGCC GGATCTATAC TGCGAAGCAG
GTGCTGCGCT CGTTCGTGGC ACAGATCGAT CAGGATCGGC AGAGCGGTCT CCGACCGTAT
GATACCGTGC GCCTCGTGAC CTTCAGCGGA CGCCTGGGCA GCTTTGTCAA CAGCAGCGGC
GCCGTCGGCG ACAATAATCG CGCGTTGAAC GACCTGACCG AGGTGCTGCC TGCCGGTTGG
ACGAATGACC GCGCGACATT GGAAGCGGCG ATCAATAGTG CCGGTATGGT GGACGGCGAT
CCGTATATGA CGGCAGGCGC AACGCCCAGC GCCGTCGCCT TTGCCCGCGC CAGCCAGGTG
TTCGCCAACG CTCCCGAACG CGCGCCCAAC GGGATGAAGT ATCGCCGCGT GGTGATTTTC
GTGACCGACG GTGTGGCGAA TGTGTTGCGC AATGGTATGC AGAATAACTA CGGTGAGGGG
TGCCAGTTGG GCGCTGAGAA CGTCGGTTGC CAGATGGGCG ATCCGCTGCC GGACGGCAGC
CTGCGACCGC TCAATGCGAT GGTCGCAGAG GCGCAGGCGC TCAAGGAGGC GTATATCCGC
CCCAGCGACG GTTCGGTCTA TGTGGTGGCG CTGAGCGGCA CATTCGAGGC GACCGGTCTG
AACCTCGTCG CCAGCCAGCC GGACTATGTC AAGCGCGCCG ACCGGTCCGA GGAGTTGCAG
CAGATTTTCG ACGACATCCA GGTTTCGGCG ATCCAGGGCG ATTGTACGCC ATCCCAGGGT
GAACTGCGCG ACTCGATGGC GCCCTCCGAG GTGCCGACCG ACCTGCGACC GGAATTGACC
GATCGGATCG TTGGACAGGT GACGCTCACC GATGCCAATA ACAATGTGCG CGAGGCATGG
ATTACGGCCG ATCCGATCAC GCGCAAACTG TCGTATGCGT TCACGAACGT TCCGCCGGGC
CAGTATACGC TGCGCGCCTG GCTCGGCTAT CGCGCGCCGC AGGATGGTAT CTCGCGCGAT
AAGTACGAAC TGCTGGTGCG CGGACCGGAT GTGACGACCG AAATCACGGT GCAGGTCAGC
GCCGGTCGCA GTCTCGGCGG CGTCATCGGC GTGCCGATCT CGCTCGACCT GAACGACAGC
GTCTGCCCGG CATTGTCGTA A
 
Protein sequence
MKRRLLHHRT EGQNIVLLAG ILALLVGMAA LAVDLGVTYA EQRNIVRGTN AASLAGMNRL 
ISGGRDADVA LAIYESLRSN GIQVTVPGES PQPGDRAFEA LYLGSDGAPI PGACSRVGAC
GSQRPQGVKY IQIDLKGNVD TYFARLFGQS TLPVGATAYA SVGACATGYY PIGVRTTVGG
QPMFDEYGFA NYDGFYEDET YAQLRYQRLY LRTENNPNGG FSLLRWRNDI PAGNANALAE
MLTGDGTYSR GYAEAPWPEI ASGTDGPTRP DSYPFEPNTI NTADWVYGNV FGGNPFNANV
INQLDTLKRN RTLMTLPMYR FDNGSLSNPT YYLERFGAFL LVDYGVDTDG RAPDGPWMEL
AYVREGGQCA NLVTGAPPTN NFTVGGSVTY TPRYQTYPDL NRPVQFLLIL DVSGSMSWTF
DGRGVQNGQT VFCTNPSQGC VSVQTAWPNA QERRIYTAKQ VLRSFVAQID QDRQSGLRPY
DTVRLVTFSG RLGSFVNSSG AVGDNNRALN DLTEVLPAGW TNDRATLEAA INSAGMVDGD
PYMTAGATPS AVAFARASQV FANAPERAPN GMKYRRVVIF VTDGVANVLR NGMQNNYGEG
CQLGAENVGC QMGDPLPDGS LRPLNAMVAE AQALKEAYIR PSDGSVYVVA LSGTFEATGL
NLVASQPDYV KRADRSEELQ QIFDDIQVSA IQGDCTPSQG ELRDSMAPSE VPTDLRPELT
DRIVGQVTLT DANNNVREAW ITADPITRKL SYAFTNVPPG QYTLRAWLGY RAPQDGISRD
KYELLVRGPD VTTEITVQVS AGRSLGGVIG VPISLDLNDS VCPALS