Gene RoseRS_1843 details

Gene Information

Locus tag: RoseRS_1843
Symbol:
ID: 5208803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2276963
End bp: 2278225
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 60%
IMG OID: 640595451
Product: von Willebrand factor, type A
Protein accession: YP_001276182
Protein GI: 148655977
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
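
The length fields above are consistent with the listed coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption about this record's convention, though it reproduces the listed values) and that the coding sequence ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2276963, 2278225)
print(length)                  # 1263
print(protein_length(length))  # 420
```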


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.122736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGCG AGGTCACAAT TCGCGCGTCG CTGGCGCGCC CATACATGGC AGCAGCGGCG 
ACGCCGCAGG TTGCCTATAT GCTGATCGAA ATCACGCCTG GTCAGGTGAT GACGCAGGTG
CGTGCACCGG TCAATGTCTG TTTCGTGATT GACCGGAGCG GCTCGATGAA GGGCGAGAAG
ATCGACCGGG TGCGACGGGC GACGATCCGC GCCATTGAGA TGCTCGATGC GCAGGATGTC
GTCTCGGTGG TCATCTTCGA TCATCGAACC GAGGTGCTGA TCCCTGCCAC GCCGGTGACC
AGACCTGCCG AACTGATCGA CCGTATCAAT CGTGTGCGCG ACAGTGGCGG CACCCGGATC
GCACCGGCCA TCGAAGCGGG ATTGCGCGAG ATCGAGAAGG GACCGCCACA GATGGTGCGG
CGGCTCATTT TGCTCACCGA CGGTCAGACC GAGAACGAGT CCGACTGCCT GCGCCGTGCT
ACGGATGCCG GGCAACGCAA TGTGCCGATC ACGGCACTCG GTGTGGGCAA GGATTGGAAC
GAAGACCTGC TGATCGAAAT GGCGAACCGT TCAGGCGGAA CTGCCGATTA CATTGATCGT
CCGGAAAAGA TCGTCGAGTA CTTCCAGAGC ACCATCCAGC GCGCCCAGGC GACTGCGGTG
CAGAATGCAA ACCTGACGCT GCGACTGGTG CAGGGAGTGC TGCCACGCGC AGTATGGCAG
GTCTACCCGC TGATCAACAA TCTCGGCTAC CGCCCGATCT CCGACCGTGA TGTCAGCGTG
CCGCTCGGTG AACTGGAAAC CGGCAGCGGT CGCACCCTGC TGATCGAAGT GCTGGTCGAG
CCGCGCCCGG CAGGTGAATA TCGCATCGGG CAGGCGGAAG TGAGTTACGA CATTCCGCTG
CTGAACCTGC GCGATGAAAA GACCCGCGCC GACATCATGC TCACGTTTAC GACCGACGCT
GCGCTTGCGA GTCAGGTAAA TGCCAGCGTC ATGAACATTG TTGAAAAGGT CAGCGCCTTC
AAACTGCAAA CGCGAGCGCT GCAAGACCTG GCGGCCGGCG ATGTCACCAG CGCGACGCAA
AAATTGCAGA GCGCCGTGAC CCGTCTGCTC AACCAGGGCG AAGTCGAACT GGCGCAGACG
ATGCAGCGCG AAATCCAGCA CCTGCAACAG ACAGGCAAAC TCTCCAGCGA AGGACAGAAG
ACGATCAAGT TTGGAGTACA GAAAACCGTT CGCCTGAGCG ACATCAAGAA AGATGAACCC
TGA
 
Protein sequence
MAGEVTIRAS LARPYMAAAA TPQVAYMLIE ITPGQVMTQV RAPVNVCFVI DRSGSMKGEK 
IDRVRRATIR AIEMLDAQDV VSVVIFDHRT EVLIPATPVT RPAELIDRIN RVRDSGGTRI
APAIEAGLRE IEKGPPQMVR RLILLTDGQT ENESDCLRRA TDAGQRNVPI TALGVGKDWN
EDLLIEMANR SGGTADYIDR PEKIVEYFQS TIQRAQATAV QNANLTLRLV QGVLPRAVWQ
VYPLINNLGY RPISDRDVSV PLGELETGSG RTLLIEVLVE PRPAGEYRIG QAEVSYDIPL
LNLRDEKTRA DIMLTFTTDA ALASQVNASV MNIVEKVSAF KLQTRALQDL AAGDVTSATQ
KLQSAVTRLL NQGEVELAQT MQREIQHLQQ TGKLSSEGQK TIKFGVQKTV RLSDIKKDEP
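
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is shown in coding orientation and using the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The `translate` function and the compact table construction are illustrative and not part of the source record. Only the first 30 bases are translated here.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAGGCG AGGTCACAAT TCGCGCGTCG"))  # MAGEVTIRAS
```

Passing the full gene sequence in the same way should reproduce the 420-residue protein sequence if the record is internally consistent.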