Gene RoseRS_3380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3380 
Symbol 
ID5210357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4242716 
End bp4244095 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content62% 
IMG OID640596977 
Productvon Willebrand factor, type A 
Protein accessionYP_001277690 
Protein GI148657485 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.178617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00546751 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTCGTT TGATGCGTTG TTTGTACTGT GGTGCATTGC AGGATGAACC TGTCGGGGTC 
AAGTCCTGCG TCCGCTGCGG CGGCGAACTC AAGTACGAGG ACGCCGCCCG CACAGCCAGC
GGCGGTTCCT ACCTCACCGC TCAACTGGAA CTGGACCAGG TCTATGCCCC GCCCGGTCAG
AACGTGGATC GCTACCTGCT GCTCACCCTC TGCTCCCCGG CCAAAGTGCC GCCCGAACAT
GCGCTGCCCC GCGAACAGCA CCGCCCGCCG CTGCACCTTG TCGCCGTCCT GGATGTGTCT
GGCTCAATGA GTGGCACAAA ACTGGCTTCC GCCAAAGAAG CCCTGCGTCA GGCGTTGCAC
TTTTTGCAGG ACGGAGACGT CTTCTCGCTG GTTACCTTTT CCGATCAAGT GCAAACCCAC
CTCAAAGCCG AGTCCTACGC CCAACGAAAG CGCGACAAGA TGGAAAACCT GCTGGACGAA
ATCAGGGCGA GCGGCATGAC CGCCCTGGAT GGTGGGCTGG CACAGGGGAT TGACCTGGGC
CAAAAGAAAC GGCAGGCCAC CACCCTGGTC CTGCTGCTCA GCGATGGGCA GGCGAATGTC
GGCGAGACCG ACCTGGAGAA AATCGGCTTG CGGGCGCAAA AGGCGCGGCA ATCCGGTCTC
ATCGTCTCCA CGCTGGGGGT TGGCCTGGAT TACAACGAGG CGCTGATGGT GGAAATTGCC
AACCAGGGCG GTGGGCGTTT CTACCACATT CAAGAGGGCA GTCAGATCCC GGCGGCTCTG
ATGCAGGAAC TGGGCAGCGC CGCCATGCTC GCCGCGCGTC AGGTGGAAGT GGAGTTCGAT
CTCCCGTCCG GCGCGGCGCT GGTCTCGCTC ACCGCGCTCT ACCCGCTGGA AATGGTCAAC
AGCCGCCCAC TTCTGAAAGT GGGGGACTTG CTGCCGGATG TGCGGGTGGA GATTCCGCTG
CGCCTGACCC TTTACCCCCA TGCCGCTGGT GAGCGTTTCA GCGTCAGTGG AGGAGTGCAT
CATCAAACGC CGCGCGGCCA GACCCTGGGA TTGGCGCTGA ACGCCGTCAG CGTGCGCTTT
GTGGAACAGC GTCAATTCGA GGAGAGACCA GGTTACGTTG CCCCGGTGAT GGAGCGCGTG
CTGGAATTTC GCCGCGCTGC CCACCTGTTG GAGTTTGCCC GCCTGCAGGA ACGGGATAGC
ACGCTGGCCA GGCAGCAAGC CGAGCGGGAA CGCCAGGCCC TGCGGGATTA CGCTCGCATG
TTCAATCCAG ACAAGGCGAT GGAACTGGAG GTAGAAAAGA TAGATGCGCT CTTTGCCACG
CCCAATGTGG CCAAACAAGC CATGCATCGC GCCGCCCGAA TGATCCGCGG GCTGGATTAG
 
Protein sequence
MARLMRCLYC GALQDEPVGV KSCVRCGGEL KYEDAARTAS GGSYLTAQLE LDQVYAPPGQ 
NVDRYLLLTL CSPAKVPPEH ALPREQHRPP LHLVAVLDVS GSMSGTKLAS AKEALRQALH
FLQDGDVFSL VTFSDQVQTH LKAESYAQRK RDKMENLLDE IRASGMTALD GGLAQGIDLG
QKKRQATTLV LLLSDGQANV GETDLEKIGL RAQKARQSGL IVSTLGVGLD YNEALMVEIA
NQGGGRFYHI QEGSQIPAAL MQELGSAAML AARQVEVEFD LPSGAALVSL TALYPLEMVN
SRPLLKVGDL LPDVRVEIPL RLTLYPHAAG ERFSVSGGVH HQTPRGQTLG LALNAVSVRF
VEQRQFEERP GYVAPVMERV LEFRRAAHLL EFARLQERDS TLARQQAERE RQALRDYARM
FNPDKAMELE VEKIDALFAT PNVAKQAMHR AARMIRGLD