Gene Rsph17029_2675 details

Gene Information

Locus tag: Rsph17029_2675
Symbol:
ID: 4897513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2821132
End bp: 2823087
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 72%
IMG OID: 640113276
Product: von Willebrand factor, type A
Protein accession: YP_001044549
Protein GI: 126463435
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
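
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes inclusive, 1-based coordinates and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record, and the function and variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2821132
END_BP = 2823087
GENE_LENGTH_BP = 1956
PROTEIN_LENGTH_AA = 651


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)
```

Both checks hold for this record: the span covers 1956 bp, and 652 codons minus one stop codon gives 651 amino acids.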


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.260015
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG ACCGCACCGA CGACGATCTC GAGGCTCTGC GCCTGGCACT CGCGCGCGAG 
GCGGCTCCGG CGCCCGATCC CGAGGCTCGG GCACGGGCCA TGGCGCTGGC GCTCGAGAAT
TACGACAGGT TCCAAGGACG GGCGAGCGGG CTGCGTCAGG GGAAGGACCG CCCGGCCGGG
GCGGGGTTCC TCAACGGAGT GCGCCGCATG TCGGGTTCCT TCTTCTCCCG CCCCCTTCTC
GCCGCCACCG GATCGGTCGC GGCTCTGGGC CTCGCCCTCG TCGTGGTGAT GCCGAACGCG
CGGCTGGCCG AGCCGCCGCA GACCGCGCCG GACGCGCCCG AGGCGGATGC CCGCCTCACC
GCCGCGCCGG AGGCGGGTGG CGGGGCCGAG ACGGCCGGTG CGCCGGTTCC CGCGGAACCG
CGGGCGCGGA GTGCCGAGGG CGCCGCCCCG CAGACCTTCG CCGCCGACGA AGCGATGCCC
ATGGCTGCGC CGCCCGCGCC GGATCTCGCC CTCTCGAAGC AGGCCGCCGA AGCCCCCGCG
CGCGCCCTTC CGCAGGGCGA CAGCGAGGCC TTCGCGAATG CGCCCGACAA TCCGCTCCGC
GTGACCGCCG AGGATCCCGT CTCGACCTTC TCGATCGACG TGGATACGGC GAGCTACGCG
ATCCTCCGCT CGAGCCTGCG GGCCGGGCAG CTTCCCCCGC GCGAGGCGGT GCGGATCGAG
GAGATGATCA ACTACTTCCC CTACGACTAC CCGGCGCCGG AGAACGGAAC GCCGCCCTTC
CGCCCCACCC TCTCCATCAC CCGGACGCCC TGGAACCCGG AGACGCGGCT CGTCCATGTG
GCGCTGCAGG GCCGGATGCC CGCCATCGAG GACCGGCCGC CGCTGAACCT CGTCTTCCTG
ATCGACACAT CGGGCTCGAT GCAGGATCCG GCGAAGCTGC CGCTCCTCAA GCAGTCGTTC
GGGCTGATGC TCGGCCGCCT GCGCCCCGAG GATCAGGTGG CCATCGTGAC CTATGCCGGC
TCGGCGGGCG AGGTGCTGGC GCCCACGGCT GCGAACCAGC GCAGCACCAT CCTCTCCGCC
CTCGACCGGC TCGATGCCGG CGGATCGACC GCGGGGGACG AGGGGCTGGC GCTCGCCTAC
CGGACGGCTT CGGAAATGGC GGGCGCGGGC GAGGTCACGC GCGTGGTGCT GGCCACCGAC
GGAGACTTCA ACCTCGGGAT CAGCGACCCG GAAGAGCTGG CCCGCCTCGT GGCGCACGAG
CGCGACACCG GCGTCTATCT CTCGGTGCTG GGCTTCGGGC GCGGCAATCT CGACGATGCG
ACGATGCAGG CGCTGGCGCA GAACGGCAAC GGGCAGGCCG CCTATATCGA CAGTCTGAAC
GAGGCGCAGA AGGTTCTGGT CGACCAGCTC AGCGGCGCGC TCTTCCCCAT TGCCGACGAT
GTGAAGGTGC AGGTGGAGTG GAGCCCGGCC CGCGTCGCGG AATACCGGCT CATCGGCTAC
GAGACCCGCG GCCTGCGCCG CGAGGATTTC GCCAACGACC GGGTCGATGC GGGCGAGATC
GGCGCCGGCC ATTCGGTGAC GGCGATCTAC GAGATCACGC CGGTGGACAG CCCCGCGCGC
CTGACCGATC CCCTGCGCTA CGGCGCCGAA CCGCCCGAGG GCGCGCATGG CGATGAACTG
GGCTTCCTGC GGCTGCGCTA CAAGGCGCCG GGGGAAAGCA CATCGACCCT GATCGACACG
CCGATCCCGG ACATGCTGAC CGAGGCTTCC GAGGACGTGC GCTTCTCCAC CGCCATCGCG
GGCTTCGGCG AGCTTCTGCG CGGGTCGGAC AAGCTCGGCG CCTGGGGCTG GGACGAGGCC
ATCGCCTTGG CCGACGGGGC GCGCGGGGCC GATCCCTTCG GCTACCGGGT GGAGGCCGTC
CAGCTGATGC GCCTGGCCGA GAGCCTCAGC CGCTGA
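
The GC content field reported for this gene (72%) is a property of the full sequence above. As a minimal sketch of how such a figure could be computed from the space-separated blocks shown here, the hypothetical helper below strips whitespace and counts G and C bases. It is not necessarily how the database derives its value, and the demonstration uses only the first line of the sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAGCGACG ACCGCACCGA CGACGATCTC GAGGCTCTGC GCCTGGCACT CGCGCGCGAG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete gene sequence instead of `first_line` would give the value to compare against the reported GC content.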
 
Protein sequence
MSDDRTDDDL EALRLALARE AAPAPDPEAR ARAMALALEN YDRFQGRASG LRQGKDRPAG 
AGFLNGVRRM SGSFFSRPLL AATGSVAALG LALVVVMPNA RLAEPPQTAP DAPEADARLT
AAPEAGGGAE TAGAPVPAEP RARSAEGAAP QTFAADEAMP MAAPPAPDLA LSKQAAEAPA
RALPQGDSEA FANAPDNPLR VTAEDPVSTF SIDVDTASYA ILRSSLRAGQ LPPREAVRIE
EMINYFPYDY PAPENGTPPF RPTLSITRTP WNPETRLVHV ALQGRMPAIE DRPPLNLVFL
IDTSGSMQDP AKLPLLKQSF GLMLGRLRPE DQVAIVTYAG SAGEVLAPTA ANQRSTILSA
LDRLDAGGST AGDEGLALAY RTASEMAGAG EVTRVVLATD GDFNLGISDP EELARLVAHE
RDTGVYLSVL GFGRGNLDDA TMQALAQNGN GQAAYIDSLN EAQKVLVDQL SGALFPIADD
VKVQVEWSPA RVAEYRLIGY ETRGLRREDF ANDRVDAGEI GAGHSVTAIY EITPVDSPAR
LTDPLRYGAE PPEGAHGDEL GFLRLRYKAP GESTSTLIDT PIPDMLTEAS EDVRFSTAIA
GFGELLRGSD KLGAWGWDEA IALADGARGA DPFGYRVEAV QLMRLAESLS R
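
The protein sequence corresponds to the gene sequence translated with the translation table listed in the record (11). The following is a minimal sketch of that translation, assuming the compact amino-acid string used for NCBI genetic code tables with bases ordered T, C, A, G. The `translate` helper is illustrative, stops at the first stop codon, and does not handle alternative start codons (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence above.
print(translate("ATGAGCGACG ACCGCACCGA CGACGATCTC"))
print(translate("CAGCTGATGC GCCTGGCCGA GAGCCTCAGC CGCTGA"))
```

The two fragments translate to the first ten and last eleven residues of the protein sequence shown above (`MSDDRTDDDL` and `QLMRLAESLSR`).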