Gene Rsph17029_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1966 
Symbol 
ID4897138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2080663 
End bp2082537 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content72% 
IMG OID640112560 
Productvon Willebrand factor, type A 
Protein accessionYP_001043842 
Protein GI126462728 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4548] Nitric oxide reductase activation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCC ACCTGCTCGA TCTGATGGAG CCCGAGGAAA CCGTCGGCAA CCTCTGGCAC 
GATTACGCAA GCCGCTTTGC CGCCACCCGC GGCCATGCGG AGGCAGCGGT GCCCCTCGAG
GCGATGCGGC CGAGCGTGGC GGTCCTCTTC CGCGCGCTGG GCGGGCCCTC GGGCGTCGAG
ATCGGCGCGA GCCCCGCCAC GGCGGCGCTC CACCGCCGCA GCGTAGGCTC GGTTCTGGGC
GAGACGCGGA CCTTCGAGCA GGTGGCGCGC TTCGACGGCG AGCGGCTGCG GCTGCCGGCC
GTGCTCGACA CCTTCCCGTC GGCGGATATG AACCGCGCGG CCTATCTCTG GCTGGCGGCG
CTGGCCGCGA AGGTGGAGAT CCCCGAGGCC GAGGGTGCGC TCGCCGCCGA CCTGGCGCAG
ATCGCCGCGA TGAAGGCCGC GACGGCGCGG GTGCTGAAGA CGGCCCCGGG CCTTGCTGCG
ATCTACGGTC GGCTGGCAGC CCATGTGCTG GCCTCGCGCG AGACGCACGC AGCCTTCGGG
ATCGAGGCCT CGATAGAGGC GCGGGTGCGG GCGGCTCTCG GAGGCCCGGC CGCGACACGC
GCCGTGGCTC CCCACGCGCC GCGTGGCTAC AGGCCCTTCG CCCCTGTCCC GCTCTGGCTG
CGGCTCAGGC GCGGCACGGG GCAGGGCGCG GCCGATGCCC CCGAGGAGAC GCAGGCCCCC
GCGGGCCTGC CGACCGCCAC CCGCAAGATG GGCGAGCGGC GCGACCTCGA TCAGGCGAAC
CGCAAGGACA ATTTCATCGT CCACCGTTTC GAGGCGATCC TGAGCTGGGT CGAGAGCCTG
AACCTCAACC GCGCGACGGA TGACGACGAT CAGGAGAATG CGCAGAAGGC GGCCGAGGAT
CAGGATCGGA TCACCCTCAC CAAGCACGTC AAGCGCGCGG CGAGCCGGCT GCGGCTGCAT
CTCGATCTCG CGCCGCAGGA TGCCGAGCAC GAGCGCCTGT CGGACCGCTT CACCTATCCG
GAGTGGAACC ACCGTTCGCG CAGCCTGATG CCGGACCATA CCCGCGTGCT GGAAGCGCCG
GCCGAGGCGG GGCAGGGCTT CTGCCCAGAT CCGCGGCTGA GCGCGCGGGT CCGGCGGCAG
TTCGAGACGC TGCATCCGCG CCGCGTGCTC TGCAATCGTC AGGTCGAGGG GGCGGAGCTC
GACCTCGACG CGCTGATCGA GGCGCAGGTG GCGCTCCGCA CCACCGGCCG CGGCACCGAC
CGGATCTACC GGAGCTCCCG CGCGCTCGAG CGCGACCTCG CGGTCGCGAT CCTGATGGAT
TGCTCGCGCT CGACCGAGGC CTCTGTGGGC GAGCGGACGG TGATCGACAC GGCACGCGAG
GCGCTGGCGG CGCTTGCCGC GGGCATCGAC GTGGCGGGCG ACCGGCAGGC GATCTGGGGT
TTCTCCTCGC TCCGGCGCGA CCGGGTGTTC CTGCATCTCT GCAAGCGGTT CGACGAGCCG
ATGGGGCCTG CGGTCACGGC CCGCATCGGC GGGCTCAGGC CCGGCCATTA CACCCGCCTC
GGCGCGGCCA TCCGCCATGC CTCGGCGCGG CTGAACGAGG AGGCGGCGAG CCGCAAGCTC
CTTCTGGTCC TGACCGACGG CAAGCCCAAC GATCTCGACC ATTACGAGGG CGTCCACGGG
ATCGAGGACA GCCGCATGGC GGTCCGCGAG GCCCGCGCCC TGTCGCAGGC GGTGCATGGC
GTGGTGATCG ATGCCGACGG GCAGGACTGG TTCGCCCGCA TCTTCGGCCG CGCGGGCTTC
ACGCTCTTGC CCGAACCCGT CCGCCTTGCG CGGGCGCTCC CCGACCTCTA CCGTTCCCTC
ACCCAGGAGA TCTGA
 
Protein sequence
MSFHLLDLME PEETVGNLWH DYASRFAATR GHAEAAVPLE AMRPSVAVLF RALGGPSGVE 
IGASPATAAL HRRSVGSVLG ETRTFEQVAR FDGERLRLPA VLDTFPSADM NRAAYLWLAA
LAAKVEIPEA EGALAADLAQ IAAMKAATAR VLKTAPGLAA IYGRLAAHVL ASRETHAAFG
IEASIEARVR AALGGPAATR AVAPHAPRGY RPFAPVPLWL RLRRGTGQGA ADAPEETQAP
AGLPTATRKM GERRDLDQAN RKDNFIVHRF EAILSWVESL NLNRATDDDD QENAQKAAED
QDRITLTKHV KRAASRLRLH LDLAPQDAEH ERLSDRFTYP EWNHRSRSLM PDHTRVLEAP
AEAGQGFCPD PRLSARVRRQ FETLHPRRVL CNRQVEGAEL DLDALIEAQV ALRTTGRGTD
RIYRSSRALE RDLAVAILMD CSRSTEASVG ERTVIDTARE ALAALAAGID VAGDRQAIWG
FSSLRRDRVF LHLCKRFDEP MGPAVTARIG GLRPGHYTRL GAAIRHASAR LNEEAASRKL
LLVLTDGKPN DLDHYEGVHG IEDSRMAVRE ARALSQAVHG VVIDADGQDW FARIFGRAGF
TLLPEPVRLA RALPDLYRSL TQEI