Gene RPD_0133 details


Gene Information

Locus tag: RPD_0133
Symbol:
ID: 4020589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 152241
End bp: 153359
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 66%
IMG OID: 637960310
Product: von Willebrand factor, type A
Protein accession: YP_567274
Protein GI: 91974615
COG category:
COG ID:
TIGRFAM ID:
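
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 152241
END_BP = 153359


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Under these assumptions both results match the Gene Length and Protein Length fields.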


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid HitchhikerNo
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCA TCACAATGTT ACGGGCGCTG GCCTTCGCGG CGCTGATCGC GCCGCTGGCG 
ATCCCCTCCG CCGCGCGCGC GCGGCCGGCG GTCGAGGTCG CCTTCGTGCT CGACACCACC
GGGTCGATGA GTGGACTGAT CGAAGGCGCC AAGCGCAAGA TCTGGTCGAT CGCCACCGCG
ATCGTCGACA GCAATCCGGG CGCCGATATC CGGATGGGTC TGGTGGCGTA TCGCGACATC
GGCGACGACT ACGTCACCCG CAACGTCGAG CTGACGCCTG ATATTCAGGA TCTCTACGCG
CGGCTGCTCG AATTGCAGGC GCGCGGCGGC GGCGACTGGC CGGAGAGCGT CAACGAGGCG
CTCGACGTCG CGGTCAACAA GCTGCGCTGG AGCAAGGACG GCGACACCCG CCGCATCGTG
TTCCTGGTCG GCGACGCGCC GCCGCATATG GACTACGCCC AGGACACCAA ATATCCGACC
ACGTTGTCGG TGGCGCGGCA GAAGGACATC ATCGTCAACG CGGTGCAGGC GGGTGCTGCG
CGCGACACCG AACGGGTGTG GCACGAGATC GCCGACGGCG GCCGCGGCCG CTACATCCCG
ATCCCGCAGG ACGGCGGCCA GATCGTGGTG ATCGAGACGC CCTACGACCA GGACATCATC
ATCCTGCAGA ACCGGATCAA TGGCACCGTG ATCCCCTACG GCCCGGCGCC GTTGCAAAAG
CGCACCGAGG AGCAGACCCG GCAATTGTCG AAGGTTGCAG CCGCAGCGCC GGCCGCGGCC
TCCGACATGG CGAGCTACAT CAACAAGCGG GCCCGCACCT CGTCGGAGGC CGTCACCGGC
GGCGGCGATC TGGTCAGCGA CGTGATGGCC GGGCGGCAGA AACTCGATCA GGTCAAGGAC
GAGGAACTAC CCGACAGCCT GCGCGCCCTG CCGGCGGAGC AGCGCACCGC CAAAATCGAA
GCTGAAATGA ACCAACGCAA GGCGCTGAAC GAGAAACTCG CCACGCTGGT GAAACAACGC
GACGCCTATC TGTTTGCCCA GCGCGACAAG CAGCCCGAAC AAGCCTCATC CTTCGACCGC
GAAGTCGAAG CGACGCTGAA GGCGCAGCTG AAGCGGTAG
 
Protein sequence
MKRITMLRAL AFAALIAPLA IPSAARARPA VEVAFVLDTT GSMSGLIEGA KRKIWSIATA 
IVDSNPGADI RMGLVAYRDI GDDYVTRNVE LTPDIQDLYA RLLELQARGG GDWPESVNEA
LDVAVNKLRW SKDGDTRRIV FLVGDAPPHM DYAQDTKYPT TLSVARQKDI IVNAVQAGAA
RDTERVWHEI ADGGRGRYIP IPQDGGQIVV IETPYDQDII ILQNRINGTV IPYGPAPLQK
RTEEQTRQLS KVAAAAPAAA SDMASYINKR ARTSSEAVTG GGDLVSDVMA GRQKLDQVKD
EELPDSLRAL PAEQRTAKIE AEMNQRKALN EKLATLVKQR DAYLFAQRDK QPEQASSFDR
EVEATLKAQL KR
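
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the method used to generate this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and the helper names `clean`, `translate` and `gc_percent` are hypothetical. Only the first 60 bases of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Codon table built in TCAG order; amino-acid assignments are those of the
# standard genetic code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases).
first_line = "ATGAAACGCA TCACAATGTT ACGGGCGCTG GCCTTCGCGG CGCTGATCGC GCCGCTGGCG"
print(translate(first_line))  # MKRITMLRALAFAALIAPLA
```

The output matches the first 20 residues of the protein sequence. Calling `gc_percent` on the full gene sequence gives a value to compare against the GC content field.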