Gene RPD_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1785 
Symbol 
ID4022267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1999861 
End bp2001105 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID637961979 
ProductVWA containing CoxE-like 
Protein accessionYP_568922 
Protein GI91976263 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.23574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.248193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGCTCA TTCTTTGCTC GATAACTTTG AGCGAGTTCG ACATGCCCAC AATCGATCAC 
CTCAATCCGC CCACCGGCAT GATGGCCGAC AACGTCGTCG GCTTTGCCCG CGCGCTGCGC
GCCGCCGGGT TGCCGGTCGG GCCCGGCGCG GTGATCGATG CGCTGAACGC GCTGCAACTG
ATCGAGATCG GCAATCGCGA CGATCTCTAC GCGACGTTGG AGGCGATCTT CGTCAAGCGT
CGCGAGCACG CGCTGATCTT CGCGCAGGCC TTCGCGCTGT TCTTCCGCGC CGCGGAGGAG
TGGCAGCACA TGCTGGATTC GATCCCGCTG CCGGATCACG CCAGGAAGAA GCCGCCGCCG
GCCTCGCGCC GGGTGCAGGA AGCGATGGCG CCGTCGACGA CCCGGGACTT CCCTTCCGCC
GAGGAGCAGG AAATCCGGCT CGCGGTGTCG GACAAGGAGA TCCTGCAGAA GAAGGACTTC
GCGCAGATGA GCGCTGCGGA GATCGCCGAG GTCACTCGCT CGATCGCGCG GATGCGGCTG
CCGCAGGCGG AATTGCGCAC GCGCCGCGTC CGGCCGGACA AGCGCGGTCT CAAGCTCGAT
CTGCGCCGCA CGCTGCGCGC TTCGCTCCGG ACCGGCGGCG ACATCGTCGA TATCCGCAGG
CTCGGCCTGA TCGACAAGCC GGCGCCGATC GTGGCGCTGC TCGATATCTC CGGCTCGATG
AGCGAATACA CGCGGCTGTT CCTGCACTTC CTCCACGCCA TCACCGACGA TCGCAAGCGG
GTCTCGACCT TCCTGTTCGG CACGCGGCTG ACCAACGTCA CCCGCGCGCT GCGGGCGCGC
GATCCCGACG AGGCGCTGGC GAGTTGCACG TCGTCGGTCG AGGACTGGGC CGGCGGCACG
CGGATCGCGA CCTCGCTGCA TGTCTTCAAC AAGGCGTGGG CGCGCCGCGT GCTGGGGCAG
GGTGCGATCG TGCTGCTGAT TTCCGACGGG CTGGAGCGCG AGGCCGATAG CAAGCTCGCC
TTCGAGATGG ACCGGCTGCA TCGCTCCTGC CGGCGGCTGA TCTGGCTCAA CCCGCTGCTG
CGCTTCGGCG GTTTCGAGCC GCGCGCGCAG GGCATCAAAA TGATGCTACC CCACGTTGAC
GAATTCCGCC CGGTGCATAA TCTGACCTCG ATGCAGGGAT TGATCGAGGC GCTGTCCTCC
GCGCCGCCGC CGCACCATTT CAGTGCGATC CGCTCGGCCG CATAA
 
Protein sequence
MLLILCSITL SEFDMPTIDH LNPPTGMMAD NVVGFARALR AAGLPVGPGA VIDALNALQL 
IEIGNRDDLY ATLEAIFVKR REHALIFAQA FALFFRAAEE WQHMLDSIPL PDHARKKPPP
ASRRVQEAMA PSTTRDFPSA EEQEIRLAVS DKEILQKKDF AQMSAAEIAE VTRSIARMRL
PQAELRTRRV RPDKRGLKLD LRRTLRASLR TGGDIVDIRR LGLIDKPAPI VALLDISGSM
SEYTRLFLHF LHAITDDRKR VSTFLFGTRL TNVTRALRAR DPDEALASCT SSVEDWAGGT
RIATSLHVFN KAWARRVLGQ GAIVLLISDG LEREADSKLA FEMDRLHRSC RRLIWLNPLL
RFGGFEPRAQ GIKMMLPHVD EFRPVHNLTS MQGLIEALSS APPPHHFSAI RSAA