Gene RPD_3702 details

Gene Information

Locus tag: RPD_3702
Symbol:
ID: 4024218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4131022
End bp: 4132491
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 68%
IMG OID: 637963906
Product: protein of unknown function DUF463, YcjX-like protein
Protein accession: YP_570824
Protein GI: 91978165
COG category: [R] General function prediction only
COG ID: [COG3106] Predicted ATPase
TIGRFAM ID:
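
The coordinates and lengths listed above are consistent with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4131022, 4132491)
print(length)                  # 1470
print(protein_length(length))  # 489
```

Both values match the Gene Length and Protein Length fields of this record.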


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.79297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTTCC GTTTTTCCAA TCTGGTCGAG GAAGCGCTGC TGTCGGCGCG GGCGCTGAAA 
GACTACAGCG AGAACATCTT CAATCCGACC ATCCGGCTCG GCGTCACAGG ATTGTCACGC
GCCGGCAAGA CGGTGTTCAT CACCGCGCTG GTGCATGGCC TGTCTCGTGG CGGGCGGTTT
CCGATCTTCG AATCGATGTC GACCGGGCGG ATCGCCAAGG CCCGGCTGGC GCCGCAGCCC
GACGATGCGG TGCCGCGATT CGGCTACGAG GGCTTCCTCG CCACGCTGAT GGAGCAGCGC
AACTGGCCGA GTTCGACGGT GGATATCAGC GAGCTGCGTC TGGTGATCGA CTATCAGCGC
AAGAACGGCG CCGAGCGGAC CCTGACGCTG GATATCGTCG ACTATCCCGG CGAGTGGCTG
CTCGACCTGC CGCTCCTGAA CAAAAGCTAC GAGCGCTGGG CGGCGGAGAG TCTGGCGCTG
TCGCGGCAGG ACCCCCGGCG GCGGGTCGCC CTGGACTGGC ATGCGCATCT CGCCACCCTC
GACCCCAACG GCCGCGAGAA CGAGCAGGAG ACGCTGACCG CCGCGCGGCT GTTCACCACC
TATCTGCGCG ACTGCCGCAA CGAGCAGTTC GCGATGAGCC TGCTGCCGCC GGGCCGCTTC
CTGATGCCGG GCAACCTCGC CGGCTCACCC GCGCTGACCT TTGCCCCGCT GGATGTGCCG
ATCGACGGGA CCGCGCCGGA GCGTTCGCTG TGGGCGATGA TGCGGCGGCG CTACGAGGCC
TACAAGGACG TCGTGGTGCG GCCGTTCTTC CGCGATCACT TCGCCCGGCT CGACCGCCAG
ATCGTGCTGG CGGATGCGTT GTCGGCGTTC AACGCCGGTC CCGAGGCGCT GCAGGACCTT
GAAGCGGCGC TCGCCGGCAT CCTCGATTGC TTCCGGGTCG GGCGGTCGTC GATGCTCTCG
ACGATGTTCC GGCCGCGGAT CGATCGCATC CTGTTCGCGG CGACCAAGGC CGACCATCTG
CACCATTCCA GCCACGACCG GCTCGAGGCA ATCCTGCGCA AGCTGGTCGA GCGGGCGATG
CAGCGCGCCG AATTCGCCGG CGCAACCGTC GACGTGGTCG CGCTGGCCGC GGTGCGCGCG
ACACGCGAGG CCCAGGTGCA GCGCGGCCGC GACCGGCTGC CGTCGATCGT CGGCACCCCG
ATCAAGGGCG AAATGGCCGA CGGCGAGATC TTCGACGGCG AGACCGAAGT CGCCACCTTC
CCCGGCGACC TGCCGACCAA TCTGCAGGGC CTGTTCAAGG GCGAGGACAC CTTCCGCGGC
CTTGCGGCAG GGTCGCACGA GGACGCCGAT TTCCGCTTCC TGCGCTTCCG GCCGCCGCGG
CTCGACAATC GCGATCCGGA CGGCCCGGCA CTGCCTCACA TCCGCCTCGA CCGCACCCTC
CAGTTCCTGA TCGGAGACAA ATTGCAATGA
 
Protein sequence
MAFRFSNLVE EALLSARALK DYSENIFNPT IRLGVTGLSR AGKTVFITAL VHGLSRGGRF 
PIFESMSTGR IAKARLAPQP DDAVPRFGYE GFLATLMEQR NWPSSTVDIS ELRLVIDYQR
KNGAERTLTL DIVDYPGEWL LDLPLLNKSY ERWAAESLAL SRQDPRRRVA LDWHAHLATL
DPNGRENEQE TLTAARLFTT YLRDCRNEQF AMSLLPPGRF LMPGNLAGSP ALTFAPLDVP
IDGTAPERSL WAMMRRRYEA YKDVVVRPFF RDHFARLDRQ IVLADALSAF NAGPEALQDL
EAALAGILDC FRVGRSSMLS TMFRPRIDRI LFAATKADHL HHSSHDRLEA ILRKLVERAM
QRAEFAGATV DVVALAAVRA TREAQVQRGR DRLPSIVGTP IKGEMADGEI FDGETEVATF
PGDLPTNLQG LFKGEDTFRG LAAGSHEDAD FRFLRFRPPR LDNRDPDGPA LPHIRLDRTL
QFLIGDKLQ
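
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits; this gene begins with ATG, so that does not matter here). `translate` and `CODON_TABLE` are hypothetical names, and the examples use only the first and last printed lines of the gene sequence.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence in this record.
print(translate("ATGGCCTTCC GTTTTTCCAA TCTGGTCGAG"))  # MAFRFSNLVE
print(translate("CAGTTCCTGA TCGGAGACAA ATTGCAATGA"))  # QFLIGDKLQ
```

The two outputs match the first ten and last nine residues of the protein sequence above; the final TGA codon is the stop and is not emitted.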