Gene RPB_3941 details


Gene Information

Locus tag: RPB_3941
Symbol:
ID: 3911748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4497283
End bp: 4498752
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 68%
IMG OID: 637885845
Product: hypothetical protein
Protein accession: YP_487545
Protein GI: 86751049
COG category: [R] General function prediction only
COG ID: [COG3106] Predicted ATPase
TIGRFAM ID:
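
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function name `check_cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that CDS coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon.
    """
    # Inclusive coordinates: span = end - start + 1.
    span_bp = end_bp - start_bp + 1
    # A complete CDS is a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span_bp == gene_length_bp,
        "whole_codons": remainder == 0,
        # Subtract one codon for the assumed stop codon.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for RPB_3941.
result = check_cds_lengths(4497283, 4498752, 1470, 489)
for name, ok in result.items():
    print(f"{name}: {ok}")
```

Under these assumptions all three checks pass for this record: 4498752 - 4497283 + 1 = 1470 bp, and 1470 / 3 = 490 codons, one of which is the stop codon, leaving 489 amino acids.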


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00618827
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTTCA GTTTTTCAGA TCTGGTCGAG GAAGCGTGGC TGTCGGCGCG GGCGCTGAAG 
GACTACAGCG AGAATATCTT CAATCCGACG GTCCGGCTCG GCGTCACCGG GCTGTCACGT
GCCGGCAAGA CGGTGTTCAT CACCGCGCTG GTGCACGGCC TGTCGCGCGG CGGGCGGTTT
CCGATCTTCG AATCGATGTC GACCGGACGA ATCGCCAAGG CCCGGCTGGC GCCGCAGCCC
GACGACGCGG TGCCGCGGTT CGGCTATGAG GGCTTTCTCG CCACCCTGAT GGAGCAGCGC
GACTGGCCGA GTTCGACGGT GGATATCAGC GAATTGCGGC TGGTGATCGA CTATCAGCGT
AAGAACGGCG CCGAACGCAC GCTGACGCTG GACATCGTCG ACTACCCCGG CGAATGGCTG
CTCGACCTGC CGCTGCTGAA CAAGAGCTAC GAGAAATGGG CGGCGGAGAG CCTGGCGCTG
TCGCGACAGG ACCCGCGCCG CCGCGTCGCC GTCGACTGGC ATGCGCATCT CGCCACGCTC
GATCCGAACG GCCGCGAGAA CGAGCAGGAA ACGCTGACCG CGGCGCGGCT GTTCACCACC
TATCTGCGCG ACTGCCGCAA CGAGCAGTTC GCGATGAGCC TGCTGCCGCC CGGCCGTTTC
CTGATGCCCG GCAACCTCGC CGGCTCGCCG GCGTTGACCT TCGCGCCTCT GGATGTGCCG
GAGGGCGGCA CCGCGCCGGA GCGCTCGCTG TGGGCGATGA TGGTGCGACG CTACGAGGCC
TACAAGGACG TGGTGGTGCG GCCGTTCTTC CGCGATCACT TCGCGAGGCT CGACCGCCAG
ATCGTGCTGG CGGATGCGCT GTCGGCTTTC AACGCCGGCC CCGAGGCACT GCAGGACCTC
GAAGCCGCGC TCGCCGGCAT TCTCGACTGT TTCCGGGTCG GGCGCGCGTC GATGCTGTCG
ACGATGTTCC GGCCGCGGAT CGACCGCATC CTGTTCGCCG CCACCAAGGC CGACCATCTG
CACCATTCCA GCCACGACCG GCTGGAGGCG ATCCTGCGCA AGCTGGTCGA GCGGGCGATG
CAGCGCGCCG AATTCGCCGG GGCGACCGTC GACGTGGTGG CGCTGGCCGC GGTGCGCGCC
ACCCGCGAGG CCCAGGTGCA GCGCGGCCGC GACCGGCTGC CGTCGATCGT CGGCACCCCG
ATCAAGGGCG AAATGGCCGA CGGCGAGATC TTCGACGGCG AGACCGAGGT CGCTACCTTC
CCCGGCGACC TGCCGACCAA TCTGCAGGGG CTGTTCAAGG GCGAGGACAC GTTTCGCGGC
CTCGCCGCGG GCCGCCACGA GGACGCCGAT TTTCGCTTCC TGCGCTTCCG GCCGCCACGG
CTCGACAATC GCGATCCGGA CGGCCCGGCA CTGCCTCACA TCCGCCTCGA CCGCGCGCTC
CAGTTCCTGA TCGGAGATCA ACTGCAATGA
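
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs cleaning before any length or GC calculation. The sketch below is illustrative only: `clean_sequence` and `gc_percent` are hypothetical helper names, and the demonstration uses just the first ten-base block of the sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First ten-base block of the gene sequence (not the whole gene).
first_block = "ATGGCCTTCA"
print(gc_percent(first_block))  # 50.0 for this ten-base block only
```

Pasting the complete gene sequence into `gc_percent` would give a value to compare against the GC content listed in the gene information, and `len(clean_sequence(...))` can be compared against the listed gene length.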
 
Protein sequence
MAFSFSDLVE EAWLSARALK DYSENIFNPT VRLGVTGLSR AGKTVFITAL VHGLSRGGRF 
PIFESMSTGR IAKARLAPQP DDAVPRFGYE GFLATLMEQR DWPSSTVDIS ELRLVIDYQR
KNGAERTLTL DIVDYPGEWL LDLPLLNKSY EKWAAESLAL SRQDPRRRVA VDWHAHLATL
DPNGRENEQE TLTAARLFTT YLRDCRNEQF AMSLLPPGRF LMPGNLAGSP ALTFAPLDVP
EGGTAPERSL WAMMVRRYEA YKDVVVRPFF RDHFARLDRQ IVLADALSAF NAGPEALQDL
EAALAGILDC FRVGRASMLS TMFRPRIDRI LFAATKADHL HHSSHDRLEA ILRKLVERAM
QRAEFAGATV DVVALAAVRA TREAQVQRGR DRLPSIVGTP IKGEMADGEI FDGETEVATF
PGDLPTNLQG LFKGEDTFRG LAAGRHEDAD FRFLRFRPPR LDNRDPDGPA LPHIRLDRAL
QFLIGDQLQ