Gene RPD_4247 details


Gene Information

Locus tag: RPD_4247
Symbol:
ID: 4024768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4715292
End bp: 4716650
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 67%
IMG OID: 637964453
Product: twin-arginine translocation pathway signal
Protein accession: YP_571365
Protein GI: 91978706
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
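
The coordinate and length fields above agree with one another, and the arithmetic can be sketched in a few lines of Python. This is a minimal check under two assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length counts the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length(4715292, 4716650)
print(length, protein_length(length))  # prints "1359 452"
```

Under those assumptions the result matches the listed Gene Length (1359 bp) and Protein Length (452 aa).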


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.262982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCAT CACCGCCGAC ATCTTCACCT GAGACCACGC AAGTCGGCGC GCCGCGCCGG 
CTTGCGTTTC CCGTTCGGAG ACATGACGTG CCGAAACCAA ACCTGATCAG CCGTCGCGGC
TTGATCCGCG CGGCCGCGGC GACCACCGCG CTGCTCGCCT GTCCGGCGAT CGCCAAGGCG
AAGCCGCGCG TCGTCGTGGT CGGCGGCGGC GCCGGCGGCG CGACCGCGGC GAAGTATCTG
CGTCACGGCG ACGACGCCGT CGAGGTCACG CTGGTCGAAG CCAACCGCGT TTATGTGACG
CCGTTCACCT CGAACCTGTT TCTCGGCGGG CTGAAATCGT TCGAGACGCT GAGCTTCGGC
TATGAGGCGA TCGCTGCGCG CGGCGTCGGC ATGGTGTTCG ACAGCGTCAC CGCAATCGAC
CGCGACGCCA GACAGGTGCG CACCGCAGGC GGCGCGCGGC TGGCCTACGA CCGGCTGGTG
TTGTCGCCCG GCATCGATTT CCGCTGGGAT GCGGTCCCCG GCTATTCCGA AGCTGCGGCC
GAGCTGATGC CGCACGCTTA TCGCGGCGGC GCCCAATTCA AGCTGCTGAA GAGCCGCCTC
GATGCGCTGT CGGACGGCGC TCTGATCGTG ATCATCGCGC CGCCGAACCC GTATCGCTGC
CCGCCGGCGC CGTATGAACG CGCCTCGATG ATGGCCTATG CGCTGAAGAG CCGTGGCGTG
AAGAACGCCC GCATCGTGAT CCTCGACGCC AAGGATCACT TCGCGATGCA GACGCTGTTC
ATCGACGGCT GGGAGCGCCA CTACACCGGC ATGATCGAGT GGCAGGACCC GACCATCCAC
GGCGGCATCA AGGCGGTCGA TCCAAAGGCG ATGACGGTGA CCACTGATTT CGAGACCCAC
AAGGCGTCGC TGGTCAACGT CATTCCCCCG CAGATCGCCG GACGGCTCGG CCGCGACGCC
GGCCTCGCCG ACGACAACGG CTTCTGCCCT GTCTACGCCG AGACCATGAC GTCGCGGATC
GATCCGCTGA TCCAGGTGAT CGGCGACTCC GCGGTCGGCG GCGAGTTTCC GAAGTCCGGC
TTCGCTGCCA ATAACGAGGC CAAGGCCGCG GCGATGATCC TGCGCGCCGA ACTGCTCGGC
GAGCGGCGGA TGCCGGTGCG CTTCACCAAC CATTGCTGGA GCGGCATTGC GCCCGACGAC
GCGGTCAAGA ACGGCGCGCG CTACGCGCCG CAGGACGGCA AGATCGTGGC GTCCGATCCG
TACACCTCGC AGCTCGACGA AACTCCGCAG CTCCGCGCCA AGCAGGCGCG CGAAGCGGCG
GGCTGGTACG CCGGCATGAC GACGGATATT TTCGGCTGA
 
Protein sequence
MPASPPTSSP ETTQVGAPRR LAFPVRRHDV PKPNLISRRG LIRAAAATTA LLACPAIAKA 
KPRVVVVGGG AGGATAAKYL RHGDDAVEVT LVEANRVYVT PFTSNLFLGG LKSFETLSFG
YEAIAARGVG MVFDSVTAID RDARQVRTAG GARLAYDRLV LSPGIDFRWD AVPGYSEAAA
ELMPHAYRGG AQFKLLKSRL DALSDGALIV IIAPPNPYRC PPAPYERASM MAYALKSRGV
KNARIVILDA KDHFAMQTLF IDGWERHYTG MIEWQDPTIH GGIKAVDPKA MTVTTDFETH
KASLVNVIPP QIAGRLGRDA GLADDNGFCP VYAETMTSRI DPLIQVIGDS AVGGEFPKSG
FAANNEAKAA AMILRAELLG ERRMPVRFTN HCWSGIAPDD AVKNGARYAP QDGKIVASDP
YTSQLDETPQ LRAKQAREAA GWYAGMTTDI FG
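
The relationship between the two sequences above can be illustrated with a short Python sketch that strips the 10-character block spacing, computes GC fraction, and translates codons until the first stop. It is a sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code. Alternative start codons, which table 11 permits, are not handled; the gene here begins with ATG, so that does not matter for this example.

```python
# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments; it differs in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGCCGGCAT CACCGCCGAC ATCTTCACCT")
print(translate(prefix))  # prints "MPASPPTSSP"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the same comparison against the GC content and complete protein sequence given in the record.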