Gene Rpal_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4300 
Symbol 
ID6411984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4628869 
End bp4630428 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content69% 
IMG OID642714182 
Producthypothetical protein 
Protein accessionYP_001993271 
Protein GI192292666 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCGAC CGACGATCAA GCATGGCACC TCGCGGCGCT TCAGGACGCC CGCCCTCCTC 
ACCCTGGCGA CGCTGCTCGC CGTGGCAGCC CCGGCGCCCG ACGCGGACGC CAAGCGGGCG
CGCCCGGCGG CCACCACCGA GGCGACCGCG CCGCGCGAAG CCGGTGAGCC GATCATGGCG
ATCGTCTCGA TCAAGGGTCA GCGGGTGACG TTGTACGACT CCGAAGGCTG GATCTATCGC
GCGCCGGTCT CGACCGGCAC CACCGGTCGC GAAACCCCGG CCGGCGTGTT CGCCGTGGTC
GAGAAGGACA AGGACCACCG TTCGACGATG TACGACGACG CCTGGATGCC GAACATGCAG
CGCATCACCT GGAACGGCGT CGCGCTGCAC GGTGGTCCGC TGCCCGGCTA TCCCGCGTCA
CACGGCTGCG TCCGGATGCC GTACGAGTTC GCCGAGAAGC TGTTCGACAA GACCCGGATC
GGGATGCGGG TGATCGTGTC GCCGGAGGAC GTCGAGCCGG CCGATATCAG CCATCCGGTG
CTGTTTTCGC CGAGTGCCGA GGCGCTGGCC GCCGCGCCGA CGCGCGCCGA GACCGCTGTG
CGTGAGGCCG AGCAGGCCGC GCAGGCGGCC GACGAGGCCA AGACCGCCGC GGCCGCTGCC
GCCCGTGCGG TAAAACCGCT CAAAGACAGC TTACGCAAGC TGGAGCGCGC CAAGGCGCGG
GCCGAAGCCG CGCTGAAGGC CGCCGACAAG GTGCTGGTCG CCGCCGCCAC CGATGAAGCC
AAGGCCAAGG CGGAAGAGCG TCAGCAGCAG GCCGCGCAGC AACTCGGCGA AGCCACGACC
CAGCTCGAAA CCGCCAAAGC GGATGCCGAC GCCAAGCACG CCGCCGCGGC CGCCACCAAG
GAGGCGGCCA AAGCTACCGC GGCGAAGAAA GCGGAAACCG CGAAGCTCGC GACCGACGCC
AAGCTGGCGC AGGAGCCGGT GTCGATCTAC ATCAGCCGGG CGACGCAGAA GCTCTACGTC
CGCCGCAACA CCCGCAAGCC GCTGCCCGAT GGCGGCGAGC TGTTCGACTT CTCGATCGAA
GTGCCGGTGG CGATCCTCGA TCCGGAGCGG CCGATCGGCA CTCACATCTT CACCGCGACG
GCGCGCAACG ACGCCGGCCT GCGCTGGAGC GCGGTGACGA TCGAGAGCGC CGACAATGCC
AAGAGCGCGC TCGACCGCGT CACGATCCCG CCGGAGGTGC TGGAGCGGAT CGGCCCGACC
GCGCTGCCGC GCTCCTCGAT CATCATCTCC GATGAGCCGC TGAGCGCAGA GACTAACTAC
CGCACCGAAT TCGTCGCGGT GCTGAGCGAT CAACCGCAGG GCGGCTTCAT CACCCGCAAG
CCGACCAGCA GCGACGTTCC GGTGGCCAGC AGCGATGACT GGAACGATGG TGGCTTCGGC
TTCTTCTTCC AGCCGAGGGA GCAACGCGTC CCTGCGCAGT CCCGGCGCGG CCGCTACGGC
GAAGGCTATT ACCGCCAGCC GCAAGACTAC TACCGCCAGG AGCAGCCGGG CTGGTGGTAG
 
Protein sequence
MNRPTIKHGT SRRFRTPALL TLATLLAVAA PAPDADAKRA RPAATTEATA PREAGEPIMA 
IVSIKGQRVT LYDSEGWIYR APVSTGTTGR ETPAGVFAVV EKDKDHRSTM YDDAWMPNMQ
RITWNGVALH GGPLPGYPAS HGCVRMPYEF AEKLFDKTRI GMRVIVSPED VEPADISHPV
LFSPSAEALA AAPTRAETAV REAEQAAQAA DEAKTAAAAA ARAVKPLKDS LRKLERAKAR
AEAALKAADK VLVAAATDEA KAKAEERQQQ AAQQLGEATT QLETAKADAD AKHAAAAATK
EAAKATAAKK AETAKLATDA KLAQEPVSIY ISRATQKLYV RRNTRKPLPD GGELFDFSIE
VPVAILDPER PIGTHIFTAT ARNDAGLRWS AVTIESADNA KSALDRVTIP PEVLERIGPT
ALPRSSIIIS DEPLSAETNY RTEFVAVLSD QPQGGFITRK PTSSDVPVAS SDDWNDGGFG
FFFQPREQRV PAQSRRGRYG EGYYRQPQDY YRQEQPGWW