Gene Rpal_5301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5301 
Symbol 
ID6413002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5720670 
End bp5722355 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content63% 
IMG OID642715190 
Producthypothetical protein 
Protein accessionYP_001994262 
Protein GI192293657 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGAA TCCTTGTTAC CGTAGTCACG CTCGTGGCGG TCATCCCTCC GGCCGCGGCG 
GCCGACATGC CGCGCTTTCC TCCGCTCATG CCCGTTGCGC CGGTGTCGTG GAATTGGACG
GGCCTGTATT GGGGCGCCCA TGTCGGCGGC AGCTTCGGCT CATCGAGCTT CAGCGATCCC
GCTGGCCCCG CTCTCTACGG CGGCAGCGTC CGCAGCCCGG CTGCGCTGGC AGGCATCCAG
ATCGGCTACA ACTATCAGCC GAACGCGAAT TGGCTGATCG GCGTCGAGGC CGATCTCAGT
TCGATGAACG CCAAGGGCAC CAACACTTGC ATGGCGTCGT CGGGGTTGTT CATTTCGTCC
AATTGCCGCG TCCGCCAGGA TGCGCTGGCG ACGCTGACCG GCCGTGCCGG CTTCGTCGCC
GGTCCCGGCG GCCGCACGCT GCTCTACGCC AAAGGCGGCG CGGCTTTCCT GAGTGAACGG
CTCGACATCG CCGTCGGCTA TCCGCTCGGC AACTCGACCG ACGTCAACGA CGGTCGGTGG
GGCTGGACCG TCGGTGCCGG CATCGAACGG GCGCTGGCGC CGGCCTGGTC GGTGAAGCTC
GAATACAACT ACGCCGATTT CGGCAGCCGC GACATGGCGA CACCGGACAG CGAACGGCTG
GTGCCGGGCG TCGGTTCTTT CGCCACGCCG CAGGGCAGGA GCAGGGTCGA GCAGGACCTG
CATGCGGTGA AGGTCGGCCT CAACCTCAAA TTCGGCGGCG ATGTCGACGC ACGCTTCGAC
GACTATCATC TGCGCGGCAC CCAGGCCGGG GTCGATGTCG TCGAGCGCGG CGCGGTCGAG
GTCGGCGGTC GCGTCTGGTA TAGTTCGGGT CGTTTCCAGA AAGATCTCGG CGGCACATTC
GATCAGGGCA ATCAGAATTT CCTGATCTCG CGGTTGACCT ATCAAAATAC CGCTGTGTCG
GGGGAGGCTT TCGGTCGTAT CGAAGGTCCC TACGACACCT TCCTCAAGGG CTTTGCCGGC
GGCGGCACGC TATTGAGTGG ACATATGAAC GACGAGGACT GGATCGCCTC CGACGGCATC
CCCTATTCGA ATACGCTGTC GGATCCGGTT AGGGGCAGCA TCGCCTATGC GACTGTGGAT
GTCGGCTACA ATCTGGTGCA CGGGCCGGAC TACAAATTCG GCGGCTTCGT CGGTTACAAT
TATTATCGCG AGAACAAATC GGCCTATGGC TGCGTGCAGA CGGCAGGCGC CACGTCGCTG
GTCTGCGCGC AACCGATTTC GGATGCAGTT CTCGCCATCA CCCAGAACGA CACCTGGCAT
TCGTTGCGGG TCGGCCTCAA CGGTGAGATC GGGCTCGGTC GTGGGGTCAA GCTCTCGGCG
GATGCGGCCT ATCTGCCGTA TGTGAAGACC TTCGGAACCG ATACGCATCT GCTGCGGACC
GACGTCGCCG ACAAAGTGTC GCCGTCACAG GGCACGGGGC AGGGCGTCCA GCTCGAAGCC
ATCCTGTCGT ATCAGTTGAA CAGCGCTTTC AGCATTGGCG CCGGTGCGCG GTATTGGGCG
ATGTGGGCGA CCACCAACGC CTACACCAAT ATCTTCGGCA CGGAGTGTCC CTGTCAGACC
CAGCCGGCGC GCACCGAGCG CTATGGAACC TTTTTGCAAG CGGCTTATAA GTTCGACAGC
CTCTAA
 
Protein sequence
MSRILVTVVT LVAVIPPAAA ADMPRFPPLM PVAPVSWNWT GLYWGAHVGG SFGSSSFSDP 
AGPALYGGSV RSPAALAGIQ IGYNYQPNAN WLIGVEADLS SMNAKGTNTC MASSGLFISS
NCRVRQDALA TLTGRAGFVA GPGGRTLLYA KGGAAFLSER LDIAVGYPLG NSTDVNDGRW
GWTVGAGIER ALAPAWSVKL EYNYADFGSR DMATPDSERL VPGVGSFATP QGRSRVEQDL
HAVKVGLNLK FGGDVDARFD DYHLRGTQAG VDVVERGAVE VGGRVWYSSG RFQKDLGGTF
DQGNQNFLIS RLTYQNTAVS GEAFGRIEGP YDTFLKGFAG GGTLLSGHMN DEDWIASDGI
PYSNTLSDPV RGSIAYATVD VGYNLVHGPD YKFGGFVGYN YYRENKSAYG CVQTAGATSL
VCAQPISDAV LAITQNDTWH SLRVGLNGEI GLGRGVKLSA DAAYLPYVKT FGTDTHLLRT
DVADKVSPSQ GTGQGVQLEA ILSYQLNSAF SIGAGARYWA MWATTNAYTN IFGTECPCQT
QPARTERYGT FLQAAYKFDS L