Gene Rpal_0444 details


Gene Information

Locus tag: Rpal_0444
Symbol:
ID: 6408092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 479406
End bp: 480491
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 67%
IMG OID: 642710356
Product: hypothetical protein
Protein accession: YP_001989480
Protein GI: 192288875
COG category: [S] Function unknown
COG ID: [COG5330] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
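
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the gene record.
start_bp = 479406
end_bp = 480491
gene_length_bp = 1086
protein_length_aa = 361

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # coordinates match the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions all three checks agree with the listed values: 1086 bp corresponds to 362 codons, of which 361 encode residues.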


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.71629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCCG CTACATCTCT GCTTCCTGAA CTCGATGACA TTGTCAGGCA GGGCGATCCC 
GTCCGGCGCG CCGACGCCGT GCGCCGGATT TCCGACCTGT ATATTCGGGG CGCTGAGAGC
TTCCAGCCCG ATCATGTCGC GCTGTTTGAT GGCATCCTGC TGACGCTGGT GCCGGAGATC
GACGTCGAAG TTCGCAGCGA ACTGGCACAG CGGTTTTCGG AAATCACCAA TGCGCCGCCC
GAACTGGTCC GGCAGCTCGT GCATGACGAA GACATCGGTA TCGCCGGGCC GCTGTTGCGG
CGCTCGACGA TGCTCGATGA TCCAACGCTC GTCGAACTCG CCAGGCTGCG TGGTCAGACG
CATCTCCTGG CGATCTCCGA GCGGCTGAGC ATTTCGCCGC CGATCACCGA CGTGATCGTG
CGCCGGGGTG ACCGCGATGT GGTGCGCAAG GTCGCCGGCA ACGCGGGCGC CGAATTCTCC
GCCACCGGTT TCAACGGCCT GATCCGCCGT GCCGCGCAGG ACGGTGTGCT GGCGGTCGCG
GTTGGCACGC GGGACGATCT GTCGCCGCCG CGGCTGAAGG ATCTCTTGGC GTGCTCGACC
GATCTGGTGC GCCGGCGCTT GTTCGAAAGT GCGCGGCCGA GTGCGCGGAT CGCGATCAAC
CGGGCGATGC GCGAGCTCGC TGGCGAGTCG CGGCAGCCGT CGGTGCAGCG CGATTTCGAT
GCCGCACAGC GCTCGGTGGT GGAGTTGCAC AACAGGGGTG AACTCAACGA AGCGACCGTG
ATCGGCTTCG CGCGGGCGCA TCAATACGAG GAGACCGTGG CGGCGCTGTC GGCGATGACC
GGCACGCGAA TCTCCACCCT CGACCAGATG ATGTCCGGCG AGCGGCACGA CCCGGTGCTG
ATCCTCGGCA AAGCGCTCGG CTTCGGCTGG GCGACCGTAC GAGCCCTGAT CGGGCTGCGG
CTCGGGCCGG ACCGCTCGGT GGCCTGCCCC GACGTCGAAG AAGCGCAGCA CAATTTCGAG
CGCCTGGCGC TGTCCACGGC GCAGCGTGTG CTCGGCTTCT GGAAGATGCG ACAGGCTGAC
GCCTGA
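
The GC content reported in the record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with spaces and line breaks between 10-base groups, as it appears here. The function name `gc_percent` is hypothetical, and the example call uses only the first 60 bases, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, line breaks) is ignored so the grouped
    layout used in this record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (60 bases), for illustration.
first_line = "ATGCCTGCCG CTACATCTCT GCTTCCTGAA CTCGATGACA TTGTCAGGCA GGGCGATCCC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1086-base sequence to the same function would give the value to compare against the GC content listed in the record.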
 
Protein sequence
MPAATSLLPE LDDIVRQGDP VRRADAVRRI SDLYIRGAES FQPDHVALFD GILLTLVPEI 
DVEVRSELAQ RFSEITNAPP ELVRQLVHDE DIGIAGPLLR RSTMLDDPTL VELARLRGQT
HLLAISERLS ISPPITDVIV RRGDRDVVRK VAGNAGAEFS ATGFNGLIRR AAQDGVLAVA
VGTRDDLSPP RLKDLLACST DLVRRRLFES ARPSARIAIN RAMRELAGES RQPSVQRDFD
AAQRSVVELH NRGELNEATV IGFARAHQYE ETVAALSAMT GTRISTLDQM MSGERHDPVL
ILGKALGFGW ATVRALIGLR LGPDRSVACP DVEEAQHNFE RLALSTAQRV LGFWKMRQAD
A
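
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows how the two relate; it is not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code for internal codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 uses these same amino-acid assignments;
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and its final two codons.
first_line = "ATGCCTGCCG CTACATCTCT GCTTCCTGAA CTCGATGACA TTGTCAGGCA GGGCGATCCC"
last_codons = "GCCTGA"

print(translate(first_line))   # MPAATSLLPELDDIVRQGDP
print(translate(last_codons))  # A
```

The first 60 bases give the first 20 residues of the protein sequence, and the final codons GCC TGA give the terminal A followed by the stop codon.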