Gene Rpal_4957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4957 
Symbol 
ID6412649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5337024 
End bp5338319 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content64% 
IMG OID642714840 
Productoxidoreductase molybdopterin binding 
Protein accessionYP_001993921 
Protein GI192293316 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA AGCCCACATC CGACGTCCTC AACCGCCGCC GGTTTCTCGG CGCGGCGGGC 
CTTGGAGTAG CCGGTCTCGC CGGTGCTGGG TCGATGCTGC CCTCGCTCGC GGCCAAAGCC
AGTGAGGCGG CCAAGCCCGA TCCGGCGATC ACCGAGATCA AGGATTGGAA TCGCTATCTC
GGCGACGGCG TCGACAAGCG TCCCTATGGC GTGCCCTCGA AATTCGAGAA GGACGTGATC
CGCCGCGACG TGGCGTGGCT CACCGCGTCG CCGGAGTCCT CGGTCAACTT CACACCGCTG
CACGCGCTCG ATGGCATCAT CACCCCGTCC GGCCTGTGCT TCGAACGGCA TCACGGCGGC
GTTGCCGAGA TCGATCCGGC GCAGCACCGG CTGATGATCC ATGGCCTGGT TGACACCCCG
CTGGTGTTCA CTATGGACGA CATCAAGCGG ATGCCGCGCG TCAACAAGAT CTACTTCCTG
GAATGCGCGG CGAACTCCGG CATGGAGTGG CGCGGCGCGC AGCTCAACGG CTGCCAGTTC
ACCCACGGCA TGATCCACAA CGTGATGTAC ACCGGCGTCA CGCTGAAGAC GCTGCTCGAG
CAGGCCGGCG TGAAGTCCAA CGCCAAATGG TTGCTGCTCG AAGGCGCTGA CTCTGCCGGG
ATGGATCGGT CGCTGCCGCT GGAGAAGGCG CTCGACGACG TCATGATCGC CTATGCGATG
AACGGCGAGG CGCTGCGTCC GGAGAACGGC TATCCGCTGC GCGCCGTGAT CCCCGGTTGG
CAGGGCAATC TGTGGGTGAA GTGGCTGCGC CGGATCGAAG TCGGCGATAT GCCGTGGCAG
ACCCGCGAAG AGACCTCGAA GTACACCGAC CTGATGCCGG ACGGCCGCGC GCGCAAGCAT
ACGTTCGTGA TGGACGCCAA GAGCGTGATC ACCAGCCCGT CGCCGCAGAT GCCGCTGAAG
TTCAAGGGCC GCAACGTGCT CACCGGCATT GCCTGGTCCG GGCGCGGCAC CGTCAAGCGC
GTCGACGTCT CGATGGACGG CGGACGCAAC TGGTACGAAG CGCGGATCGA CGGCCCGGTG
CTGAACAAAT CGATCGTGCG GTTCTACGTC GACTTCGACT GGAACGGCGA AGAGCTGATG
CTGCAATCGC GCGCGATTGA CGAGACAGGC TACGTGCAGC CGACCAAGGC GGAGCTGCGC
AAGATCCGCG GCGTCAATTC CGTGTACCAC AACAACGGCA TCCAGACCTG GCTCGTGCAT
CCCGACGGAG TGACCGAAAA TGTCGAAATC GCTTAA
 
Protein sequence
MSEKPTSDVL NRRRFLGAAG LGVAGLAGAG SMLPSLAAKA SEAAKPDPAI TEIKDWNRYL 
GDGVDKRPYG VPSKFEKDVI RRDVAWLTAS PESSVNFTPL HALDGIITPS GLCFERHHGG
VAEIDPAQHR LMIHGLVDTP LVFTMDDIKR MPRVNKIYFL ECAANSGMEW RGAQLNGCQF
THGMIHNVMY TGVTLKTLLE QAGVKSNAKW LLLEGADSAG MDRSLPLEKA LDDVMIAYAM
NGEALRPENG YPLRAVIPGW QGNLWVKWLR RIEVGDMPWQ TREETSKYTD LMPDGRARKH
TFVMDAKSVI TSPSPQMPLK FKGRNVLTGI AWSGRGTVKR VDVSMDGGRN WYEARIDGPV
LNKSIVRFYV DFDWNGEELM LQSRAIDETG YVQPTKAELR KIRGVNSVYH NNGIQTWLVH
PDGVTENVEI A