Gene Rpal_4669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4669 
Symbol 
ID6412355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5030926 
End bp5032548 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID642714548 
Productprotein of unknown function DUF882 
Protein accessionYP_001993635 
Protein GI192293030 
COG category[S] Function unknown 
COG ID[COG3108] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.227252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAGGG CCGGTTACGG CGCCGTGCTG ACGACGGCAT TGCTGCTGGC GGGCGCCGGA 
TCGGTCCATG ACGCATCCGC CGTCGGCGAC AGCCGGACCC TGTCGTTCCA CCACACCCAT
TCGGGCGAGA GCCTCACCGT CACCTTCAAG CGCAGCGGCC GCTACGACGA AGATGCGCTG
AAGCAGCTCA ATCACTTCCT GCGTGACTGG CGATCCCAGG AACAGACTGT GATGGACCGC
CAGCTGTTCG ACATCCTGTG GGAAGTTTAC CGGGACGTCG ACGCCAAACA GCCTATCCAG
ATCATCTCCG CCTATCGCTC CCCTGCCACC AACGCCATGC TGCGCCGCCG CTCCTCGGGA
GTGGCGCGCC ACAGCCAGCA CATGCAGGGC CACGCGATGG ACTTCTTCAT CCCCGGCGTG
GCGCTGGAGC AGATCCGGTT TGCCGGCCTG CGGCTGCAGC GCGGCGGTGT CGGCTTCTAT
CCGACCTCCG GCTCGCCGTT CGTGCATCTC GATACCGGCG GCATCCGGCA CTGGCCGCGG
ATGACCCCCG ACCAGCTCGC CCGCGTGTTC CCGGATGGCC GCACCGTGCA CATCCCCACC
AACGGCAAGC CGCTGCGCGG CTACGAGCTG GCGCTGGCGG ACATCGAAAA GCGCCGCGAC
GGCAGCACTG TCGCACCGGC CAAGACCAAC TTCCTGGCAA CGCTGTTCGG CGGCAAGTCG
CGTGACGACG AGGACGAAAC CGCAGCGACG GCGGCACCGT CCGGCGCCAA GCCGATGGCC
GACATCAAGG CCGCGGCCGC CGACGCGGTC GCGGCTGCGG CCGGCGTGAA GCCGGCCGAC
GTGGGCTCCA GCGATCCGGT GCCGATGCCC CGCGCCAAGC CCGCCGCCGC TATCCAGATC
GCCTCCGCCG GCGACGTCGT GCTGCCGGCG CCCCGCCCGG CTCAGGCTGC TAAAGCCGAG
GCCAAGACCG CGGAACCGAG GACGGCAGAG TCGAAGACGG CTGACGCCAA GCCGCAAAGC
CCCGCCGACA TCATCAACGC CCGCGGGTTT TGGGACGACA TCCCCGTAGC ACCGAAGCAG
GCGAGCCCGG CCCAGGTCGC CGCCATCAGC GCCCGGCAGG CATTGGCCGC CGCCGACAAA
TCCGAACAGG CCGCCGCGAT GAACGCGCTG GCCTACGCGC CGATGGCGCA GGAAAATTCC
TCGAAGCACG CCCCGACCCG CCATCCGCAC GTCGTGACCG CCAGCGCCCC GCTGCCGCCG
ACGCGCGCAT CGCTGCAGCG GCAGGCGGCG GTGTCGGGCA AGGTCGACAG CGTGATCGGC
AAGTCGTCCG GTCAGGGCAA GACGGTGATC GCGACCTCGG CGCGACTCGC CGCCGCCGGC
AGCCGCGACA ACGACGTCTG GATCCGCGCC ATGATCCTGA TGCCGCGGGC GATGCACACC
GCCGCCACCG TGATTGGCGA TCCCGACATG ACGCTGCTGA GCGGCTATCT GGCCAAGCCC
GAGGCGACGC TGGCCACCAG CTTCGCCGAC GATCCGCAGC CGGGCCTCTA CGCCGACGCC
TTCAGCGGCT CGGCGGTGGC GACGCTGACC ACCACGGCAT TCCCGGGCGA CGCGTCGCGC
TGA
 
Protein sequence
MPRAGYGAVL TTALLLAGAG SVHDASAVGD SRTLSFHHTH SGESLTVTFK RSGRYDEDAL 
KQLNHFLRDW RSQEQTVMDR QLFDILWEVY RDVDAKQPIQ IISAYRSPAT NAMLRRRSSG
VARHSQHMQG HAMDFFIPGV ALEQIRFAGL RLQRGGVGFY PTSGSPFVHL DTGGIRHWPR
MTPDQLARVF PDGRTVHIPT NGKPLRGYEL ALADIEKRRD GSTVAPAKTN FLATLFGGKS
RDDEDETAAT AAPSGAKPMA DIKAAAADAV AAAAGVKPAD VGSSDPVPMP RAKPAAAIQI
ASAGDVVLPA PRPAQAAKAE AKTAEPRTAE SKTADAKPQS PADIINARGF WDDIPVAPKQ
ASPAQVAAIS ARQALAAADK SEQAAAMNAL AYAPMAQENS SKHAPTRHPH VVTASAPLPP
TRASLQRQAA VSGKVDSVIG KSSGQGKTVI ATSARLAAAG SRDNDVWIRA MILMPRAMHT
AATVIGDPDM TLLSGYLAKP EATLATSFAD DPQPGLYADA FSGSAVATLT TTAFPGDASR