Gene Rpal_4117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4117 
Symbol 
ID6411801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4412605 
End bp4414185 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content66% 
IMG OID642713999 
ProductProtein of unknown function DUF1800 
Protein accessionYP_001993088 
Protein GI192292483 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.970573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACCA ACACGGCCGG ACGGTGGAGC GCACGAGGAT CGGCGATACT GGCGATCGGC 
ATCGCCGCAA TCGTCACCGG CGGATCGGCC GGGGCCGCGG AGATCTCGGC GCACGATCTG
GCGCTGATCG ATCGGCTGAC CTGGGGCATC AACGGATCCA GCGTGGCGCA ATTCCAGAAA
CTCGGCGCCG CGCGCTGGGT GGACCAACAG CTGCACCCCA CGGCGGACAG TGCGCTGCCG
CAGCTGGTCG TTGCGCAGAT CGACGCGATG CCCGACGCCG CCGGCCTGAC GCCGGCGGCG
ATCAATGCGT TTCAGGCGCA GGGCAAGGAT GCCGATCAAC TCACTGACCC GGAGGCGCGC
AAGACAGCCA AGCAGGCGTA CCAGCAGGCG CTGAACGACC GCGCCAAGCA GGCGGCAACC
CGGTCGATCC TGCGCGCGCT CTATGCGCCC GATCAGCTCC GCGAGCGGAT GAGCTGGTTC
TGGCTGAACC ATTTCAACGT CCACCAGAGC AAGGCCGAGC TGCGCCTCCT GGTCGGCGAC
TATGAGGATC GCGCGATCCG CGCGCATGCG CTCGGCAAGT TCGGCGATCT GTTACGCGCC
ACGCTGCGGC ACCCGGCGAT GCTGCGCTAT CTCGACAATG CCGGCAACGC CAACGGCCAT
CTCAACGAGA ACTACGCCCG CGAGATCATG GAGCTGCACA CCATGGGCGT CGGCAGTGGC
TACACCCAGG CCGATGTGGA GTCGCTCGCC AAGATCCTCA CCGGCGTCGG CATCGACCTG
AAGCCCGAGG ACCCGAAGCT GAAGCCTGCG CTGGCTCCGC AGCTCGTCCG CGACGGCGCG
TTTGAGTTCA ACCCGGCGCG GCACGATTAC TCCGACAAAA CCTTCCTCGG CCACACCATC
CGCGGCAGCG GCTTTGCCGA AGTCGACGAG GCGCTCGACC TGATCGTGCA CAATCCGGCG
ACCGCGCAGC ACGTCTCGCG CAAGATCGCG ACCTACTTCG TCTCGGACGA GCCGCCGCAA
CCGCTGATCG ACAAGATGGC GAAAACCTTC ACCGCCTCCG ACGGTGATAT CGCGCAGGTG
CTGGCCACGA TGATCGCCGC GCCGGAGTTC GATGCGTCGC TGAAGACGGC GGAACGCTTC
AAGGATCCGG TTGGCTACGT CTATTCGGCG GTGCGGCTCG CTTACGACGA CAAGGTCGTG
CTCAACACCG TGCCGATCCA GCGTTGGCTC GGCCGGCTCG GCGAAGGGCT GTATCAGCGC
CAGACGCCGG ACGGCTATCC ACTGACGGCG AGCGCCTGGA ACGGCCCCGG CCAGATGATG
CTGCGGTTCG AGATTGCGCG TCAGATCGGT TCCGGTTCGG CCGGGTTGTT CAAGCCGGAG
CAGGCCGACG CCAAGGATCG GCCCGCATTT CCGCTGCTGC AGAACGCGCT GTATTTCGGC
GGGCTCAGCC GGACGCTGAG CTCGACCACG CGCGGCGCGC TCGATCAGGC GATCTCACCG
CAGGATTGGA ATACGCTGTT TCTGTCCTCG CCCGAATTCA TGGTTCGTCA ACGCGCGGAG
GCACCGCATG AACCGTCGTG A
 
Protein sequence
MRTNTAGRWS ARGSAILAIG IAAIVTGGSA GAAEISAHDL ALIDRLTWGI NGSSVAQFQK 
LGAARWVDQQ LHPTADSALP QLVVAQIDAM PDAAGLTPAA INAFQAQGKD ADQLTDPEAR
KTAKQAYQQA LNDRAKQAAT RSILRALYAP DQLRERMSWF WLNHFNVHQS KAELRLLVGD
YEDRAIRAHA LGKFGDLLRA TLRHPAMLRY LDNAGNANGH LNENYAREIM ELHTMGVGSG
YTQADVESLA KILTGVGIDL KPEDPKLKPA LAPQLVRDGA FEFNPARHDY SDKTFLGHTI
RGSGFAEVDE ALDLIVHNPA TAQHVSRKIA TYFVSDEPPQ PLIDKMAKTF TASDGDIAQV
LATMIAAPEF DASLKTAERF KDPVGYVYSA VRLAYDDKVV LNTVPIQRWL GRLGEGLYQR
QTPDGYPLTA SAWNGPGQMM LRFEIARQIG SGSAGLFKPE QADAKDRPAF PLLQNALYFG
GLSRTLSSTT RGALDQAISP QDWNTLFLSS PEFMVRQRAE APHEPS