Gene Rpal_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1008 
Symbol 
ID6408663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1071230 
End bp1072321 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content69% 
IMG OID642710922 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_001990040 
Protein GI192289435 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCGC TGCGCGCGTT CGCGATTGTT GCGGCTTCAC TGCTGCTTGC CACCGGCGCG 
GCGGCGCAAG TTTCGCTGTC GCCGCCGTCT GGCCCCAACC CATTCCCGAA GCCGCTGGAG
CCGGAAAAGC CGAAGCCCAG GCCGCCGGCC CCGGCCAAGG CACCCGCCAC CGAGGCGAAG
GACAAGGCCA AGAAGCCGGG AGACAAACCC GACGCCAAGG CCGCCCCCGA GGGCGGCGCG
GCCGCGGCCG AAGACCCCAA CGTCGACCTG GTGTACGGCG CCTATCAGCG CGGCTTCTAC
AAGACCGCGT TCGAACTGGC GCAGAAGCGC GCCGCCGACA ACGACGCCAA GGCCATGACC
ATGCTGGGCG AGCTCTATGC CAATGCCCTC GGCGTGAAGC GCGACTACAA GAAGGCGGCG
GAGTGGTACT CGCGAGCGGC CGATCTCGGC GACCGCGAGG CGATGTTTGC CCTCGCCATG
GCCCGGATGG GCGGCCGCGG CGGCCCGCCG AACCGCGAGG AAGCCGCCAA ATGGCTGGCG
CAGGCTGCCA AGCTTGGCGA GCCGAAGGCA GCCTATAATC TGGCGCTGCT CTATCTCGAC
GGCCAGACCT TCCCGCAGGA CGTCAAGCGC GCTGCCGAAC TGCTGCGGAT GGCCGCCGAT
GCCGGCAACC CGGAAGCCCA ATACGCGCTG GCGACCTTCT ACAAGGAAGG TACCGGGGTA
ACCAAGAGCA TCGAGCAGTC GGTGCGGCTG CTGCAGGCCG CCGCACTGGC CGGCAACGTC
CCAGCCCAGG TCGAATACGC CATCGCGCTG TACAACGGTA CCGGCACCCC GAAGAACGAG
CCGGCCGCCG TCGCGCTGCT GCGCAAGGCG GCGCGCGCCA ACAACCCGAT CGCCCAGAAC
CGCCTCGCCC ATGTGTTGGT CTCCGGCCAG GGCGCGCCGC GCGACATCAA CGAGGCGATG
AAATGGCACC TGATCGCCAA GACCGCCGGC AAGGGCGATC TGCAGCTCGA CCAGACGCTG
GCCCAGATGT CGGCCGAGGA TCGCGCCAAG GCCGAAGAGG CGGCGCGCAC CTGGATTGGC
GGCGGCAAAT GA
 
Protein sequence
MKALRAFAIV AASLLLATGA AAQVSLSPPS GPNPFPKPLE PEKPKPRPPA PAKAPATEAK 
DKAKKPGDKP DAKAAPEGGA AAAEDPNVDL VYGAYQRGFY KTAFELAQKR AADNDAKAMT
MLGELYANAL GVKRDYKKAA EWYSRAADLG DREAMFALAM ARMGGRGGPP NREEAAKWLA
QAAKLGEPKA AYNLALLYLD GQTFPQDVKR AAELLRMAAD AGNPEAQYAL ATFYKEGTGV
TKSIEQSVRL LQAAALAGNV PAQVEYAIAL YNGTGTPKNE PAAVALLRKA ARANNPIAQN
RLAHVLVSGQ GAPRDINEAM KWHLIAKTAG KGDLQLDQTL AQMSAEDRAK AEEAARTWIG
GGK