Gene Rpal_4702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4702 
Symbol 
ID6412388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5060604 
End bp5061719 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID642714581 
Productprotein of unknown function UPF0118 
Protein accessionYP_001993668 
Protein GI192293063 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.605317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAGA CGCTGGGTGG TCCATTGCGT GCGCTTCCCG AGCCTACCGG CACGCCGCTC 
CCCGACAGCC AGGACGAGCG GCCGCCGCTG ATCAGGCGGA CGGAAGTCGT CACGTTCACC
CTCGTCGCCC TGTTAGTGAT CTTGCTGGTC GGCCTGCTGT ATGTCGGCAA GCCTTTCTTC
CTGCCGATGG TGACCGCCTT CGTGGTCGGC ACCATGGTGT CGCCGGCCGC GAGCTTCTTG
GAGCGCTTTC GGATTCCGCG CGCGGTCAGC GCAGTGCTGA TCGTGACGCT CGGGCTCGGC
ACCGTGATCT TCATGATCGG GCTGATCTCT GCGCCGCTGA TCGAGTGGAG CAGCCGCATC
CCCGAGATCG GCTCATTGCT GCGCGACAAG CTGCACGTGC TCGATCGGCC GCTGCAGATG
TGGCGGCAGA TCCAGAGCTC GCTGAGCGGC TCCGAAAGCC TGCCGCAGCC GTCCGTGCAG
ATGCCGAAGA TCGACTGGGT GTTCGAGTTT CTCTCCCCGA CGCTCACCGA GGTTCTGCTG
TTTCTGGTGA TGCTGGTGCT GTTCGTGGCG GGCTGGAAAG ACCTGCGGCG CTCGCTGGTA
ATGAACTTCG CCGGGCGCGA GGCCCGGCTG CGCACGCTGC GGATCCTCAA CGAGATCGAG
GGCAGCCTCG GCGCCTATCT GCTGACGGTC ACCGTCATCA ACCTGTGCTA CGGCGCCGCC
ACCGGCCTGC TCTGCGCTGC CGCGGGAATG CCGAACCCCG CGGGCCTGGG CGCGCTGGCA
GCGGTACTGA ATTTCATCCC GATCATCGGG CCGTTCGTGA TGTTCGTGAT CATGACGGTG
GTCGGCATCA TCAGTATGCC GACGCTCGGC GCCGGCCTGC TCGCCCCGCT CGGCTTCGTG
CTGCTGACGT TCTTCGAAGG GCATTTCATC ACGCCGACCA TCATCGGGAG GCGGCTGTCG
CTGAACACGC TGGCGGTGTT CATCACCCTG GCGTTCTGGA CTTGGCTGTG GGGACCGATG
GGGAGTTTTC TAGCCTCGCC GCTGCTGATC GTCGGTCTGG TGCTCAAGGA GCATCTGATG
CCCGAGGACA GCCCGCAACT TCCCGGCAGC GACTGA
 
Protein sequence
MTQTLGGPLR ALPEPTGTPL PDSQDERPPL IRRTEVVTFT LVALLVILLV GLLYVGKPFF 
LPMVTAFVVG TMVSPAASFL ERFRIPRAVS AVLIVTLGLG TVIFMIGLIS APLIEWSSRI
PEIGSLLRDK LHVLDRPLQM WRQIQSSLSG SESLPQPSVQ MPKIDWVFEF LSPTLTEVLL
FLVMLVLFVA GWKDLRRSLV MNFAGREARL RTLRILNEIE GSLGAYLLTV TVINLCYGAA
TGLLCAAAGM PNPAGLGALA AVLNFIPIIG PFVMFVIMTV VGIISMPTLG AGLLAPLGFV
LLTFFEGHFI TPTIIGRRLS LNTLAVFITL AFWTWLWGPM GSFLASPLLI VGLVLKEHLM
PEDSPQLPGS D