Gene Rpal_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3358 
Symbol 
ID6411032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3612762 
End bp3613868 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content68% 
IMG OID642713238 
ProductMu-like prophage I protein-like protein 
Protein accessionYP_001992335 
Protein GI192291730 
COG category[R] General function prediction only 
COG ID[COG4388] Mu-like prophage I protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGGAC ACCAATCCAA ACCAGTTTTG AATGTGGCGC GCGGCGTCGG TCAGCCGATC 
GCGCTGAACG CCGACGGCTC GGCGCCGGAA TGGATCATGC TGATCCCGGC CGGCGACGGC
GGTGTGATCC ACACCGTCGA TGGCCGTGGT CCGTATCGCG TCGCAGATCC GGCGGCGCTC
GCGGCGCAGA GCCTGGCGGC GGTCGGTGGC CGCGCGCCGC TCGACGAGAA CCATGCGACG
GATCTCGCTG CGCCGAATGG CGAGCCGTCG CCGGCGCGCG GCTGGATCGT CGGCGCCGAG
GCGCGTGACG GCGCCATCTG GGGGCGTATC GACTGGAACG CATCCGGCGC GGCGCTGATG
GCGGATCGTG CCTACCGGTT CATCTCTCCC GTCTTCACCC ACGACAAGGC CGGCAACGTG
CTGACGCTGC TGCGTGCCTC TCTGACCAAC GTCCCAAACC TGCGCGGCAT GGCCGCTCTG
CACCAACAGG AGAATGCAAT GGATCTGCTC GCTCAGCTGC GCGCGCTGCT CGGCCTCGAC
GACACTGCGG ACGAAGCTGC GGTGATCGCC AAGATCAAGG ATCTGAAGGG CGGCGGCGAT
GCAACCGCGA TGAACGCGGC TGTTAGCAAG GCGCTCAACG CCGCGCTGTC GCCGATCGCG
GCGGTCGTCG GCCTCGCTGC CGACGCCGAT GCCCAGGCGA TTGCGCAGGC AGTGTCGAAG
GCGGCGGCGC CCGAGGGTAA TCCGATCGTC AAGTCGCTGC AGTCCGAACT GGCGACCGTC
ACCACAAAGC TCAACGATCT GCTCGGCAGC GCCGCCAAGG AGAAGGCGAC CGCTTTCGTC
GATGGCGCGA TCCGGGATCT GCGCGTCGGC GTGAAGCCGC TGCGCGAGCA CTACATCGCG
CGCCACATGG CAGACCCCGC CGCGGTCGAG AAGGAAATCA ATTCATTCCC GAAACTCGGC
GAGTCCGGCC AGACGCTGTT GCCGACCGAT CCGCCGAAGG ACGGCCAGGT CTCGCTCAAT
GCCGAGCAGC TGACTGCCGC CAAGGTGCTG GGCATCAAGC CCGAGGACTA CGCCAAGACG
CTGGCGGCCG AACGCGCCGC AAGCTGA
 
Protein sequence
MSGHQSKPVL NVARGVGQPI ALNADGSAPE WIMLIPAGDG GVIHTVDGRG PYRVADPAAL 
AAQSLAAVGG RAPLDENHAT DLAAPNGEPS PARGWIVGAE ARDGAIWGRI DWNASGAALM
ADRAYRFISP VFTHDKAGNV LTLLRASLTN VPNLRGMAAL HQQENAMDLL AQLRALLGLD
DTADEAAVIA KIKDLKGGGD ATAMNAAVSK ALNAALSPIA AVVGLAADAD AQAIAQAVSK
AAAPEGNPIV KSLQSELATV TTKLNDLLGS AAKEKATAFV DGAIRDLRVG VKPLREHYIA
RHMADPAAVE KEINSFPKLG ESGQTLLPTD PPKDGQVSLN AEQLTAAKVL GIKPEDYAKT
LAAERAAS