Gene Rpal_3980 details


Gene Information

Locus tag: Rpal_3980
Symbol:
ID: 6411662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4271235
End bp: 4272383
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 63%
IMG OID: 642713862
Product: phage major capsid protein, HK97 family
Protein accession: YP_001992951
Protein GI: 192292346
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
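
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4271235, 4272383)
print(length, protein_length(length))  # prints: 1149 382
```

Under these assumptions the result agrees with the Gene Length (1149 bp) and Protein Length (382 aa) fields of the record.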


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCCC ACTACGAACT GGAACTGAAG GAAGCCGACA CCGACCCGGC CGAACTGGTC 
ACCAAAGCGC TCGCCGACTT CAAGGATGCC GTCGACGGCC GCTTGACCGC GATCGAAACC
AAGGCCGCGA ACGACAACAA GCTGGCCGAT CGGCTTGACC GTATCGAAGC CAAAATGAAC
CGGCCGGGCA TCGGTGCCGC GAACGACAAC GACGACCTCG AGCGTAAGGC GTTCATTTCG
TTCGCGCGTC GTGGTGTCGA ACGCATGGGC GCCGACGAAC AGAAGGCGTT GACCGTCTCG
ACCGATGCGT CCGGCGGCTA CCTCGCCCCG GAGCAGTTCG GCAACGAGCT GATCAAGTTG
TTGCGGCAAT ATTCGCCTGT CCGACAGTAT GCCAACGTGG TCAGCATCGG CGCTGCCGAG
ATCAAGTATC CCCGCCGCAC CGGCAGCACG GTGGCATCCT GGGTGGATGA AACCGAGGAT
CGCAGCGAAA GCGAGCCGAG CTTCGAACAG ATCACGATCG CGCCCTTTGA GCTGGCGACC
CATTCGGACG TGTCGACGCA GCTTCTGGAA GACAACGCCT ACAATCTCGA AGGCGAGCTC
GCAGCGGACT TCGCCGAGAC TTTCGGCATC AAGGAAGCCG CGGCGTTCGT GAAGGGCTCC
GGCGTCAAGC AGCCGACCGG CATCATGACC GCGGCGGGCA TCACCGAAGT CAAGACCGGC
GCCGCCGCGA CGTTCCCGAC CTCCAACCCG GCCGACGTGC TGATCGGCAT GTATCACGCG
CTGCCCGGCG TTCATGCCCA GAATGGCGTG TGGATGATGA ACCGCACCAC GCTCGGCACC
ATCCGCCAGT GGAAGGACGG CAACGGCCGC TATCTGGTCC TCGATCCGAT CTCGGCTGGC
GCACCGGTCA CCCTGCTTGG TCGCCCGATC GTCGAAGCGA TCGACATGGA CGACATCGGC
GCCAACAAAT ATCCGGTGCT GTTCGGCGAT CTGAAGGGCT ATCGCATCGT CGATCGCGTC
GGCCTTTCGG TGCTTCGCGA CCCGTATTCG CTCGCGACCA AGGGTCAGGT TCGGTTCCAC
GCCCGGACGC GGGTCGGCGC CGGCCTCACT CACCCCGACC GCTTCATCAA GCTCAAGGTG
GCGGCGTAA
 
Protein sequence
MKPHYELELK EADTDPAELV TKALADFKDA VDGRLTAIET KAANDNKLAD RLDRIEAKMN 
RPGIGAANDN DDLERKAFIS FARRGVERMG ADEQKALTVS TDASGGYLAP EQFGNELIKL
LRQYSPVRQY ANVVSIGAAE IKYPRRTGST VASWVDETED RSESEPSFEQ ITIAPFELAT
HSDVSTQLLE DNAYNLEGEL AADFAETFGI KEAAAFVKGS GVKQPTGIMT AAGITEVKTG
AAATFPTSNP ADVLIGMYHA LPGVHAQNGV WMMNRTTLGT IRQWKDGNGR YLVLDPISAG
APVTLLGRPI VEAIDMDDIG ANKYPVLFGD LKGYRIVDRV GLSVLRDPYS LATKGQVRFH
ARTRVGAGLT HPDRFIKLKV AA
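
The sequence blocks above are laid out in space-separated groups of ten, so they need cleaning before any analysis. The sketch below shows one possible way to strip the layout, compute a GC fraction, and translate a coding sequence. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code (the tables differ in permitted start codons, which this sketch does not model). The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not functions from an existing library.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over bases T, C, A, G.
# Table 11 uses the same assignments as the standard code (assumption noted above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as shown in the record.
FIRST_ROW = "ATGAAGCCCC ACTACGAACT GGAACTGAAG GAAGCCGACA CCGACCCGGC CGAACTGGTC"
print(translate(clean_sequence(FIRST_ROW)))  # prints: MKPHYELELKEADTDPAELV
```

The translated first row matches the first twenty residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same kind of comparison against the GC content and Protein Length fields.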