Gene RPB_3481 details


Gene Information

Locus tag: RPB_3481
Symbol:
ID: 3911283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3985787
End bp: 3987040
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 68%
IMG OID: 637885383
Product: Phage major capsid protein, HK97
Protein accession: YP_487087
Protein GI: 86750591
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
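
The coordinate and length fields above are related by simple arithmetic, which the following sketch reproduces. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions are consistent with the values in this record, but the page does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Protein length for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3985787, 3987040)
print(length)                     # 1254
print(protein_length_aa(length))  # 417
```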


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.137838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTACG ACATCACCCA CGCCCCCGAG ACCAAGGCCG GCATTGCCGG CGACGATGCG 
CAGCAGGTCT ACGACACGCT GATGCGCACC TTCGAGGACT ACAAGGCGGA GAACGATACC
CGGCTGCAGG CGATCGAGAA GCGGCGCGGC GACGTGCTGG CGGAGGAGAA GGTGGCGCGG
ATCGACGCGG CGCTGGATGC GCAGCAGCGC AAGCTCGACG AACTGGCGCT GAAGGCGGCG
CGGCCGCAGC TCGGCAGCGC GATCACGCCG GTTCCGGCGG CGCGCGAGCA CAAGAGTGCG
TTCGACGCCT ATATCCGCTT CGGCGACACC GCGGGGCTGC GCGCGCTCGA AACCAAGGCG
ATGTCGATCG GCTCCAATCC CGACGGCGGC TACCTGGTTC CGGACGAGCT GGAGCATACG
ATCGGCGAGC GGCTAGCGGT GGTCTCGCCG ATCCGCGCGA TCGCCGCCGT GCGGCAGATC
TCCGGCAACG TCTACAAGAA GCCGTTCATG ATCACCGGCC CCACCACCGG CTGGGTCGGC
GAGACCGCGG CGCGGCCGCA GACCGGCTCG CCGCAGCTCG ACGCGCTGTC GTTCCCGGCG
ATGGAGCTGT ATGCGATGCC GGCGGCGACC GCGAATCTGC TGGAAGACGC CGTCGTCAAT
CTCGACCAGT GGATCGCCGG CGAGGTCGAA TTGGTGTTCT CGGTGCAGGA GGGGACGGCC
TTCATCACCG GCGACGGCCT CGGCAAGCCG AAGGGCTTTC TCGCCTACCC GACGGTGGCG
AATGCCTCCT GGAGCTGGGG CAATCTCGGC ACCATCGCCT CCGGCGCCGC CGGCGCGTTC
GCCGCATCGA GCCCGTCCGA CGTGCTGATC GACCTGATCT ACGGGCTGAA GCCGGGCTAC
CGCCAGAACG CCTCGTTCGT GATGAACCGG CGCACCCAGG CGGCGGTCCG CAAGTTCAAG
GACTCCACCG GCGTCTATCT GTGGCAGCCG CCGGCGACCG TCTCCGGCCG CGCCAGCCTG
ATCGGCTTCC CGCTGGTCGA TGCCGAGGAC ATGCCGGACA TCGCCGCGAA CTCGCTCAGC
ATCGCGTTCG GCGACTTCCA GCGCGGCTAT CTGATCGTCG ACCGCCAGGG TATCCGCGTG
CTGCGCGACC CGTATTCCGC CAAGCCCTAC GTGCTGTTCT ACACCACCAA GCGCGTCGGC
GGCGGCGTGC AGGACTTCGA CGCGATCAAG CTGTTGAAGT TCGCGGCGAG TTGA
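
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence block like the one above. This is a minimal sketch, assuming the block is plain text in space-separated groups of ten bases; the helper names are hypothetical and not part of any tool behind this page.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(block):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence block as a string gives a value that can be compared with the rounded percentage in the record.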
 
Protein sequence
MDYDITHAPE TKAGIAGDDA QQVYDTLMRT FEDYKAENDT RLQAIEKRRG DVLAEEKVAR 
IDAALDAQQR KLDELALKAA RPQLGSAITP VPAAREHKSA FDAYIRFGDT AGLRALETKA
MSIGSNPDGG YLVPDELEHT IGERLAVVSP IRAIAAVRQI SGNVYKKPFM ITGPTTGWVG
ETAARPQTGS PQLDALSFPA MELYAMPAAT ANLLEDAVVN LDQWIAGEVE LVFSVQEGTA
FITGDGLGKP KGFLAYPTVA NASWSWGNLG TIASGAAGAF AASSPSDVLI DLIYGLKPGY
RQNASFVMNR RTQAAVRKFK DSTGVYLWQP PATVSGRASL IGFPLVDAED MPDIAANSLS
IAFGDFQRGY LIVDRQGIRV LRDPYSAKPY VLFYTTKRVG GGVQDFDAIK LLKFAAS
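
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows that mapping, assuming the usual codon assignments; translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons. The function name and the compact table construction are illustrative choices.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block):
    """Translate a grouped DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGACTACG ACATCACCCA CGCCCCCGAG"))  # MDYDITHAPE
```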