Gene RPC_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1805 
Symbol 
ID3972070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1962048 
End bp1963298 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID637924918 
Productphage major capsid protein, HK97 
Protein accessionYP_531683 
Protein GI90423313 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.539863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.129536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCT ACGCCGACGA TCACGCCCCG GAAACCAAGG CCGGCATCGC CGTCGACGCG 
CAGGACGAGC TGCGCGCCAC CTTCGAGGAT TTCAAATCCG CCAATGACGA GCGGCTGGCG
GAGCTGGAGC GCCGCCGCGG CGACGTGCTG TTGGAGGAGA AAGTCGACCG CATCAACGCC
GCGCTCGACG CCCAGCAACG CAAGCTCGAC GAGCTCGTGC TGCGCCAGGC CCGGCCGCAG
CTCGAGGGCC GCGGCAAGAA CGCCGGCGAT GTCGCCGGCC GCGAGCACAA GAGCGCGTTC
GAGGCCTACG TCCGTGGCGG CGACGCCGCG CCGCTGCGCG CGCTGGAAAG CAAGGCGATG
TCGGTCGGCT CCAATCCGGA CGGCGGCTAT CTGGTGCCGG TGGAGCTGGA AACCGCGATC
GGCGAGCGGC TGGCGGTGAT TTCGCCGGTG CGCGGGCTGT CCGCGGTGCG GACGATTTCC
GGCAGCGTCT ACAAGAAGCC GTTCATGACC GCGGGCCCGG CCACCGGCTG GGTCGGCGAG
ACCGACGCCC GCACCCAGAC CGCCTCGCCG ACGCTCGATG CGCTGTCGTT TCCGGCGATG
GAGCTCTACG CCATGCCGGC GGCGACCGCG ACGCTGCTGG AGGACTCGGC CATCAATCTC
GACGAGTGGC TGGCCTCCGA GATCGACCAG GTGTTCGCCG AGCAGGAGAG CACCGCCTTC
GTCAACGGCG ACGGCATCAA CAAGCCGAAG GGTTTTCTGG CCTATCCGAC GGTGGTCAAC
GCGACCTGGA GCTGGGGCAA CATCGGCAGC ATCCTGTCCG GCGCCGCCGG CGGCTTTGCG
GCGCAAAATC CCTCCGACGT GCTGGTCGAC CTGATCTACG CGCTGAAGGC CGGCTATCGG
CAGAATGCCA GCTTCGTGAT GAACCGCCGC ACCCAGGCCG CGATCCGCAA GTTCAAGGAC
TCCACCGGGG TCTATCTGTG GCAGCCGCCG GCGCAGCCCG GCGGCCGCGC CAGCCTGATC
GGCTTTCCGC TGGCCGACGC CGAGGACATG CCGGATATCG CGGCGAACTC GCTGTCGATC
GCGTTCGGCG ATTTCCGCCG CGGCTACCTG ATCGTCGACC GCCAGGGCGT CCGCGTGCTG
CGCGATCCGT ATTCCGCCAA GCCCTACGTG CTGTTCTATA CCACCAAGCG GGTCGGCGGC
GGCGTGCAGG ACTTCGACGC CATCAAGCTG CTGAAGTTCG CGGCGGGGTG A
 
Protein sequence
MSIYADDHAP ETKAGIAVDA QDELRATFED FKSANDERLA ELERRRGDVL LEEKVDRINA 
ALDAQQRKLD ELVLRQARPQ LEGRGKNAGD VAGREHKSAF EAYVRGGDAA PLRALESKAM
SVGSNPDGGY LVPVELETAI GERLAVISPV RGLSAVRTIS GSVYKKPFMT AGPATGWVGE
TDARTQTASP TLDALSFPAM ELYAMPAATA TLLEDSAINL DEWLASEIDQ VFAEQESTAF
VNGDGINKPK GFLAYPTVVN ATWSWGNIGS ILSGAAGGFA AQNPSDVLVD LIYALKAGYR
QNASFVMNRR TQAAIRKFKD STGVYLWQPP AQPGGRASLI GFPLADAEDM PDIAANSLSI
AFGDFRRGYL IVDRQGVRVL RDPYSAKPYV LFYTTKRVGG GVQDFDAIKL LKFAAG