Gene Rpal_2092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2092 
Symbol 
ID6409752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2266569 
End bp2267744 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID642711977 
Productphage portal protein, HK97 family 
Protein accessionYP_001991089 
Protein GI192290484 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGATC GTCTCAAGGC CTTTCTCACC GTCCCGGAAG CCAAGACATC GCGAACCGCG 
CAGTTACTTG CGGTGGGATT CGGCGGAGTG GCGCGATTTA CCCCGCGGGA CTACGCCGGG
CTGGCGCGAG AAGGCTACGT ACGAAATGCA ATCGTGTATC GCTGCGTGAG GCTGGTGGCG
GAGAATGCCG CAGCCTGCGT GTTCGGCGTA TTCGACGGCG CGCAGGAGAA GGAGGCACAT
CCGCTCGCGG CTTTGTTAGC GCGCCCCAAT CCTCGCCAGG ATGGCGCCGC GGTGCTGGAG
ACGCTGTATG CGCATCTTCT GCTCGCGGGC AATGCCTATA TCGAGGCGGT GACGCTCGGT
GAGGCCGTGC ACGAGCTCTA CGCGCTGCGG CCCGATCGCA TCAAACTGAT CCCGGGCGCC
GATGGCTGGG CGGAGGCGTA TGATTACAGC GTCGGCGGCC GCACCGTACG GTTCGATCAG
CATGCCGCTC CGGTTCCGCC GATCCTGCAT CTGACGTTTT TTCATCCGCT CGACGATCAT
TACGGCCTGG CGCCGCTCGA AGCCGCCGCG GTCGCGGTCG ACACCCACAA CGCGGCGGCG
CGCTGGAACA AGGCTCTGCT CGACAATTCC GCGCGGCCTT CCGGCGCGCT GGTGTACGCC
GGCCCGGAAG GCGCTGTGCT CAGCGAGAAC CAGTTCGAAC GGCTGAAACG CGAATTGGAA
CTCACCTACG AAGGTGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTCGAA
TGGACGGCGA TGGCGCTGTC GCCGAAGGAC ATGGACTTTC TTGAGGCCAA GCACGCCGCC
GCGCGCGAGA TCGCGCTGGC GTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCCGGCGAC
AACACGTTCT CGAACTATCA GGAAGCCAAC CGCAGTTTCG TGCGCCAGAC CGTGCTGCCG
CTGGCGACGC GCGTCGGCAA TGCTCTGGCG CAGTGGCTGT CGCCGCAATT CGGAGATGGC
GTGCGCCTGG TGATCGATAC CGACCGGATC GACGCGCTGT CACCCGACCG CACCGCGCTG
TGGGACCGAG TCACCCGCGC GCCGTTCCTA ACCCTGAACG AAAAGCGCGA AGCGGTCGGC
TACGCGCCGA TCGAAGGCGG GGACGGGTTG GGGTGA
 
Protein sequence
MFDRLKAFLT VPEAKTSRTA QLLAVGFGGV ARFTPRDYAG LAREGYVRNA IVYRCVRLVA 
ENAAACVFGV FDGAQEKEAH PLAALLARPN PRQDGAAVLE TLYAHLLLAG NAYIEAVTLG
EAVHELYALR PDRIKLIPGA DGWAEAYDYS VGGRTVRFDQ HAAPVPPILH LTFFHPLDDH
YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYA GPEGAVLSEN QFERLKRELE
LTYEGAANAG RPLLLEGGLE WTAMALSPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD
NTFSNYQEAN RSFVRQTVLP LATRVGNALA QWLSPQFGDG VRLVIDTDRI DALSPDRTAL
WDRVTRAPFL TLNEKREAVG YAPIEGGDGL G