Gene Rpal_0686 details


Gene Information

Locus tag: Rpal_0686
Symbol:
ID: 6408339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 721826
End bp: 723085
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 67%
IMG OID: 642710601
Product: phage tail protein I
Protein accession: YP_001989721
Protein GI: 192289116
COG category: [R] General function prediction only
COG ID: [COG4385] Bacteriophage P2-related tail formation protein
TIGRFAM ID: [TIGR01634] phage tail protein, P2 protein I family
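
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the gene record above.
start_bp = 721826
end_bp = 723085
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the span (1260 bp), the codon count (420), and the protein length (419 aa) agree with one another.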


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCCG ATTTCACCCC GATCCAGCCG ACGCTCAACG ATGCCGAGCG CTGCCACAGC 
CTGGTCAACG CCGCGCGCTG GCCGGTGCTG TTCGCCGAGG CCGCCAAGCT GCGCACGCTG
TGGGACCCGT GGGCATGCCC GGTCGATCAG TTGCCGCTGC TCGCGTGGGC ATGGTCGGTC
GACTTCTGGA AGGCTGAATG GCCGGAGCAC CGCAAGCGGC AAGTGATCGC CGAGTCGCGC
GCTTATCACG AAGCTAAGAC CACGGTGCTC GGCTATCGGA TGGCTCTTGG CTACGTCGAT
GCCGAGTTCG TTCGGGCCCG GCTGCCGCGA CACGCGTTCT TCGCCGGTGC TGCGCCGACC
CAGGAGTCGC ATGATCGCTG GCTCGCTGGC CTGCCGGAGA TCCGCATCTA CACGGCGGTG
TTTCCGATCA AGCGCCGGCC AGTGCGGTTC TTGCAGGCCG ACGGAACGAT CGTTCAACGC
ACCCTTAAAG GGCGCTTTGT CGGCCGGCGC TTCCGGGCGC ATTCGGCGAA GTCGATCGTG
CTCGACGGTC GGCGGCCCGA ATTGCGGCGG CCCGATGGAT CGGTGCAGCG CCTCGTGTTC
GAAGGCGTGC GGATCGACAG CTTTGGACGG CTGCTGAGCG ATCCGGAGCG CCTGGTCATT
CCGGCGCCGC TGCGCAACAC CTTCCAGGTC GGAAAGCGGC TCCGCGGTCG GTTCGTCGGC
GACGGCAAGG CGAACGGCAA GCGCGTTATA TCGCTGAGTT TCTCGACGGG CGTCGATGTA
TTCCGGCCGA ATGCCGTGTC GCCGGGCCTC ACGGCGATCG ACGTCGTGCC GCGGCAGATG
TGGGACGTGA TGCCGCCCGG CCGCGGCTGG TTCATCAGCC GGGCTCGTAA GGGCCGCGGC
ATCCAGCGGA ACAGGGCGGA CGAGCTGGCC TATCTGTCGA TCCGGCTCGC TGACGGCAGT
CGTCCGGAGT ACGGCGGGCG CATGCGCAAC CGCATCGGCC GCACGCGGAT CAGGCGCGCA
CCGTTCACCG CGACAGTGCT GGCGCATCTT GCAGGACCGC CCAAGCACGG CTTTCCGGTT
GGTCGCTTCG TCAAGGTCGG CCCGGAAGCG CGCATTGCCG AAGCAATCAA AGCGCTGGCG
GTCACACAGG CGGCTCGCGA CACCGTCTAT CTCGACATCA ATTCGGTCCG TCCGATCACC
TACGGCGATC TTGCTCGATT GCCGGAGAAC GTTCGCGCCG GTGCCATCAT CCGCATCTGA
 
Protein sequence
MTADFTPIQP TLNDAERCHS LVNAARWPVL FAEAAKLRTL WDPWACPVDQ LPLLAWAWSV 
DFWKAEWPEH RKRQVIAESR AYHEAKTTVL GYRMALGYVD AEFVRARLPR HAFFAGAAPT
QESHDRWLAG LPEIRIYTAV FPIKRRPVRF LQADGTIVQR TLKGRFVGRR FRAHSAKSIV
LDGRRPELRR PDGSVQRLVF EGVRIDSFGR LLSDPERLVI PAPLRNTFQV GKRLRGRFVG
DGKANGKRVI SLSFSTGVDV FRPNAVSPGL TAIDVVPRQM WDVMPPGRGW FISRARKGRG
IQRNRADELA YLSIRLADGS RPEYGGRMRN RIGRTRIRRA PFTATVLAHL AGPPKHGFPV
GRFVKVGPEA RIAEAIKALA VTQAARDTVY LDINSVRPIT YGDLARLPEN VRAGAIIRI
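
The blocked listings above can be turned back into plain strings and checked against each other. The sketch below is illustrative only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of the source database. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in the permitted start codons, so the standard assignment string is used.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G,
# third position varying fastest). Table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
head = clean("ATGACCGCCG ATTTCACCCC GATCCAGCCG")
print(translate(head))  # MTADFTPIQP
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the same comparison against the listed GC content and protein sequence.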