Gene Rpal_3975 details


Gene Information

Locus tag: Rpal_3975
Symbol:
ID: 6411657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4268318
End bp: 4269514
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 61%
IMG OID: 642713857
Product: phage portal protein, HK97 family
Protein accession: YP_001992946
Protein GI: 192292341
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
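
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 4268318
END_BP = 4269514
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumed) terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4268318..4269514 covers 1197 bp, and 1197 / 3 - 1 gives 398 residues, matching the reported lengths.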


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATTG CTTCCCGAGT TCAAAGTTGG TTCGGCCTCG AAAAGAAGGC CGGCATTGCC 
GCGCCCGAAC CATGGCTGTT TGAGCTTTTC GGCGCCCAAG CGTCCGGATC CAGCATTCGA
GTAACGCCGC GGATCGCGAT GGAGTGCGCG CCGGTCGCCT GCGCCGTCAA CGCGATCTCT
CAAGCGGTTG GCCTTCTGCC GGTCCACATC CTTAAGCGCG GCACGGATGG CGCGAAGGAT
CGCGCGCCGG AACACCCGGC CTATCGACTG CTGCACCACG AAGCGAACGA ATGGACCCCT
GCCGGCAAAC TCCGCCAAGA GGTTACCCGC GACGCTCTGC TTTATAAGCA CGGCGGCTTC
GCCGAGATCA TCCGAGTTGG AGACGGTCGG CCCTTCGAGC TCATTCGGAT CGACCCCGAA
GTCTCGCCGA TCACCGTCAC CATGACGAGT GACGGTCCGG CCTACGCCGT TCAAGAGGAC
GGCATCACCC GCCAGATCGA TCGCGCCAAC ATCCTGCATA TCCCGAGCCC TTCACTGTCG
GGCTTGGGCC TCGCGCACGA TGCGCGGAAG GTGATCGGCC TGTCACTGCT GATGGAGCGG
CACGCCGAGC GGCTATTCGC CAACGGCGCC CGCCCCTCTG GATTGCTTTC ACTCAAAGGC
AACATCAGCA CCGACACTCT GAAGAATGCC AGGGCCGCAT GGAATGCCCA GCACTCCGCT
GCAAATAGCG GCGGCACCGC CGTGTTGCCC GCGGATGTTG TTTGGCAATC TCTCACTCTT
AATTCCGCCG ACGCTCAGTT CCTTGAACTG CGCAAGTATC AGATCGAAGA GACGTCGCGC
ATTTTTCGCG TCCCTCCGCA TCTGCTCTAC GAAATGGGCC GGGCAACTTG GGGCAACAGC
GAGCAAGTTG GTCAAGAATT CCTTGACTTC TCACTGATGC ATTGGGTCTC AGCCTGGGAA
GGGGAGATTC GGTTAAAGCT ATTCGACCGC GAAGAGCGCG ACAAATACAT CGCTGAGTTC
TTCACCGATG GCTTTGCACG CGCCGATCTC GCCGCGCGAA TGGATGCCTA CAGCAAGGCT
ATCGCCGCCC GCATCCTCAG TCCGAACGAA GCTCGCGCTG CCGAGAATCG CCCGCCGTAT
TCCGGCGGCG ACCGCTTCGA AAACCCGAAC ACCACCGCTT CGGGAGCCGC CGCATGA
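
The GC content reported for this gene could be recomputed from the sequence above roughly as follows. This is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input, so the printed figure describes that line alone rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full sequence to check
# the value reported for the whole gene.
first_line = "ATGACGATTG CTTCCCGAGT TCAAAGTTGG TTCGGCCTCG AAAAGAAGGC CGGCATTGCC"
print(f"{gc_fraction(first_line):.1%}")
```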
 
Protein sequence
MTIASRVQSW FGLEKKAGIA APEPWLFELF GAQASGSSIR VTPRIAMECA PVACAVNAIS 
QAVGLLPVHI LKRGTDGAKD RAPEHPAYRL LHHEANEWTP AGKLRQEVTR DALLYKHGGF
AEIIRVGDGR PFELIRIDPE VSPITVTMTS DGPAYAVQED GITRQIDRAN ILHIPSPSLS
GLGLAHDARK VIGLSLLMER HAERLFANGA RPSGLLSLKG NISTDTLKNA RAAWNAQHSA
ANSGGTAVLP ADVVWQSLTL NSADAQFLEL RKYQIEETSR IFRVPPHLLY EMGRATWGNS
EQVGQEFLDF SLMHWVSAWE GEIRLKLFDR EERDKYIAEF FTDGFARADL AARMDAYSKA
IAARILSPNE ARAAENRPPY SGGDRFENPN TTASGAAA
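
The protein sequence above corresponds to a translation of the gene sequence with translation table 11. The sketch below illustrates this using the standard codon assignments, which table 11 shares for all sense and stop codons (it differs from the standard table only in which codons may act as start codons). The function name `translate` and the unknown-codon placeholder `X` are illustrative choices, and alternative start codons are not handled specially; this gene begins with ATG, so that does not matter here.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGATTG CTTCCCGAGT TCAAAGTTGG"))  # MTIASRVQSW
```

The ten residues printed match the first block of the protein sequence listed above.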