Gene Rpal_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3989 
Symbol 
ID6411671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4275486 
End bp4277108 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content62% 
IMG OID642713871 
ProductTerminase 
Protein accessionYP_001992960 
Protein GI192292355 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCC GCTCAACGTA TCCTGATTGG CTGTTCGACG GCTCACCGAT CGACGACCCG 
CTCGGCTATG GCGAGCGCGC CGTCAATTTC CTGCGCCTAC TCAAGCACCC CAAAAGTGCG
GCGCCTAAGC GCGCGCTGAT GCTGGACGAA TGGCAGGACC GGATCGTCCG CCGCATCTAT
GGTCCGCGGG ACCAGAACGG CCACCGGATT GTCAAGACGG TGGTCCTGCT ACTCCCGCGC
GGCAACCGTA AGACGTCGTT GGCGGCGGCG CTTTCGCTTC TGCACACCAT CGGCCCGGAA
CGACGGCCCG GCGGCGAGGC AATCTTCGCA GCGGGCGACC GGCCGCAAGC AAGCCTCGGT
TTCAAAGAGG CCGCCGGCAT CATTCGGGAA GACAAGCGAC TGGTGAAAGC CACGCGTATC
TATGACGCTC ACAACAGCGT CAAGAAAATC GTCTTCAACA AGGATGGCTC TTTCCTCGAA
GCCATCAGCG GCGAAGGAGC GCCGGCCCAC GGCCGCACCC CAGCCTTCGC CTTCGTTGAC
GAACTGCACA TTTGGAAGAA CGCGGACCTC TGGACCGCGA TCAAGTCGGG CCTGCCCAAG
ACCCAAGGCT CTCTGCTGAT CATCGCGACC ACTGCCGGCC GCGGTCAGGA CAACATCGCT
CACGAGATCG TCGACCGCGC CCGCAAGGTT GCGCGCGGCG ACATCGATGA TCCGTCGTTG
CTTCCGATCC TGTTCGAAAC GCCCGATGAT GCCGATTGGA GAGACGAAGC CCTTTGGCAC
CGCGCAAATC CTGGCCTTGC ACTCGGCTAT CAGGACATTG AGGGACTGCG CCAGCTCGCG
CGCGAGGGTG AAACCAGCAT CACTGCCCGT GAGACATTCC GGCAATACAA TCTGAATGTC
TGGCTCGACC GCTCAACTGA CCCGTTCGTG GAGATGGCGG TCTATGATCA GGGCGCAGAC
CCGGTCGACC TTGAGGCGCT GAAGGGGCGC CCGTGTTGGC TCGGTGTCGA TCTCTCGTCA
CAAACCGACC TCACCGTGAT CGTTGCCGCG TGGCGCGATG ACGATGGCGG GTTCACCGTT
CTGCCCATTT TCTTCTGTCC GAAGATGAAC CTTCGCGAGC GGGAAGAGCA AACCGGTGCC
CCATATCTCG AATGGGAACG ACAAGGGCTG ATCACCGCGA CCGACGGCAA CGTGGTGGAC
TTTGATGCGG TGGAAGCCGC TATTCGCGAC CTCTGCGATC GCTTCGAAGT CACGGAGATT
GCGTTCGATC CTGCTCTTGC GCGAAGCGTG CTCAACAGCT TGCAGAAAGA CGGCTATCCA
GCGGTGGAAA TGCGCCAGGG TGCGCTCACC ATGATGCCCG CCATCGCCGA GCTTGAACGC
GCGATCGTTG CCGGCAAGTT TCGGCACGGT GGCAACCCCG TGCTACGGTT CAACTTCGCC
AATGTCGAGG TAGAGCGGAA CAAGCAACAG CACGCCGTTC GGTTCGTCAA ATCCAAGAGG
TGGTTGAGTA TCGACGGTGC GGTTGCGGCG GCGATGGCAG TCTCGCGCGC CGCGGCCGGC
GAGAGCGGCC GGTCCCTTTA CGACGATCCG GCCCTCAAAC CCGAAGATTT CGTGTGGAGC
TGA
 
Protein sequence
MTTRSTYPDW LFDGSPIDDP LGYGERAVNF LRLLKHPKSA APKRALMLDE WQDRIVRRIY 
GPRDQNGHRI VKTVVLLLPR GNRKTSLAAA LSLLHTIGPE RRPGGEAIFA AGDRPQASLG
FKEAAGIIRE DKRLVKATRI YDAHNSVKKI VFNKDGSFLE AISGEGAPAH GRTPAFAFVD
ELHIWKNADL WTAIKSGLPK TQGSLLIIAT TAGRGQDNIA HEIVDRARKV ARGDIDDPSL
LPILFETPDD ADWRDEALWH RANPGLALGY QDIEGLRQLA REGETSITAR ETFRQYNLNV
WLDRSTDPFV EMAVYDQGAD PVDLEALKGR PCWLGVDLSS QTDLTVIVAA WRDDDGGFTV
LPIFFCPKMN LREREEQTGA PYLEWERQGL ITATDGNVVD FDAVEAAIRD LCDRFEVTEI
AFDPALARSV LNSLQKDGYP AVEMRQGALT MMPAIAELER AIVAGKFRHG GNPVLRFNFA
NVEVERNKQQ HAVRFVKSKR WLSIDGAVAA AMAVSRAAAG ESGRSLYDDP ALKPEDFVWS