Gene Rpal_3056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3056 
Symbol 
ID6410727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3301883 
End bp3303148 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID642712936 
Productphage terminase, large subunit, PBSX family 
Protein accessionYP_001992037 
Protein GI192291432 
COG category[R] General function prediction only 
COG ID[COG1783] Phage terminase large subunit 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.345936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCGCG GCGCCTGGGG CGGGCGTGGC TCGGGTAAAA CCCGCACGTT CGCCAAGATG 
TCGGCTGTGC GTGGGATCGA CTTCGCGCAG GCCGGCATGG ATGGCGTGAT TGTCTGCGGC
CGCGAGTTCA TGAACTCGCT CGCTGACAGC TCGTTCGCGG AGGTGAAAGC GGCGATCCTC
TCGACGCCGT GGCTCGCAGA GCGTTACGAC GTCGGCGACA CGTACATTCG GACGAAATGT
AGGCGGATCA TCTTCGTTTT CGTCGGATTG CGGCACAATC TAGACAGCAT CAAGTCAAAA
GCGCGCATTC GGCTGCTGTG GGTCGATGAG GCGGAGCCGG TCTCCGATGA AGCTTGGAAC
ATCACAATCC CGACCGTGCG CGAAGAAGGG TCGGAAATCT GGCTGACCTG GAATCCGGAC
CGGAAGGCGA GCGCGACCAA CAAGCGCTTT CGCGAAAACC CGCCGGCTGG CGCGAAAATC
GTTGAGTTGA ATTGGCGGGA TAACCCCTAT TTCCCGGAGA TCCTGAATCG CACGCGGCTC
GACGACAAGG CGAACCGGCC CGACCAATAC GGTTGGGTGT GGGAGGGCGA GTATCGCTCC
GTGGTGGCCG GCGCCTATTA CGCGAAGGCA CTGACGCAGG CGAAAGAGCA GGGGCGGATT
ACGTTCGTCC CCCTCGATCC GCTCATGCAG GTGCGCGCCT ATCTCGATAT CGGCGGTGCC
GGCGCCAAGG CGGACGCCGC GGCGATCTGG ATTGTTCAAT TCGTCGGCCA GCGGATCAAC
GTCCTCGACT ACTACGAGGC GCAGGGCCAG CCGCTCGCGA CGCACGTTGC GTGGATGCGG
GAACGCGGCT GGGGCAAGGC GCTGGTCGTT CTGCCGCACG ACGGCGCGCA GACCGACAAG
GTGCACGCCA CGTCCTACGA AAGCGCGCTG CGTGAAGCCG GGTTCGATGT GATCGTGATC
CCGAACCAGG GCGCCGGCGC CGCCGCCGCG CGGATCGAAA CGGCTCGCCG GCACTTCCCG
CGGGTCTGGT TCAACGCCGA AACGACGGAA GCGGGGCGCG ACGCGCTCGG CTGGTACCAC
GAAAAGCGAT CGAACGATGA TCGCAACATC GGCCTCGGCC CAAACCACGA TTGGAGTTCT
CACGGCTCCG ACGGCTTCGG CCTGATGGCA ATCCACTACG ACCAACCCAA CGGGGCGCCG
CCGCCGCGGC AACCGTACAG CGGGCGGCGC AACTCAGGCG GCGGCGGATC GTGGATGGCG
GCATGA
 
Protein sequence
MYRGAWGGRG SGKTRTFAKM SAVRGIDFAQ AGMDGVIVCG REFMNSLADS SFAEVKAAIL 
STPWLAERYD VGDTYIRTKC RRIIFVFVGL RHNLDSIKSK ARIRLLWVDE AEPVSDEAWN
ITIPTVREEG SEIWLTWNPD RKASATNKRF RENPPAGAKI VELNWRDNPY FPEILNRTRL
DDKANRPDQY GWVWEGEYRS VVAGAYYAKA LTQAKEQGRI TFVPLDPLMQ VRAYLDIGGA
GAKADAAAIW IVQFVGQRIN VLDYYEAQGQ PLATHVAWMR ERGWGKALVV LPHDGAQTDK
VHATSYESAL REAGFDVIVI PNQGAGAAAA RIETARRHFP RVWFNAETTE AGRDALGWYH
EKRSNDDRNI GLGPNHDWSS HGSDGFGLMA IHYDQPNGAP PPRQPYSGRR NSGGGGSWMA
A