Gene Information |
Locus tag | Rpal_3056 |
Symbol | |
ID | 6410727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3301883 |
End bp | 3303148 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642712936 |
Product | phage terminase, large subunit, PBSX family |
Protein accession | YP_001992037 |
Protein GI | 192291432 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
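The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3301883
end_bp = 3303148

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length row
print(protein_length, "aa")   # compare with the Protein Length row
```

Under these assumptions the computed values match the Gene Length (1266 bp) and Protein Length (421 aa) rows.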
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.345936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCGCG GCGCCTGGGG CGGGCGTGGC TCGGGTAAAA CCCGCACGTT CGCCAAGATG TCGGCTGTGC GTGGGATCGA CTTCGCGCAG GCCGGCATGG ATGGCGTGAT TGTCTGCGGC CGCGAGTTCA TGAACTCGCT CGCTGACAGC TCGTTCGCGG AGGTGAAAGC GGCGATCCTC TCGACGCCGT GGCTCGCAGA GCGTTACGAC GTCGGCGACA CGTACATTCG GACGAAATGT AGGCGGATCA TCTTCGTTTT CGTCGGATTG CGGCACAATC TAGACAGCAT CAAGTCAAAA GCGCGCATTC GGCTGCTGTG GGTCGATGAG GCGGAGCCGG TCTCCGATGA AGCTTGGAAC ATCACAATCC CGACCGTGCG CGAAGAAGGG TCGGAAATCT GGCTGACCTG GAATCCGGAC CGGAAGGCGA GCGCGACCAA CAAGCGCTTT CGCGAAAACC CGCCGGCTGG CGCGAAAATC GTTGAGTTGA ATTGGCGGGA TAACCCCTAT TTCCCGGAGA TCCTGAATCG CACGCGGCTC GACGACAAGG CGAACCGGCC CGACCAATAC GGTTGGGTGT GGGAGGGCGA GTATCGCTCC GTGGTGGCCG GCGCCTATTA CGCGAAGGCA CTGACGCAGG CGAAAGAGCA GGGGCGGATT ACGTTCGTCC CCCTCGATCC GCTCATGCAG GTGCGCGCCT ATCTCGATAT CGGCGGTGCC GGCGCCAAGG CGGACGCCGC GGCGATCTGG ATTGTTCAAT TCGTCGGCCA GCGGATCAAC GTCCTCGACT ACTACGAGGC GCAGGGCCAG CCGCTCGCGA CGCACGTTGC GTGGATGCGG GAACGCGGCT GGGGCAAGGC GCTGGTCGTT CTGCCGCACG ACGGCGCGCA GACCGACAAG GTGCACGCCA CGTCCTACGA AAGCGCGCTG CGTGAAGCCG GGTTCGATGT GATCGTGATC CCGAACCAGG GCGCCGGCGC CGCCGCCGCG CGGATCGAAA CGGCTCGCCG GCACTTCCCG CGGGTCTGGT TCAACGCCGA AACGACGGAA GCGGGGCGCG ACGCGCTCGG CTGGTACCAC GAAAAGCGAT CGAACGATGA TCGCAACATC GGCCTCGGCC CAAACCACGA TTGGAGTTCT CACGGCTCCG ACGGCTTCGG CCTGATGGCA ATCCACTACG ACCAACCCAA CGGGGCGCCG CCGCCGCGGC AACCGTACAG CGGGCGGCGC AACTCAGGCG GCGGCGGATC GTGGATGGCG GCATGA
|
Protein sequence | MYRGAWGGRG SGKTRTFAKM SAVRGIDFAQ AGMDGVIVCG REFMNSLADS SFAEVKAAIL STPWLAERYD VGDTYIRTKC RRIIFVFVGL RHNLDSIKSK ARIRLLWVDE AEPVSDEAWN ITIPTVREEG SEIWLTWNPD RKASATNKRF RENPPAGAKI VELNWRDNPY FPEILNRTRL DDKANRPDQY GWVWEGEYRS VVAGAYYAKA LTQAKEQGRI TFVPLDPLMQ VRAYLDIGGA GAKADAAAIW IVQFVGQRIN VLDYYEAQGQ PLATHVAWMR ERGWGKALVV LPHDGAQTDK VHATSYESAL REAGFDVIVI PNQGAGAAAA RIETARRHFP RVWFNAETTE AGRDALGWYH EKRSNDDRNI GLGPNHDWSS HGSDGFGLMA IHYDQPNGAP PPRQPYSGRR NSGGGGSWMA A
|
| |
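As a quick consistency check between the two sequence rows, the following minimal sketch translates the first 30 and last 18 bases of the gene sequence. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The helper names `translate` and `gc_percent` are hypothetical and not part of any database tooling, and only short excerpts of the sequence are used here rather than the full 1266 bp.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case the rest."""
    return "".join(dna.split()).upper()

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First 30 and last 18 bases, copied from the Gene sequence row.
head = translate("ATGTACCGCG GCGCCTGGGG CGGGCGTGGC")
tail = translate("TCGTGGATGG CGGCATGA")
print(head)  # MYRGAWGGRG
print(tail)  # SWMAA*
```

The two results agree with the start (MYRGAWGGRG) and end (SWMAA) of the protein sequence row, with the trailing `*` being the TGA stop codon. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content row.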