Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3975 |
Symbol | |
ID | 6411657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4268318 |
End bp | 4269514 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642713857 |
Product | phage portal protein, HK97 family |
Protein accession | YP_001992946 |
Protein GI | 192292341 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATTG CTTCCCGAGT TCAAAGTTGG TTCGGCCTCG AAAAGAAGGC CGGCATTGCC GCGCCCGAAC CATGGCTGTT TGAGCTTTTC GGCGCCCAAG CGTCCGGATC CAGCATTCGA GTAACGCCGC GGATCGCGAT GGAGTGCGCG CCGGTCGCCT GCGCCGTCAA CGCGATCTCT CAAGCGGTTG GCCTTCTGCC GGTCCACATC CTTAAGCGCG GCACGGATGG CGCGAAGGAT CGCGCGCCGG AACACCCGGC CTATCGACTG CTGCACCACG AAGCGAACGA ATGGACCCCT GCCGGCAAAC TCCGCCAAGA GGTTACCCGC GACGCTCTGC TTTATAAGCA CGGCGGCTTC GCCGAGATCA TCCGAGTTGG AGACGGTCGG CCCTTCGAGC TCATTCGGAT CGACCCCGAA GTCTCGCCGA TCACCGTCAC CATGACGAGT GACGGTCCGG CCTACGCCGT TCAAGAGGAC GGCATCACCC GCCAGATCGA TCGCGCCAAC ATCCTGCATA TCCCGAGCCC TTCACTGTCG GGCTTGGGCC TCGCGCACGA TGCGCGGAAG GTGATCGGCC TGTCACTGCT GATGGAGCGG CACGCCGAGC GGCTATTCGC CAACGGCGCC CGCCCCTCTG GATTGCTTTC ACTCAAAGGC AACATCAGCA CCGACACTCT GAAGAATGCC AGGGCCGCAT GGAATGCCCA GCACTCCGCT GCAAATAGCG GCGGCACCGC CGTGTTGCCC GCGGATGTTG TTTGGCAATC TCTCACTCTT AATTCCGCCG ACGCTCAGTT CCTTGAACTG CGCAAGTATC AGATCGAAGA GACGTCGCGC ATTTTTCGCG TCCCTCCGCA TCTGCTCTAC GAAATGGGCC GGGCAACTTG GGGCAACAGC GAGCAAGTTG GTCAAGAATT CCTTGACTTC TCACTGATGC ATTGGGTCTC AGCCTGGGAA GGGGAGATTC GGTTAAAGCT ATTCGACCGC GAAGAGCGCG ACAAATACAT CGCTGAGTTC TTCACCGATG GCTTTGCACG CGCCGATCTC GCCGCGCGAA TGGATGCCTA CAGCAAGGCT ATCGCCGCCC GCATCCTCAG TCCGAACGAA GCTCGCGCTG CCGAGAATCG CCCGCCGTAT TCCGGCGGCG ACCGCTTCGA AAACCCGAAC ACCACCGCTT CGGGAGCCGC CGCATGA
|
Protein sequence | MTIASRVQSW FGLEKKAGIA APEPWLFELF GAQASGSSIR VTPRIAMECA PVACAVNAIS QAVGLLPVHI LKRGTDGAKD RAPEHPAYRL LHHEANEWTP AGKLRQEVTR DALLYKHGGF AEIIRVGDGR PFELIRIDPE VSPITVTMTS DGPAYAVQED GITRQIDRAN ILHIPSPSLS GLGLAHDARK VIGLSLLMER HAERLFANGA RPSGLLSLKG NISTDTLKNA RAAWNAQHSA ANSGGTAVLP ADVVWQSLTL NSADAQFLEL RKYQIEETSR IFRVPPHLLY EMGRATWGNS EQVGQEFLDF SLMHWVSAWE GEIRLKLFDR EERDKYIAEF FTDGFARADL AARMDAYSKA IAARILSPNE ARAAENRPPY SGGDRFENPN TTASGAAA
|
| |