Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1601 |
Symbol | |
ID | 4571124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1816695 |
End bp | 1817939 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639766182 |
Product | hypothetical protein |
Protein accession | YP_912046 |
Protein GI | 119357402 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000118956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTTG TGATTGATCG GCGGAGGATG TTGCCTCATC AGCGGGCATT CTGGGAGTTG CCGAATTTTC TGAAGGTGCT GGTTGGAGGG TATGGGTGCG GGAAGACGCA CATTGGGGCG TTGCGGTCGA TTTATGATAG TTATGTGAAT GCGCCGGTGC CGCATTTGTA TGTGTCGCCG TCATACAAGC AGGCTCGGAA GACAGTGGTG ATTTCGATTC GGGAGTTGCT GGACGCGGCG GGTGTGCGGT ATCGGTTCAA TAAGACGAAT CATGAGTTTG CGATTGCGAA TTGGAATGGG ACGATCTGGA TTGCGAGCGG TGATGAGCCT GACAGTTTGA AGGGTCCGAA CATCGGGAGC GCGGGGATCG ATGAGCCGTT CATCCAGCAG AAGGAGGTGT TTGATATTAC GCTGTCGCGG GTGCGGCATC CGAGGGCGAA ACATCGGGAG ATTTTTCTGA CGGGGACGCC GGAGCAGTTG AATTGGGGGC ATGAGGTTTC GCAGAATGAT GAGGGTCGGT ATGATCTGGG GCTGGTGGTT GGTCGGACGG CGGATAATGT GCATTTGCCG GGTCAGTTCG TTTCGATGCT TGAGCGGGCG TATGATGAGA ATCAGCGGGC TGCGTATATG AACGGGTTGT TTGTGAACCT GACGGTTGGC AGGGTGTACA GTTATTTCGA TCGGTCGGTG CATATGGGCG GGGCTGGCCT GGGTGGTGAT GGTGCGGATG GCGAAGTGGT GGCGGGGATT GATTTCAACG TGGATCATTT GACGGCGGTG GTGTTGCGGG TGTGGGGTGA CCGGGTGCAT TGTTTCGATG AGATGGTGTT GCGTGGTTCG ACGACGTATG AGCTGGCGGA TCGGCTGTAT GAGCGGTTTC CGGGGATTCG GGTGTTTCCG GATCCGTCGG GCGGGGCGCG GCGGACGTCG GCTCCGAAGA CGGATGTGCG GATTCTGCAG GATAAGGGGT TCAGGGTGGA GATGCGGCCG AAGCAGCCGC CGGTGAAGGA CAGGGTGCAT GCGGTGCAGA AGTTGTTGCG GGAGGGTCGG TTGTCGGTGA CGGGGTGCGC GTGTCTGGTT CGTGATTTTG AGCAGGTGGT GTGGCGCGGG GGTGATATTG ATAAGGTGAC GAGGCCGGAG TTGACGCATG CCTCGGATGC GGTGGGGTAT GCGATTGAGA AGTTGTTCCC TGTTCCGCTG CCGGAGCGGG ATTATTGGCG GCAGCCGGAG CATTGGAGGG CTTAG
|
Protein sequence | MRFVIDRRRM LPHQRAFWEL PNFLKVLVGG YGCGKTHIGA LRSIYDSYVN APVPHLYVSP SYKQARKTVV ISIRELLDAA GVRYRFNKTN HEFAIANWNG TIWIASGDEP DSLKGPNIGS AGIDEPFIQQ KEVFDITLSR VRHPRAKHRE IFLTGTPEQL NWGHEVSQND EGRYDLGLVV GRTADNVHLP GQFVSMLERA YDENQRAAYM NGLFVNLTVG RVYSYFDRSV HMGGAGLGGD GADGEVVAGI DFNVDHLTAV VLRVWGDRVH CFDEMVLRGS TTYELADRLY ERFPGIRVFP DPSGGARRTS APKTDVRILQ DKGFRVEMRP KQPPVKDRVH AVQKLLREGR LSVTGCACLV RDFEQVVWRG GDIDKVTRPE LTHASDAVGY AIEKLFPVPL PERDYWRQPE HWRA
|
| |