Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3916 |
Symbol | |
ID | 5901378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4232397 |
End bp | 4233551 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564437 |
Product | HK97 family phage portal protein |
Protein accession | YP_001685539 |
Protein GI | 167647876 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0348958 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGT TCAAACCCCG CCGGCCGCGC CCCGTGGCGC CGGAGATCAA GGACTCCCGG GCGGCCAGGC TGATCGCCAT CACCACGGCC GGCCGGCCGC GCTGGACGCC GCGCGACTAC GCGGCCCTGG CGTCCGAGGG CTTCGCCAAG AACCCGGTCG CCTATCGCTG CGTGCGGATG ATCGCCGAGG CCGCCGCGGC CGTGCCGCTG ACGGTGTTCG TCGGCGGCCA GCGGGCCGAC GACCACCCGT TGAGAAAGCT GCTCCAGGCC CCCAACCGAG AGCAGGGCGG GGCCGATCTG ATGGAGGCGT TCTTCGGGCA TCTGCAGGTG GCCGGGAACG GCTACCTGGA GGCGTCCGGA GACGACGCGC CCACCGAGCT CTACGCCCTG CGGCCCGACC GGATGACCGT CGTCCCCGGT CCGCGCGGCT GGCCCCTGGC CTATGACTAC CAGGCCGCCG GCCGCACCGC CCGGATCGGC CGTGACGCCG CCGGCTGGCT GCCGGTGCTG CACCTGCGGC TGTTCAACCC CACCGACGAC CACTACGGCT TCTCGCCGCT CGAGGCGGCC GCCTTCGCCA TCGACGTGCA CAACGCCTCC GGGGCCTGGA ACAAGGCCCT GCTCGACAAT TCGGCCCGGC CGTCCGGCGC CCTGGTCTAC GCCAATCGCG AGGCCGGCGA CCGGCTCTCG GCCGAGCAGT TCGAGCGGCT GAAGGCCGAG CTGTCCGACG CCCATGCGGG CACCGCCAAC GCCGGCCGGC CGCTGCTTTT GGAAGGCGGG CTTGACTGGC GGCCGATGTC GCTGTCGCCC GCCGACATGG ACTTCATCGC CGGCAAGCAC GCCGCCGCCC GCGAGATCGC CCTGGCCTTC GGGGTCCCGC CCCAGCTACT CGGTATTCCT GGCGACGCGA CCTACGCCAA CTATCGCGAG GCCAACGGGG CGTTCTGGCG ACACACCGTC GCGCCCCTGG CCGAGCGGGC GGCGCGGGCC CTGTCGGTGT GGCTGGAGCC CAAGTTCCCC GGCGCGAGGA TCGCCTGCGA CCTGGACGCC GTGCCGGCCC TGTCGGCCGA GCGCGACGCC CTGTGGGCGC GGCTGGAGGG GGCGAGTTTC CTGACGGATG CCGAGCGGAG ACGGTTGGCG GGGTTGGAGG GGTAA
|
Protein sequence | MPLFKPRRPR PVAPEIKDSR AARLIAITTA GRPRWTPRDY AALASEGFAK NPVAYRCVRM IAEAAAAVPL TVFVGGQRAD DHPLRKLLQA PNREQGGADL MEAFFGHLQV AGNGYLEASG DDAPTELYAL RPDRMTVVPG PRGWPLAYDY QAAGRTARIG RDAAGWLPVL HLRLFNPTDD HYGFSPLEAA AFAIDVHNAS GAWNKALLDN SARPSGALVY ANREAGDRLS AEQFERLKAE LSDAHAGTAN AGRPLLLEGG LDWRPMSLSP ADMDFIAGKH AAAREIALAF GVPPQLLGIP GDATYANYRE ANGAFWRHTV APLAERAARA LSVWLEPKFP GARIACDLDA VPALSAERDA LWARLEGASF LTDAERRRLA GLEG
|
| |