Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3484 |
Symbol | |
ID | 5900939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3761999 |
End bp | 3763381 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641563990 |
Product | portal protein |
Protein accession | YP_001685109 |
Protein GI | 167647446 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTATGA ACAACAAGAA GGCGTACAAG CCTCTCGTCC TGACCCAAAA GTCCTACGAC TACACCGAGC ACGAACCGCA ACTTCAAAGC TCGATGAACT ACGTGGGTGC CTACACGGGC TTTGGTACGA AAATCGAGTG GACCAATCCC AAGGACTTTG AGAAGCGAGC AACGCGACTC TATTACCACT GCTCTATCAG CCGCCGTTGC ATCGACATTC GCGCTGAGCG CGTCGCGTCC GTGCCTATCG TGCTTGAGGG CGGCAACGCT GACAGCAAGG CACTGATCGA AAGCCCCAAC GTAATCGACG GCACGCTGCG CCAATCACTG CGCGTATGGG AAACAAACCT CGCACTAGGT GGCGATCTTT GGATGTTCCT AGATCGTCGC ATCAAAGGGG CGCCCCAGCT TCACACGTTC CGTCAGGACT ACATGGTTCA CGACGCGACC GCCGCGACCG TGACCTATGA TCCTGGCTTC CTAAAGAACC AACCACGCCC TGAATACAAG TTCGATTTCA AGAACGGTCG AAGCACGCAA GCCTATGTCG CGAAGAACGG TAAGTGGGAG AAAATCGATG GCGCACTCAT CCACATCATG GAGCACAACC CGCTATCAAG TGGTCAAGGC TCCGGTGCTG GCGACGCGGT GTTGCGCGAA GTCGATACAT GGGTTGCAGC GAACACATTG ATCGCAAGTC GCTTCCAGGC AGGCGGTCGC AAGAACGGTT TCGTCTCGGC TCCGCAGCTT CACAGCGACG AGGAAGTTGC TCAGTGGAAG GCCGCACTTC AGCAGCTATC CCAGGCGGCT ATTGGCGACA CTGGCGTGCT CGCAGGCGGC GCGACTTTCA CTAGCAACCA ACTCACATTC ACCGAGCTTG ATGTCGTCAA CATCCTAGAC GCAGCAGCTA GGACAATCGC CAACGGCTTT GCCGTCCCGG CTGTCATGCT GAACCTCGCT GGCGAGAGTT CATATGCCCG CGACCGTAGC GTTGACCGCA TCTATTACAC CAGTTGGGTG AAGCCTCGCG CTCAGTGGAT CACAGAGCAG CTTGAATCGC ACCTGAAGCG CGTTCTCGAC CCCACAATCA AGCTGGCAAT CGACGATACG CAGCTTCCAT ATTTCCAAGA CGACTTGATG GAACAGGCCA AGGCGCAAGC TGCGATTGGC TGCTTCACGG TCAACGAGAT CCGCAAAATC CTGTCCTACG AGCCCGTAGA TGGCGGCGAC GAGCTGATGA AGCCCGTAAG CGCGCCAAAG CCCGAACCAA CCGCCCCTGA GACGCCCGAC CAGGGCACTC AGGAAGCCCC AAACAACCCC CGCGAGGTGG ACTTCAACGC AGATACGAGC CGTCGTCCCA GCGAAAAGCG AGCCGCCAAA TAA
|
Protein sequence | MPMNNKKAYK PLVLTQKSYD YTEHEPQLQS SMNYVGAYTG FGTKIEWTNP KDFEKRATRL YYHCSISRRC IDIRAERVAS VPIVLEGGNA DSKALIESPN VIDGTLRQSL RVWETNLALG GDLWMFLDRR IKGAPQLHTF RQDYMVHDAT AATVTYDPGF LKNQPRPEYK FDFKNGRSTQ AYVAKNGKWE KIDGALIHIM EHNPLSSGQG SGAGDAVLRE VDTWVAANTL IASRFQAGGR KNGFVSAPQL HSDEEVAQWK AALQQLSQAA IGDTGVLAGG ATFTSNQLTF TELDVVNILD AAARTIANGF AVPAVMLNLA GESSYARDRS VDRIYYTSWV KPRAQWITEQ LESHLKRVLD PTIKLAIDDT QLPYFQDDLM EQAKAQAAIG CFTVNEIRKI LSYEPVDGGD ELMKPVSAPK PEPTAPETPD QGTQEAPNNP REVDFNADTS RRPSEKRAAK
|
| |