Gene Caul_3916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3916 
Symbol 
ID5901378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4232397 
End bp4233551 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content73% 
IMG OID641564437 
ProductHK97 family phage portal protein 
Protein accessionYP_001685539 
Protein GI167647876 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0348958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGT TCAAACCCCG CCGGCCGCGC CCCGTGGCGC CGGAGATCAA GGACTCCCGG 
GCGGCCAGGC TGATCGCCAT CACCACGGCC GGCCGGCCGC GCTGGACGCC GCGCGACTAC
GCGGCCCTGG CGTCCGAGGG CTTCGCCAAG AACCCGGTCG CCTATCGCTG CGTGCGGATG
ATCGCCGAGG CCGCCGCGGC CGTGCCGCTG ACGGTGTTCG TCGGCGGCCA GCGGGCCGAC
GACCACCCGT TGAGAAAGCT GCTCCAGGCC CCCAACCGAG AGCAGGGCGG GGCCGATCTG
ATGGAGGCGT TCTTCGGGCA TCTGCAGGTG GCCGGGAACG GCTACCTGGA GGCGTCCGGA
GACGACGCGC CCACCGAGCT CTACGCCCTG CGGCCCGACC GGATGACCGT CGTCCCCGGT
CCGCGCGGCT GGCCCCTGGC CTATGACTAC CAGGCCGCCG GCCGCACCGC CCGGATCGGC
CGTGACGCCG CCGGCTGGCT GCCGGTGCTG CACCTGCGGC TGTTCAACCC CACCGACGAC
CACTACGGCT TCTCGCCGCT CGAGGCGGCC GCCTTCGCCA TCGACGTGCA CAACGCCTCC
GGGGCCTGGA ACAAGGCCCT GCTCGACAAT TCGGCCCGGC CGTCCGGCGC CCTGGTCTAC
GCCAATCGCG AGGCCGGCGA CCGGCTCTCG GCCGAGCAGT TCGAGCGGCT GAAGGCCGAG
CTGTCCGACG CCCATGCGGG CACCGCCAAC GCCGGCCGGC CGCTGCTTTT GGAAGGCGGG
CTTGACTGGC GGCCGATGTC GCTGTCGCCC GCCGACATGG ACTTCATCGC CGGCAAGCAC
GCCGCCGCCC GCGAGATCGC CCTGGCCTTC GGGGTCCCGC CCCAGCTACT CGGTATTCCT
GGCGACGCGA CCTACGCCAA CTATCGCGAG GCCAACGGGG CGTTCTGGCG ACACACCGTC
GCGCCCCTGG CCGAGCGGGC GGCGCGGGCC CTGTCGGTGT GGCTGGAGCC CAAGTTCCCC
GGCGCGAGGA TCGCCTGCGA CCTGGACGCC GTGCCGGCCC TGTCGGCCGA GCGCGACGCC
CTGTGGGCGC GGCTGGAGGG GGCGAGTTTC CTGACGGATG CCGAGCGGAG ACGGTTGGCG
GGGTTGGAGG GGTAA
 
Protein sequence
MPLFKPRRPR PVAPEIKDSR AARLIAITTA GRPRWTPRDY AALASEGFAK NPVAYRCVRM 
IAEAAAAVPL TVFVGGQRAD DHPLRKLLQA PNREQGGADL MEAFFGHLQV AGNGYLEASG
DDAPTELYAL RPDRMTVVPG PRGWPLAYDY QAAGRTARIG RDAAGWLPVL HLRLFNPTDD
HYGFSPLEAA AFAIDVHNAS GAWNKALLDN SARPSGALVY ANREAGDRLS AEQFERLKAE
LSDAHAGTAN AGRPLLLEGG LDWRPMSLSP ADMDFIAGKH AAAREIALAF GVPPQLLGIP
GDATYANYRE ANGAFWRHTV APLAERAARA LSVWLEPKFP GARIACDLDA VPALSAERDA
LWARLEGASF LTDAERRRLA GLEG