Gene Caul_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3918 
Symbol 
ID5901380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4234138 
End bp4235457 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content72% 
IMG OID641564439 
Producthypothetical protein 
Protein accessionYP_001685541 
Protein GI167647878 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0145321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTC TGGTCGAACC CCCGAGCGAC TTGAAGCAGT TCGTGCCGAA ATCCGACGCC 
GACTGGCGCT GGTTGGAGGC GACCGGGAAT CTAAATCCCT GGACGGAGCT GGCCCCTTGG
CCGGTGGTGC AGGACGGACT GAAGACGTGG CGCGTGACGG AGGCGCACCA GAAGCCTCCC
GTTGACCCCT GGATCACCTG GCTCTTCCTA GGTGGGCGCG GCGCGGGCAA GACCTTCGCC
GGAGCCTCGT GGATCGCGAA CCAAGCCAAG CCAGGTCGAA ACCTGGCCCT GGTCGGACCC
ACCTTCCACG ACGTGCGCGA GGTGATGATC GAGGGGCCTT CGGGCATCAA GAGCCTCTAC
CTTCCCGGCG ACCGCCCCAA GTGGCAGGCC AGCCGCCGAC GCCTGGAATT TCGCAACGGC
GCGATCGCCC AGGCCTTTTC CGCCGAGGAT CCCGACGCCT TGCGCGGCCC GCAGTTCCAC
GCCGCCTGGG CCGACGAGTT CTGCGCCTGG CCGAAACCCG CCGAGACCCT GGCCATGCTG
CGCTTTGGCC TGCGCCTGGG GACCGATCCG CGCCTGGTGG TCACCACCAC GCCCCGGCCG
ATCCGCGCCC TGCGCAACCT GATCGCCGAG CCGGGCGCGG TTCAAACGCG CGCCCCGACC
TCGGCCAACG CCGACCACCT GGCGCCGGCC TTCCTGTCCA CCCTGCGGGG CCTCTATGGC
GGCACGCGCC TGGCCGCCCA GGAGCTGGAC GGCCTGATCG TCGAGGGTGA GGGCGGCCTG
TTCCGGGCCG AGGACCTGGC CCGCTGCCGG GGCGCGCCGC CGGCCGCCTT CGACCGCGTG
GTCGTGGCGA TCGACCCGCC GGCCACCGCC ACGGGCGACG CCTGCGGCAT CGTGGTCTGC
GGGCGGTTCG GCGACCGGGC GTTCGTGCTG GCCGACAGGA CCGCGAAAGG CCTGTCGCCC
AACGGCTGGG CTCGCCGCGC GGTGGACGCC GCCGTGCGGT TCGACGCCGA CGCCCTGGTG
GCCGAAGCCA ACCAGGGCGG CGACATGGTC CGCTCGGTCC TGGCCCAGGC CGCGCCGCCG
TGCCCGGTAA AACTGGTCAA GGCCTCGGTC GGCAAACGCG CCCGGGCCGA ACCGGTGGCG
GCCCTGTACG AGCAAGGCCG CGTCGTTCAC TGCGGGGCCT TCCCGGCCCT GGAGGAGGAA
CTGATGGCGC TGGGGTCGGG GGACCTTGGG CACAGTCCGG ACAGGGCGGA CGCCCTGGTC
TGGGCGTTGA GCGAGCTGAT GCTGGGGGTG GGGAAGAGGC CGCGGTTGAG CGTGTTGTAG
 
Protein sequence
MTTLVEPPSD LKQFVPKSDA DWRWLEATGN LNPWTELAPW PVVQDGLKTW RVTEAHQKPP 
VDPWITWLFL GGRGAGKTFA GASWIANQAK PGRNLALVGP TFHDVREVMI EGPSGIKSLY
LPGDRPKWQA SRRRLEFRNG AIAQAFSAED PDALRGPQFH AAWADEFCAW PKPAETLAML
RFGLRLGTDP RLVVTTTPRP IRALRNLIAE PGAVQTRAPT SANADHLAPA FLSTLRGLYG
GTRLAAQELD GLIVEGEGGL FRAEDLARCR GAPPAAFDRV VVAIDPPATA TGDACGIVVC
GRFGDRAFVL ADRTAKGLSP NGWARRAVDA AVRFDADALV AEANQGGDMV RSVLAQAAPP
CPVKLVKASV GKRARAEPVA ALYEQGRVVH CGAFPALEEE LMALGSGDLG HSPDRADALV
WALSELMLGV GKRPRLSVL