Gene Caul_5051 details


Gene Information

Locus tag: Caul_5051
Symbol:
ID: 5902513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5450508
End bp: 5451581
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 70%
IMG OID: 641565572
Product: hypothetical protein
Protein accession: YP_001686669
Protein GI: 167649006
COG category: [S] Function unknown
COG ID: [COG4320] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
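
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5450508
end_bp = 5451581

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported Gene Length (1074 bp) and Protein Length (357 aa).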


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.684708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.0897321
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCT CGATCAAGGC GTCGGTGAAA GCTTACGAGG CTTGGCTAGA GGCGGCGCTC 
GGCGGCGACC TCGTCGAGAC CGATCTTCGC GACAAGCACA AGAAGATGCG GGACGGCGCG
TTCCCGTTCC TGCGGGCGAC CTACTGGCGG TGGGCCGAGA CCATCCTGGA GATTTGCCCC
GATCTGGCGA CCGCGCCGCC GGTGCTGGCG ATCGGCGACA CCCATGTCGA GAATTTCGGC
TGCTGGCGCG ACGCCGAAGG CCGGCTGGTC TGGGGGGCCA ACGACTTCGA CGACGCGGCG
GTCATGCCCT ATCCGCTCGA CCTGGTGCGC CTGGCGGCCA GCGCCCTGCT GGCGCGGAAG
GGCGGCGCCC TGGACTTTCG CCAGGTCTGC AACAGCATCC TGGCCGGCTA TGTTGCCGGC
CTGGCTGATC CTCGGCCGTT CATTCTGGAG CGCGCGCATG GCTGGCTGCG CGAGGCGGTG
ATGCTGTCGG AGCAAGAGCG CGCCGCGTAC TGGCCAAAGT ACGACAAGCC GGACGACCCG
TCGATCCAGC CGCGCTACCT GCGCCTCCTG CGCGAGGCCA TGCCGGATCC GACCGCCGCC
TTCGCCGCCT TTCCACGATC GGCGGGGCTG GGCAGCCTGG GCCGGCCGCG CTTCGTCGCC
CGGACGGCAT GGCGCGGCGG ACCGGTGCTG CGCGAGGCCA AGGCGGTGGT CGTCTCGGCC
TGGGTGCTGC GTCATGGCGG CGACGCGACG GTTCGGATCG CCGACATCGC CGGGGGTCGC
TTTCGGGCGC CCGATCCGCA CTACCGTGTC GCCGACGGCG TCGTGGTCCG TCGCCTGTCG
CCCAGCAGCC GAAAGATCGA GGCCAAGGAC TCGAAGGACC GGGCGCTGCT GCTGTCGCTC
GACATGCTGA CCGCCATGGG CCGTGAGATC GCCGCTTGCC ACGCCGGCGA CCGTGATCGC
GCCCCGGCGC TGGGCGAGCA CCTGCGGAGC CTGACGCCAG GCTGGCTGCA GGACCACGCC
AGGGTCGCGG CGAGTCAGGT CGAGGCGGAC CAGGCGGCCT TCTCTAAAGA ATGA
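
The GC content reported for this gene could be recomputed from the sequence text along the following lines. This is a minimal sketch: the helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as shown, in space-separated blocks of 10 bases that need to be joined before counting.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks used in
    this record) is removed before counting.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Demonstration on the first 10-base block of the gene sequence only;
# paste the full sequence to compare against the GC content field.
first_block = "ATGACCACCT"
print(gc_percent(first_block))
```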
 
Protein sequence
MTTSIKASVK AYEAWLEAAL GGDLVETDLR DKHKKMRDGA FPFLRATYWR WAETILEICP 
DLATAPPVLA IGDTHVENFG CWRDAEGRLV WGANDFDDAA VMPYPLDLVR LAASALLARK
GGALDFRQVC NSILAGYVAG LADPRPFILE RAHGWLREAV MLSEQERAAY WPKYDKPDDP
SIQPRYLRLL REAMPDPTAA FAAFPRSAGL GSLGRPRFVA RTAWRGGPVL REAKAVVVSA
WVLRHGGDAT VRIADIAGGR FRAPDPHYRV ADGVVVRRLS PSSRKIEAKD SKDRALLLSL
DMLTAMGREI AACHAGDRDR APALGEHLRS LTPGWLQDHA RVAASQVEAD QAAFSKE
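
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a plain Python illustration, not the database's own pipeline: it builds the codon table from the amino acid string that NCBI lists for translation table 11, whose codon assignments match the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop block separators and newlines
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACCACCT CGATCAAGGC GTCGGTGAAA"
print(translate(first_blocks))  # MTTSIKASVK
```

The printed peptide matches the first ten residues of the protein sequence above; passing the full gene sequence to `translate` is the natural way to check the remainder.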