Gene Caul_5212 details

Gene Information

Locus tag: Caul_5212
Symbol:
ID: 5897410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 136028
End bp: 137269
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 68%
IMG OID: 641555315
Product: replication protein C
Protein accession: YP_001676646
Protein GI: 167621861
COG category:
COG ID:
TIGRFAM ID:
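
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(136028, 137269)
print(length)                      # 1242
print(protein_length_aa(length))   # 413
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.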


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGACCG TCCAGAACGG TACGGGAGAC GTGCGTCGCC TGGCTGGCCC CCAATGGGAG 
GCCGCCCGGC TCGCCCAATC CTATGTGGGC CTGCCCGAAG GGATCAGTAA GGCCATGCTG
CTCGATCGTT TCGAGCGCGC GGCTTCGCGT CTGGGGATCA GCGACGGGGT GGTGCGCCTG
ATGCGGGCCC TGGTGCGGGT CACCTATGAG CAGGACTGGA CGGGCAAGAC CCGTCCGATC
GCCTGGCCGT CCAATGACGC CCTGGGCCAG GAGCTGCAGC GCTCGCGCGC GGCCATCCAG
CGCCTGATCC GCGCGGCCGT CTGTGCGGGC CTGGTCCACA TGAAGGACAG CGGCAATGGC
AAACGCTACG GCTACAGGGG CGAGCGCGGC GAGATCGTCG AGGCCTATGG CTTTGATCTG
TCGCCCCTGG CCGTGCGCTG GGACGAGTTC GCCGACCTCG CCGCGGCGCG GGGGCTGGAG
GAGGCTCGCC GGCGTGACCT CAAGCGCCGG CTGGGCGAGC TACGGCGCGA GATCCGCACT
GTCTGCGCCG ACGCCCTGGC GCAGGACCTG ACGGGGTTTG ACTGGCAGGA GGCGATCGAG
CGGGCCAGTG GACGCTTGCC GCGCTCCCCC ACCCTCGCCG AGCTGGAGCA CCTGCATGAC
CGCTTCGGAG CCCTGCTTGC GGCGGTTGAC AGGGCTTGGG TGGAGGGCCG GAAAACACAG
GATAGTAGGC CCAGGGGCTT CCAAAGCGAG GCCCATAAAG AACCTACAAC CCAGCCTAAA
GCTGAAAGAG CAACGTATTC GGCTTGGCGA GGAAGCGTAG CGGAATCCTC AGCGGATTCT
GACCGTGACG CCGCGCCCGC CGAGGGGTCC TTGAACGAGG AAGTCATTCC CCTGTCCCTG
GCACTGGAGG CCATCCCCGA AATCCACGAC CACCTGGACG ATGTGGCCGG GGCCGAGTGG
GAGGACTTCG TGCAGGCCGC CTATCAGGTC ACCGTGCTGC TGGGCGTGAA CCTCGCCGCC
TGGCGCGAGG CGCGCGAGAC CATGGGTCGA AACCGAGCGG CGGTGGCCGT GGCCACGGTG
CTCGCCCGTT GGAGGGACGG GGAGATCAAG AGCTCGGCCG GCGGCTATCT GCGCGCCATG
TGCGAACGTG AGCGCGTGGG GGCCCTGCAT CTGCTGCCCA GCCTCTATGG CCTGAAGGAA
CGCCACACGC CGCGCAAGTC GGCGGCGAAG CGGCGGCCAT GA
 
Protein sequence
MQTVQNGTGD VRRLAGPQWE AARLAQSYVG LPEGISKAML LDRFERAASR LGISDGVVRL 
MRALVRVTYE QDWTGKTRPI AWPSNDALGQ ELQRSRAAIQ RLIRAAVCAG LVHMKDSGNG
KRYGYRGERG EIVEAYGFDL SPLAVRWDEF ADLAAARGLE EARRRDLKRR LGELRREIRT
VCADALAQDL TGFDWQEAIE RASGRLPRSP TLAELEHLHD RFGALLAAVD RAWVEGRKTQ
DSRPRGFQSE AHKEPTTQPK AERATYSAWR GSVAESSADS DRDAAPAEGS LNEEVIPLSL
ALEAIPEIHD HLDDVAGAEW EDFVQAAYQV TVLLGVNLAA WREARETMGR NRAAVAVATV
LARWRDGEIK SSAGGYLRAM CERERVGALH LLPSLYGLKE RHTPRKSAAK RRP
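
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the record's translation table 11 assigns the same amino acids to internal codons as the standard code (the two tables differ in which start codons they allow). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGCAGACCG TCCAGAACGG TACGGGAGAC"
print(translate(clean(first_blocks)))  # MQTVQNGTGD
```

Applied to the first three blocks of the gene sequence, the sketch reproduces the first ten residues of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value that can be compared with the GC content field.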