Gene Caul_5344 details

Gene Information

Locus tag: Caul_5344
Symbol:
ID: 5897098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 54501
End bp: 55748
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 65%
IMG OID: 641550636
Product: HipA domain-containing protein
Protein accession: YP_001672122
Protein GI: 167621614
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID:
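
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values listed here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 54501, End bp 55748.
length = gene_length_bp(54501, 55748)
print(length, protein_length_aa(length))  # 1248 415
```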


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.71584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG TTGAGGTTCA CATCGACTTC TCTGCCGGCC TGCGCCGGGT CGGCACGCTC 
CACCGTCAGC CTCGGCGCGG CGGGGAAGCT GTGGTCTTCG AATATCATCC CGCCTGGTTG
GCGGACGCGG CCCGCTTTTC ACTGGAGCCC GCATTGACCC TGGGCCAGGG CGCATTCGCG
CCGGCCGCGG GCCTGTCGAT GTTCGGCTCG ATTGGCGATT CCGCGCCCGA TACCTGGGGC
CGCCGGCTGA TGCAGCGCGC CGAACGCCGC CAGGCCGAGC GTGACGGCCG CCAGGTGCGC
GCCCTTTCGG ACGCCGACTA TCTCCTGGGC GTGGCCGACG TATCCCGGCT AGGCGCGTTG
CGCTTCCGCG AGCCCGGTGA AGCCGATTTT CGGGCTCCGA CCCAAACCGG TGTGCCTGGC
CTCGTCGAGC TTGGTCGGTT GATGGGCGTC ACCGAGCGCA TTCTGCGCGA TGAAGAGACC
GACGAAGATC TCGCGATGAT CTTCGCGCCC GGCTCCTCAC TGGGCGGCGC GCGCCCCAAA
GCCTCGGTGA TCGACCAGCA TGGCAGCCTG TCGATCGCCA AGTTTCCCAA AGAGGCCGAC
GACTATAGCA TCGAGCTTTG GGAGGAGGTG GCGCTTAGAT TGGCCAAGCG TGCCGGCATC
CGCACCCCAC GTCATGAACT GGTGAAGATC GCGGACAAGT CCATTCTGCT GTCCCGACGC
TTCGACCGAG ATGGCGAGAC GCGCATTCCC TTTTTGTCAG CCTTGTCGAT GCTGGGGCTG
CGCGACGGCG AACGGGGCAG CTATCCCGAG CTGGTCGATG TGCTCACCCA ACATGGCGCC
CAGGCCAAGC AGGACGCCGT CGAGCTCTAT CGGCGCATGG TGTTCAACGT CCTGATCTCC
AACGTCGATG ACCATCTGCG AAACCACGGC TTCCTGTGGG CGGGACAAAG CGGCTGGACG
CTTTCGCCCG CCTACGACCT CAACCCCACG CCGACCGACG TCCGGCCGCG CATTCTCACG
ACCAACATCG ATCTGGATGA AGGCACCTGC GACCTGGGCC TAGTGGAATC GGTCGCTGAA
CTCTTCGGCC TGGGTCCAAA GCCCGCACGC GAGATCATCG CGCAGGTTGG CCAAGCCACC
AGGATCTGGC GCGATGTCGC CGTCGAGATC GGCGCGCGGC CAGCTGAGGT CCGCCGTATG
CAAAGCGCCT TCGAACACAC CGATCTAGAG CGGGCATTGG CGATCTGA
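
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be flattened before any per-base statistic such as GC content can be computed. The following is a minimal sketch, not part of the original record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the full sequence length.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as shown in the record (6 blocks of 10 bases).
first_row = "ATGGCTGACG TTGAGGTTCA CATCGACTTC TCTGCCGGCC TGCGCCGGGT CGGCACGCTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence through the same two functions gives a figure that can be compared with the GC content field of the record.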
 
Protein sequence
MADVEVHIDF SAGLRRVGTL HRQPRRGGEA VVFEYHPAWL ADAARFSLEP ALTLGQGAFA 
PAAGLSMFGS IGDSAPDTWG RRLMQRAERR QAERDGRQVR ALSDADYLLG VADVSRLGAL
RFREPGEADF RAPTQTGVPG LVELGRLMGV TERILRDEET DEDLAMIFAP GSSLGGARPK
ASVIDQHGSL SIAKFPKEAD DYSIELWEEV ALRLAKRAGI RTPRHELVKI ADKSILLSRR
FDRDGETRIP FLSALSMLGL RDGERGSYPE LVDVLTQHGA QAKQDAVELY RRMVFNVLIS
NVDDHLRNHG FLWAGQSGWT LSPAYDLNPT PTDVRPRILT TNIDLDEGTC DLGLVESVAE
LFGLGPKPAR EIIAQVGQAT RIWRDVAVEI GARPAEVRRM QSAFEHTDLE RALAI
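
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is illustrative and not part of the original record. It builds the codon map from the conventional TCAG ordering; table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. The name `translate` is hypothetical, and the function does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCTGACG TTGAGGTTCA C"))  # MADVEVH
print(translate("GCGATCTGA"))                # AI
```

The two printed fragments match the first seven and last two residues of the protein sequence above.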