Gene Caul_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0477 
Symbol 
ID5897932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp517174 
End bp518421 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content65% 
IMG OID641560960 
ProductHipA domain-containing protein 
Protein accessionYP_001682109 
Protein GI167644446 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG TCGAGGTTCA TATCGACTTC GCGGCCGGCC CGCGCCGGGT CGGCACGCTT 
CATCGTCAGG CTCGGCGGGG CGGTGAAGCC GTCGTGTTTG AGTACCATCC CGACTGGTTG
GCGGATGCGA CCCGCTTCTC CCTGGAGCCT GCCCTGACAT TGGGCCAAGG CGCCTTCGCG
CCGGCGGCTG GCCTGACGAT GTTCGGCTCG ATCGGCGACT CCGCGCCCGA TACCTGGGGT
CGGCGGCTGA TGCAACGAGC CGAGCGGCGT CAAGCCGAAC GCGACGGGCG CCCCGTGCGC
GCGCTCTCCG ACGCCGACTA CCTGCTGGGC GTCGCCGATG TTTCCCGCCT TGGCGCATTG
CGTTTCCGCC GGCCTGGCGA AGAGGCCTTC CAGGCGCCGA CCGAGGCGGG CGTGCCGGGG
CTTGTCGAGC TAGGCCGGCT GATGGGCGTC ACCGAGCGCA TCTTGCGCGA TGAGGAAACG
GACGAGGATC TGGCGATGAT CTTCGCGCCG GGTTCCTCTC TCGGCGGGGC TCGGCCGAAG
GCGTCGGTGA TCGATCAGCA CGGTCGGTTG TCGATCGCCA AATTCCCGAA GGAGACCGAC
GACTACAGTA TTGAGCTTTG GGAAGAGGTG GCGCTAAGGC TAGCCAAGCA GGCCGGCGTT
CGTACCCCTG ATCATGAGCT GGTGGTGGTC GCCGGAAAGT CCGTTCTGCT GTCCCGGCGT
TTCGACCGGC AAGGCGAGGC TCGCATCCCC TTCCTGTCGG CTCTGTCCAT GATGGGTCTG
AAGGATGGCG AACGTGGAAG CTATCCCGAA CTCGTCGATG TCCTGACCCA GCATGGCGCC
CAGACCAAGC AGGATGCGGC TGAACTCTAC CGCCGCATGG TCTTCAACGT CCTGATCTCC
AACGTCGACG ACCACCTCCG CAACCACGGA TTCCTTTGGG CCGGCCAGGG GGGATGGGTG
CTGTCTCCGG TCTATGATCT CAATCCGACC CCGACCGATA TCAGGCCGCG CATCCTCACC
ACCAACATCG ATCTGGACGA AGGTACTTGC GACCTGGATC TGGTGCAGTC GGTCGCCGAA
CTCTTTGGAT TGGGGTTAAA GCCGGCGCGC GAGATCATCG CTGAGGTCGG CCAAGCAACA
GCCGCTTGGC GTGATGTCGC TGCGGCGGTC GGGGCGCGGC CAGCGGAAAT CCGGCGCATG
GAGAGCGCGT TTGAGCATGT CGACTCACAG AAGGCGCGAG CCCTTTAG
 
Protein sequence
MADVEVHIDF AAGPRRVGTL HRQARRGGEA VVFEYHPDWL ADATRFSLEP ALTLGQGAFA 
PAAGLTMFGS IGDSAPDTWG RRLMQRAERR QAERDGRPVR ALSDADYLLG VADVSRLGAL
RFRRPGEEAF QAPTEAGVPG LVELGRLMGV TERILRDEET DEDLAMIFAP GSSLGGARPK
ASVIDQHGRL SIAKFPKETD DYSIELWEEV ALRLAKQAGV RTPDHELVVV AGKSVLLSRR
FDRQGEARIP FLSALSMMGL KDGERGSYPE LVDVLTQHGA QTKQDAAELY RRMVFNVLIS
NVDDHLRNHG FLWAGQGGWV LSPVYDLNPT PTDIRPRILT TNIDLDEGTC DLDLVQSVAE
LFGLGLKPAR EIIAEVGQAT AAWRDVAAAV GARPAEIRRM ESAFEHVDSQ KARAL