Gene Caul_3398 details


Gene Information

Locus tag: Caul_3398
Symbol:
ID: 5900853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3669512
End bp: 3671344
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 70%
IMG OID: 641563904
Product: hypothetical protein
Protein accession: YP_001685023
Protein GI: 167647360
COG category: [S] Function unknown
COG ID: [COG4805] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
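
The start, end, gene length and protein length reported above agree with each other, and a short sketch can make that arithmetic explicit. It assumes the coordinates are inclusive and that the final codon is a stop codon not counted in the protein length; the function name `check_gene_lengths` is illustrative and not part of any database API.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself): coordinates are
    inclusive, and the CDS ends with one stop codon that is not counted
    in the protein length.
    """
    span = end_bp - start_bp + 1           # inclusive coordinate span
    codons = gene_length_bp // 3           # codons including the stop
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0        # a CDS should be a whole number of codons
        and codons - 1 == protein_length_aa
    )


# Values taken from the Gene Information fields of this record.
print(check_gene_lengths(3669512, 3671344, 1833, 610))
```

For this record the span is 3671344 - 3669512 + 1 = 1833 bp, which is 611 codons, or 610 amino acids once the stop codon is excluded.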


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.15376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCAGC TCAATCGACG TGACGTTCTG GCGCTCGGCG CCGCGACCCT GGCCCTGGGA 
TCGGCGAGCG CGAGCCGGGC CGCCGCGCCG GGCGACACCG CCGCCGAGGC GCAGCTGTCG
GGCGTCGCCG AGGACCTGAT GCGGGAATAT CCCGAGAACG CCTCGGCCCT GGGCCTGGAC
AAGGGCGCGC GGGCGGCCCT GAAGTCCACC CTGACCGATC GATCGCTCGA AGGTCGCGCC
AAGCTGGCCG CCGCCGCCAA GGCGCGGGTC GCCAGGATGA AGGCCGTGGA CCGCAAGGGC
CTGAGCCCGG CGACCGCGCA GGACCTCGGC GTCGTCCAGA CCGCCCACGA GCTGACCGTC
GAGGGCTTCG ACTTCGCCTA TGGCGACGCC ATCGGCCTGA GCGCGCAGTG GTCCTATCGC
AACGCGCCCT ATGTCGTGGC CCAGAACACC GGGGCCTTCG TCGAGATCCC CGACTTCCTC
GACAGCCAGC ACGGGGTCGC CTCGGCGGCC GACGCCGAGG CCTATCTGAC CCGGCTGGAG
CTTTACGCCG CCCAACTGGA CGGCGAGACG GCGCGGCTGA AGCATGACGG CGCGCTGGGC
GTGGTCGCGC CCGACTTCTT GCTCGACAAG ACCCTGAAAC AGCAGAAAGG CGCCCGCGCC
CAGCCGATCG CCGACTGGGG CCTGATCACC GCCCTGGCCA GGAAGGCCAA GGACATCCCC
GGCGACCACG TCCGCCGGGC CACGGCCATC GTCGAGGGCA AAGTCGCCCC GGCCATGGAT
CGCCAGATCG CCGAACTGGC CGCCCACCGC GCCAGGGCCA CGTCCGACGC CGGGGCCTGG
AAGCTGCCCG ACGGCGAGGC CTATTACGCC TGGGCGTTGC GGGCCGGCAC GACCAGCCGA
ATGACGCCGG ACGAGGTCCA CCGGATGGGC CAGGAGCAGC TGAAGGCGCT GTTCGCGCGA
ATGGACACCC TGCTGAAGGC CCAGGGCCTG ACCCAGGGCA GCGTCGGCGC CCGGATGAAG
GCGCTGGGCG AGGATCCCAG GAACCTGTTC CCCAACACCG ACGAAGGCCG AGCCCAGATC
CTGGCCTATC TGAACGGCCG GGTGGCCGAC ATCCGCACTC GCCTGCCACG GGCCTTCGCC
ACGTTGGCGC CGGGCAATCT GCTGATCAAG CGGGTGCCGA TCGAGATCCA GGACGGCGCG
CCGGGCGGCT ATGCGGCGGC GGGCTCGATC GACGGCACGG TACCCGGCAA CTACTACATC
AACCTGCGCG ACACGAGCAT CTGGCCGCGC TACGGCCTGC CGACTCTGAC CTATCATGAG
GGCATACCGG GCCACGTCTG GCAGGGCGAA TACACCTACA AGCTGCCGCT GGTCCGGTCG
CTGCTGGCCT TCAACGCCTA TAGCGAGGGC TGGGCGCTGT ACGCGGAGCA ACTGGCCGAC
GAGCTGGGGG CCTATGACGG CGATCCGCTG GGCCAGCTGG GCTATCTGCA GTCGATCGCC
TTCCGCGCCT GTCGCCTGGT GGTCGACACC GGCATCCACG CCAAGCGCTG GACCCGGGAG
CAGGCGGTCG ACTGGTTGGT GACCACAAAT GGCTCGACCC GCGAGGAGGT GCAAGGCGAG
GTCGACCGCT ACTGCGCCTG GCCCGGCCAG GCCTGCGGCT ACAAGGTGGG CCACAGCGAG
ATCATCCGTC TGCGAACCAA GGCCCAGGCC GCGCTCGGCC GCCGCTTCGA CCTGCGCGCC
TTCGACGACG CGGTGGTGAT GGGCGGCAAT GTCCCGCTAA CCCAGCTGGA GGGCGTGATC
GGCGCCTATG TGGCGAGGCG GCGGGCTGCT TAG
 
Protein sequence
MSQLNRRDVL ALGAATLALG SASASRAAAP GDTAAEAQLS GVAEDLMREY PENASALGLD 
KGARAALKST LTDRSLEGRA KLAAAAKARV ARMKAVDRKG LSPATAQDLG VVQTAHELTV
EGFDFAYGDA IGLSAQWSYR NAPYVVAQNT GAFVEIPDFL DSQHGVASAA DAEAYLTRLE
LYAAQLDGET ARLKHDGALG VVAPDFLLDK TLKQQKGARA QPIADWGLIT ALARKAKDIP
GDHVRRATAI VEGKVAPAMD RQIAELAAHR ARATSDAGAW KLPDGEAYYA WALRAGTTSR
MTPDEVHRMG QEQLKALFAR MDTLLKAQGL TQGSVGARMK ALGEDPRNLF PNTDEGRAQI
LAYLNGRVAD IRTRLPRAFA TLAPGNLLIK RVPIEIQDGA PGGYAAAGSI DGTVPGNYYI
NLRDTSIWPR YGLPTLTYHE GIPGHVWQGE YTYKLPLVRS LLAFNAYSEG WALYAEQLAD
ELGAYDGDPL GQLGYLQSIA FRACRLVVDT GIHAKRWTRE QAVDWLVTTN GSTREEVQGE
VDRYCAWPGQ ACGYKVGHSE IIRLRTKAQA ALGRRFDLRA FDDAVVMGGN VPLTQLEGVI
GAYVARRRAA
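
The gene begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11, GTG is a permitted initiation codon and is read as methionine when it starts a CDS. The sketch below checks this for the first ten codons only. It assumes the space-separated blocks of ten characters are purely a display convention. The helper names and the partial codon table (limited to the codons that occur in this prefix) are illustrative.

```python
# Partial codon table: only the codons that appear in the first 30 bases
# of this gene. A full translation would need the complete table.
CODONS = {
    "AGC": "S", "CAG": "Q", "CTC": "L", "AAT": "N", "CGA": "R",
    "CGT": "R", "GAC": "D", "GTT": "V", "CTG": "L",
}

# A subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def ungroup(block):
    """Remove the spaces and line breaks used to display sequences in blocks."""
    return "".join(block.split()).upper()


def translate_prefix(dna, n_codons):
    """Translate the first n_codons of a CDS, reading a start codon as M."""
    codons = [dna[i:i + 3] for i in range(0, 3 * n_codons, 3)]
    first = "M" if codons[0] in START_CODONS else CODONS[codons[0]]
    return first + "".join(CODONS[c] for c in codons[1:])


# First 30 bases and first 10 residues, copied from the sequences above.
dna = ungroup("GTGAGCCAGC TCAATCGACG TGACGTTCTG")
protein = ungroup("MSQLNRRDVL")
print(translate_prefix(dna, 10) == protein)
```

For a whole-gene check, a complete table 11 implementation from an established library would be a better choice than extending this partial table by hand.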