Gene Caul_3869 details

Gene Information

Locus tag: Caul_3869
Symbol:
ID: 5901331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4188935
End bp: 4190122
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 69%
IMG OID: 641564391
Product: hypothetical protein
Protein accession: YP_001685493
Protein GI: 167647830
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATTC TCTGGCTGCT GCCCCTGGCC GTGGCCCTCC CCACCCTGGC CCAGGCCGCC 
GAGCCGGCCC TGGGCCAGTT CCTGCTGGAC GCCCGCCTGC GCTATGAAAC CGTTTCGCAA
GACGGCCTGG CCAAGGACGC CCGCGCCCTG ACCCTTCGCA CTCGGCTGGG CTACGAGACC
GCCGCCGTTC ACGGCGTCAA GTTGCTGGTC GAGGGCGAGA ACGTCACAGC CCTGGATGGA
GACTATAACA GTCAGGTCAA CGGCAAGACC GCCTATCCGG TGATCGCCGA TCCGGAGACC
ACCCAGCTCA ACCGCCTGCA GTTGTCCTGG GCCGGCCCGC TCGGCGGCGG CGCGACGGTG
GGCCGCCAGC GCCTGATCCT CGACAACGCC CGCTTCATCG GCAATGTCGG TTTCCGCCAG
ACGGAACAGA CCTTCGATGC GGTCACCCTG GTCTATCGGC CCTCGCCCAA GCTCAGCCTG
ACCTACGCCT ATCTGGACAA GGTGCATCGC ATCCTTGGCG ACGACCACGC CCAGGGAAGC
TGGCGCAGCG ACTCCCATAT CGTCCAGATC GCCGCCAAGA CCGCGGTCGG CCAGGTTTCA
GCCAGCGCCT ATCTGCTCGA CTTCGCCAAC GCCCCGGCCC AGTCCAGCGC CACCTATGAC
ATCCGGCTGA GCGGCTCGCG CCCCTTGTCC TCCGGCCTGG CCGTCACCTA CGAAGCCCAG
TACGCCCGGC AGAGCGACTA CGGAAACAGC CCGACCCGGT TCACCCTCGA CTATCTCGAC
CTGGCCGTGG GCCTGAAGAC CAAGACTAGC GCCGTGGCCC TCGGGGTCGA GCGCCTCGAC
GGGGATGGTC GCCGGGGCTT CCAGACGCCG CTGGCCACGC TGCACGCCTT CCAGGGCTGG
GCCGACGTCT TCCTGACCAC GCCGGCCAGC GGCGTGCGCG ATCTGCAGCT GACCGCCTCG
ACCAGCGTCA CCGCCTCCAA GGCCCACCCC GTCAAGCTGC AGGCGGCCGT GCATCGGTTC
GACGCCGCCG ATGGCGACAC CCGGCTGGGC GACGAACTGG ATCTGGCCGT CTCGGCGCCG
CTGACGCCCA GGCTGTCGGC GGAACTGGCG GCGGCGGCGT TCGACGGCGA CCAACCGGCC
TTTCGCGACC GCACCAAGGT GTGGCTGACG CTCGCCTACA AGCTCTGA
 
Protein sequence
MRILWLLPLA VALPTLAQAA EPALGQFLLD ARLRYETVSQ DGLAKDARAL TLRTRLGYET 
AAVHGVKLLV EGENVTALDG DYNSQVNGKT AYPVIADPET TQLNRLQLSW AGPLGGGATV
GRQRLILDNA RFIGNVGFRQ TEQTFDAVTL VYRPSPKLSL TYAYLDKVHR ILGDDHAQGS
WRSDSHIVQI AAKTAVGQVS ASAYLLDFAN APAQSSATYD IRLSGSRPLS SGLAVTYEAQ
YARQSDYGNS PTRFTLDYLD LAVGLKTKTS AVALGVERLD GDGRRGFQTP LATLHAFQGW
ADVFLTTPAS GVRDLQLTAS TSVTASKAHP VKLQAAVHRF DAADGDTRLG DELDLAVSAP
LTPRLSAELA AAAFDGDQPA FRDRTKVWLT LAYKL