Gene Caul_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0966 
Symbol 
ID5898421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1017032 
End bp1018381 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content66% 
IMG OID641561448 
Producthypothetical protein 
Protein accessionYP_001682594 
Protein GI167644931 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.633294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCACA CCGTTCTGGC TTTGACGACG TCGGTCGTGG CTCTGCTGTC GACGTCCGCG 
CTCTGCGCCC CGACATCCGC GCCGCTGAAC GCGGGGGCGG CGAAGGTCGA CATCACGCCC
ACCAAGGCCC AGCTCCCGAA GGACTATGAG GGGGTCAACG ATCCGATCTA TGTGCGCGCC
GCCGTGCTCG AGCATGACGG CCAGAAGGCG GCTCTGGTGA GCGTGGACAT CGGCGGAATG
CCCGACGCCG TCTGGGCGGC CGTCGTGCAG GGCGCGCAAG GCCTTGGCAT TCCGTCGGCG
AACCTGATGC TGACCGCCAC CCACTCGCAC AGCGTGCCGC GCGGCATTCC CCAGCTCGAC
GAGAAGATCG TGTCGGCCCT GCGCCAGGCC GCGGCCAAGC TGCAGCCGGC CACGATCGCC
TACGGCCAGG GCGTCTCCTA CATCAACATC AACCGCAACG TCATTGACCC GAAAACCCAT
CGCTGGTGGG AGGGCCCGAA CTACGAAGGG CCGTCGGACA AGACCGTCGC CGTGGTCAAC
ATCACCTCGA CGTCGGGCCA GCCGATCGCG GTCTACTACA ACTACGCCGT CCACGCGGTC
GTGAACGGGC AACTGGACCA GGTCAGCGGC GACATTCCCG GCGCGGCCTC GCGCTATATC
GAAGACTCCC TAGGGGGCAA CACGGTCGCG ATCTGGAGCG AAGGCGCGGC TGGAGACCAG
AACCCGATCT TCTTCCAGCA GACCTATGAC CTGCGAAATA TCCGCATCGC CGACTACGCC
AAGCGCGGGG AAGACATCAG CAACGCGATG CCGCCCGGCG GCCGTGGCCT CGACAGGAAG
GATCCCCAGG TCGCCCGGTT GATGGGCCAG CAAAGGCAGA TGGTCGATTC GATGGGTCAG
TTGCTGGGCG AGGAGGTCCT GCGCGTGGGG CGTGACGTCC TGGATCGGCC GGAGGTCAAT
CCGCGCATCT TCGGCGCCTC CAAGACGGTG ACCTGCCCCG GCCGCGTGCG GACGAACGAG
GGCCGGGCCG GCTACCCGGG AGTCTATACC GACGGCCCCG ACGTGCCGAT CCGTCTGAGC
CTGCTGAAGA TTGGCGACAT CGAGATCGGC GGGGTCAACG CCGAGGTGTT CAATCCCATC
GCCCAGCGAT TCAAGGCCAG GAGCGGATCC AGCCGCACGA TGATGGCGAC CCTGACCAAC
GGAATGGCGC CGTCTGGCTA CATCCCCCAC GACGAGGCCT TCGGTCAGAA CACCTTTGAG
GTCGTTTCCT CCAGGCTCAA GCCAGGGTGC GCCGAGACCG CGATCATCGA GGGCCTGCTG
GACCTGGAAC AAGCCTCTGA ATCGAACTAA
 
Protein sequence
MKHTVLALTT SVVALLSTSA LCAPTSAPLN AGAAKVDITP TKAQLPKDYE GVNDPIYVRA 
AVLEHDGQKA ALVSVDIGGM PDAVWAAVVQ GAQGLGIPSA NLMLTATHSH SVPRGIPQLD
EKIVSALRQA AAKLQPATIA YGQGVSYINI NRNVIDPKTH RWWEGPNYEG PSDKTVAVVN
ITSTSGQPIA VYYNYAVHAV VNGQLDQVSG DIPGAASRYI EDSLGGNTVA IWSEGAAGDQ
NPIFFQQTYD LRNIRIADYA KRGEDISNAM PPGGRGLDRK DPQVARLMGQ QRQMVDSMGQ
LLGEEVLRVG RDVLDRPEVN PRIFGASKTV TCPGRVRTNE GRAGYPGVYT DGPDVPIRLS
LLKIGDIEIG GVNAEVFNPI AQRFKARSGS SRTMMATLTN GMAPSGYIPH DEAFGQNTFE
VVSSRLKPGC AETAIIEGLL DLEQASESN