Gene Caul_0964 details


Gene Information

Locus tag: Caul_0964
Symbol:
ID: 5898419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1014911
End bp: 1016068
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 68%
IMG OID: 641561446
Product: levansucrase
Protein accession: YP_001682592
Protein GI: 167644929
COG category:
COG ID:
TIGRFAM ID:
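
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS.

    Assumes the last codon is a stop codon, so it is subtracted
    from the codon count.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_0964.
length = gene_length(1014911, 1016068)
print(length, protein_length(length))  # prints "1158 385"
```

Both results match the Gene Length and Protein Length fields listed in the record.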


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.881689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAGTC TGTTTCCCTC GGCGGTCGCC GATAAGCGCG TTCCCCCGAA CCGTTGGGAG 
GCCGCCGATG TGGCGCGGAT CGATCGGGGC CGCATGGATG CGGCGCCGCT GATTGTCGAG
GCCGACATCG TGCGCATCGC GGCGGACTTG GACATCTGGG ACGCCTGGCC CGTTCAGACG
CGAGCCGGCG CGCCGGTGGA GTTTGGCGAA GGGGTGACGC TGTGGATGGC CCTGGGCGCG
CCGCGATTCG AAGATCCCGA CGCCCGGCAC GGACACGCGC GCATTCATCT GCTTCAGCAC
GACGCCCGAG GCTGGTCGCA CCGGGGCCTG CTGATGCCGG AAGGCTTTTC TCCCGGCAGC
CGGGAATGGT CCGGATCGGC GGTGCTCGAC GCCGATCAGC GCACGCTAAC CCTCTATTTT
ACCGCCACCG GTCGGGCCGG TGAAGAGACG CTGACCTTCG AGCAGAGGCT GTTCAGCGCT
CGCGCGACCC TTGAGCGGTC CGGCGAGCAT TTGACGTTTT CCGGCTGGCG GGACTTGCGC
GAGATCGTCT CGCGAGATCC CGAACACTAC ATGGCCAGCG ACGGCGGCGT GGGCGTCATA
GGGACGATCA AGGCCTTCCG CGACCCGGCC TATTTCCACG ACCCCCGGGA TGGCCGCCAC
TACCTGTTCT TCGCCGGCTC GGCGGCCGGG GCGGGATCGG AGTTCAACGG GGTGATCGGA
GCGGCCGTGT CTCAATCGGG GGAGGCGGGC GATTGGCGCC TTGCGCCGCC GCTGATCGAC
GCCACCGACG TCAACAATGA GCTGGAGCGG CCGCATGTCA TCATGGCCGG CGGCCTGTAC
TACATGTTCT GGTCAACCCA GACCCATGTG TTCGCGCCGA ACCTGAGGCA CGCGCCCACG
GGGCTCTACG GCATGGTCTC CAGCAGCCTA GCCGGCGGAT GGCGGCCGCT GAACGGCTCC
GGACTGGTCT TGGCAAATCC GCAGGGCGCG CCGCGCCAAG CCTACAGCTG GCTGGTGCTC
CCGGACCTTT CTGTGATCAG CTTCGCGGAC GACTGGGGCC GCGCGCAGGA TGCTCAGGGC
GCCCGACGGT TCGGCGCCAC CTTCGCCCCG ACGTTGCGCC TGCGCCTGGC GGCCGACGTG
GCTGGACTGG AGGCCTGA
 
Protein sequence
MSSLFPSAVA DKRVPPNRWE AADVARIDRG RMDAAPLIVE ADIVRIAADL DIWDAWPVQT 
RAGAPVEFGE GVTLWMALGA PRFEDPDARH GHARIHLLQH DARGWSHRGL LMPEGFSPGS
REWSGSAVLD ADQRTLTLYF TATGRAGEET LTFEQRLFSA RATLERSGEH LTFSGWRDLR
EIVSRDPEHY MASDGGVGVI GTIKAFRDPA YFHDPRDGRH YLFFAGSAAG AGSEFNGVIG
AAVSQSGEAG DWRLAPPLID ATDVNNELER PHVIMAGGLY YMFWSTQTHV FAPNLRHAPT
GLYGMVSSSL AGGWRPLNGS GLVLANPQGA PRQAYSWLVL PDLSVISFAD DWGRAQDAQG
ARRFGATFAP TLRLRLAADV AGLEA
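
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself computes or rounds GC content is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in a DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, as printed.
first_line = "ATGTCTAGTC TGTTTCCCTC GGCGGTCGCC GATAAGCGCG TTCCCCCGAA CCGTTGGGAG"
dna = clean_sequence(first_line)
print(len(dna))          # prints 60 (six groups of ten bases)
print(gc_fraction(dna))  # GC fraction of this first line only
```

To check the whole gene, pass all lines of the gene sequence block to `clean_sequence` as one string. The cleaned result should be 1158 bases long if the record is internally consistent.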