Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0964 |
Symbol | |
ID | 5898419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1014911 |
End bp | 1016068 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561446 |
Product | levansucrase |
Protein accession | YP_001682592 |
Protein GI | 167644929 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.881689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAGTC TGTTTCCCTC GGCGGTCGCC GATAAGCGCG TTCCCCCGAA CCGTTGGGAG GCCGCCGATG TGGCGCGGAT CGATCGGGGC CGCATGGATG CGGCGCCGCT GATTGTCGAG GCCGACATCG TGCGCATCGC GGCGGACTTG GACATCTGGG ACGCCTGGCC CGTTCAGACG CGAGCCGGCG CGCCGGTGGA GTTTGGCGAA GGGGTGACGC TGTGGATGGC CCTGGGCGCG CCGCGATTCG AAGATCCCGA CGCCCGGCAC GGACACGCGC GCATTCATCT GCTTCAGCAC GACGCCCGAG GCTGGTCGCA CCGGGGCCTG CTGATGCCGG AAGGCTTTTC TCCCGGCAGC CGGGAATGGT CCGGATCGGC GGTGCTCGAC GCCGATCAGC GCACGCTAAC CCTCTATTTT ACCGCCACCG GTCGGGCCGG TGAAGAGACG CTGACCTTCG AGCAGAGGCT GTTCAGCGCT CGCGCGACCC TTGAGCGGTC CGGCGAGCAT TTGACGTTTT CCGGCTGGCG GGACTTGCGC GAGATCGTCT CGCGAGATCC CGAACACTAC ATGGCCAGCG ACGGCGGCGT GGGCGTCATA GGGACGATCA AGGCCTTCCG CGACCCGGCC TATTTCCACG ACCCCCGGGA TGGCCGCCAC TACCTGTTCT TCGCCGGCTC GGCGGCCGGG GCGGGATCGG AGTTCAACGG GGTGATCGGA GCGGCCGTGT CTCAATCGGG GGAGGCGGGC GATTGGCGCC TTGCGCCGCC GCTGATCGAC GCCACCGACG TCAACAATGA GCTGGAGCGG CCGCATGTCA TCATGGCCGG CGGCCTGTAC TACATGTTCT GGTCAACCCA GACCCATGTG TTCGCGCCGA ACCTGAGGCA CGCGCCCACG GGGCTCTACG GCATGGTCTC CAGCAGCCTA GCCGGCGGAT GGCGGCCGCT GAACGGCTCC GGACTGGTCT TGGCAAATCC GCAGGGCGCG CCGCGCCAAG CCTACAGCTG GCTGGTGCTC CCGGACCTTT CTGTGATCAG CTTCGCGGAC GACTGGGGCC GCGCGCAGGA TGCTCAGGGC GCCCGACGGT TCGGCGCCAC CTTCGCCCCG ACGTTGCGCC TGCGCCTGGC GGCCGACGTG GCTGGACTGG AGGCCTGA
|
Protein sequence | MSSLFPSAVA DKRVPPNRWE AADVARIDRG RMDAAPLIVE ADIVRIAADL DIWDAWPVQT RAGAPVEFGE GVTLWMALGA PRFEDPDARH GHARIHLLQH DARGWSHRGL LMPEGFSPGS REWSGSAVLD ADQRTLTLYF TATGRAGEET LTFEQRLFSA RATLERSGEH LTFSGWRDLR EIVSRDPEHY MASDGGVGVI GTIKAFRDPA YFHDPRDGRH YLFFAGSAAG AGSEFNGVIG AAVSQSGEAG DWRLAPPLID ATDVNNELER PHVIMAGGLY YMFWSTQTHV FAPNLRHAPT GLYGMVSSSL AGGWRPLNGS GLVLANPQGA PRQAYSWLVL PDLSVISFAD DWGRAQDAQG ARRFGATFAP TLRLRLAADV AGLEA
|
| |