Gene Information |
Locus tag | Caul_5228 |
Symbol | |
ID | 5897321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | - |
Start bp | 151048 |
End bp | 152586 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641555331 |
Product | hypothetical protein |
Protein accession | YP_001676662 |
Protein GI | 167621877 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
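The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither convention. The function names are hypothetical, chosen for illustration.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Caul_5228.
span = gene_span(151048, 152586)
residues = protein_length(span)
print(span, residues)
```

Under these assumptions the span matches the listed gene length of 1539 bp, and the derived residue count matches the listed protein length of 512 aa.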
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.298729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGACG CGGCTGACTG GAATCTTGAG CGCGAGTTGA ACTTGCGCGC GGCGTCGATC GACGGAATGC CCCGACTTTA CTTCCTCGGC CCTTTCGCCT CCCGCATCAA TTTCGCCGCT CAGCAGAACC GGGCCCTCAA CCTGATTTCC GCGCTCGAGG AGAGCAACGC CTTGGAGAAG GACAAGCCCA TCGCCGTCAT CGGCGCGGGT TTGTCAGGCG TCACCGCCGC CACCGCGCTC CATCTGCTCG GCTATGAGGT CCACCTGATC GAGGAGAAGG GCGAGATCCT GCCGCGCCAG AGCACGACCC ACCACCGGAT CGTCCATCCG ACCGTCAACG CCTGGCCGTT TTCAGCGGAC CTCCTGCCCA CCACCCAGCT GCCGTTCTTC GATTGGTGCG CGGACGTCTG TGACAAGGTG ATGGCCGAGA TCCGGCGCGA GTGGAAAGCC TTAGCCGGCG ATCGTCTGCA CGCCGACAAA CGCCTCCATC TCGGGACCCA CGTCGCAACG CATAAAATCC ACCGCGACGG GGTGACCCTG ACGGCCAAGC CGACCATTTC CACGCGATTT GGCGCGGTGA TCTTCGCGAC TGGCTTTGAA GAAGAAGCCG CGCTCAAGAA CCACAAGACC GGCACCTCAT ACTGGCGCGA CGACGCCCTC GAGCAGATCC GCGCCATCGA CACGGACGCC AGGTTCCTGG TCAGCGGCAC CGGTGATGGC GGCCTGATCG ACGCGCTTCG GCTTTGCCAC ACCGAGTTCA TGAGCGGCGC CTTGGCGCTC AACGCCGTGA CCCGTCTTTA TAAGTCGCCC CTGGCCGATG AGATCAAGGC GGCCGAACAG GCCTACCGCG ACTCCCAGGT CGAAGGGCGC GATCTGCTCC TGTGGGAGAC CTACCAGAGC GTCGCCGCAC GCTTGCCCAA GGGCTTGCGC GAGCTGCTGG ACGCCTCGCT GACCCCCCAT CGCCCGCTGG TCTATCTGGT CGGTGTGGAC CTGACGCCCG TGGCGCGTGA CGCCGCGCCG ATCCACAAGC TGTTGGTCGC CCATGCCGAA CGGGCCGGAG CGCTCGACTA TATCGACGGG GTGGTCAAGG CGAACGCGCG GGGCGTCCTG TCGATACAGC CGCGCGTGAA GGGAACCTAT GTCCCAACGC CCGCGCCCCA GTACGCGGTC ATTCGCCACG GCGCGGAAAA GCGCATCCAA AATATGCTCA AGGTCGGCCA CGAGAAGGCG CTGACGAAAT TGATAAGCAA CCAGACCGCC TTGCTGGACT CCCTGCTCTC GCCCTTCTGG CGTCGCCAGA CCTTCGTGCT GCCCAACGAC TATCCCCGGC CCGATCCGAC CGACCAGAAG TTCAGGGATT CCCGCAGGCC ACGGGCCCAG AAGATCCAGC GGATCTGGGA GCATCTGGAG GTCAGCGACG ACGCTCACGG CTACAAGCTG GAGACCTCGC TGCCAGAGGA GCCCTGGTAT CCCAAGTCAC TGTTCGGCGT GCCCGTTCAG CGCGTCGACC GACAGTTTCG CGATCGCGGA GCCCGCTAG
|
Protein sequence | MIDAADWNLE RELNLRAASI DGMPRLYFLG PFASRINFAA QQNRALNLIS ALEESNALEK DKPIAVIGAG LSGVTAATAL HLLGYEVHLI EEKGEILPRQ STTHHRIVHP TVNAWPFSAD LLPTTQLPFF DWCADVCDKV MAEIRREWKA LAGDRLHADK RLHLGTHVAT HKIHRDGVTL TAKPTISTRF GAVIFATGFE EEAALKNHKT GTSYWRDDAL EQIRAIDTDA RFLVSGTGDG GLIDALRLCH TEFMSGALAL NAVTRLYKSP LADEIKAAEQ AYRDSQVEGR DLLLWETYQS VAARLPKGLR ELLDASLTPH RPLVYLVGVD LTPVARDAAP IHKLLVAHAE RAGALDYIDG VVKANARGVL SIQPRVKGTY VPTPAPQYAV IRHGAEKRIQ NMLKVGHEKA LTKLISNQTA LLDSLLSPFW RRQTFVLPND YPRPDPTDQK FRDSRRPRAQ KIQRIWEHLE VSDDAHGYKL ETSLPEEPWY PKSLFGVPVQ RVDRQFRDRG AR
|
| |
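The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with GTG and ends with TAG, so no reverse complement is applied even though the strand is "-"), and it treats ATG, GTG, and TTG as initiator codons read as methionine, which is a simplification of the start-codon set for translation table 11. The `translate` helper is hypothetical.

```python
# Codon table built in TCAG order. Translation table 11 assigns the same
# amino acids as the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Simplifying assumption: only the most common bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks and the last block of the gene sequence above.
head = translate("GTGATCGACG CGGCTGACTG GAATCTTGAG")
tail = translate("GCCCGCTAG", is_cds_start=False)
print(head, tail)
```

The first ten residues produced this way agree with the start of the listed protein sequence (MIDAADWNLE), and the final block yields the terminal residues AR followed by the stop codon.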