Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3709 |
Symbol | |
ID | 5901165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4004649 |
End bp | 4005833 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564220 |
Product | hypothetical protein |
Protein accession | YP_001685334 |
Protein GI | 167647671 |
COG category | [S] Function unknown |
COG ID | [COG3748] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.191189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTATG ACGTCACCAC CTGGCTGAAC CTGGCCCTGC GCTGGCTGCA CGTGATCGCC GGCGTCGCCT GGATCGGCGC CTCGTTCTAC TTCGTCTGGC TGGACAACAA CCTGCGCGCC CCCGAGCCGC CCAAGGACGG CGTGAAGGGC GAGCTATGGG CCGTGCATGG CGGCGGCTTC TACCATTCGC AGAAGTACAT GACGGCCCCG GCCCACATGC CCGACCATCT GCACTGGTTC AAATGGGAGG CCTACACCAC CTGGCTCAGC GGCTTCGCCC TGCTGATCGT GCTCTACTAT GTCGGCGCGC CGGTCTATCT GATCGACGCC TCCAAGCACG CCTTCAGCCA GCCCGGGGCC ATCGCCACGG GCCTGGCCTT CATCTTCGGC GGCCTGGCCG TCTACGAGGC CCTGTGCCGT TCGCCGCTGG GGGGGAAGCC CCGCCTGTTT GGCCTGGTCT GGTTCCTGGC CCTGACCGGC GCGGCCTATG CCCTGACCCA TCTCTTCAGC GATCGGGGCG CCTTCATCCA TGTCGGCGCG ATCATCGGCA CGGCCATGGT CGGCAACGTG TTCCTGGTGA TCATCCCCAA CCAGCGCAAG ATCGTCGCCG ACATGCTGGC CGGCCGCAAG GTCGATCCGC GCCTGGGCGC GATGGGCAAG CAGCGCTCGG TGCACAACAA CTACATGACC CTCCCGGTCA TCTTCATCAT GATCAGCAAC CACTATCCGG TGGTGACAGG TCACCAGATG GCCTGGCTGC TGCTGGCGAT GATTAGCCTG GGCGGGGTGT CGATCCGCCA CTTCTTCAAC CTGCGCCACC ACGGGATCAT CAAGCCCGAC TTCCTGTTCA TCGGGGCGAT GCTGGTGTTC GCCGTCAGCC TGATCGCCAG CTCCAGGCCC AAGCCCGCCG AGACCGTCTC GAACGTGCCC TTCCCGGTCG CCCTGGCGAT CGTCCAGAAG CACTGCGTGA TGTGTCACGC GGCCGTTCCG ACCCACAAGG GCTTCACCGC CCCGCCGAAC GGCGCGGCCT TCGACACGCC CGAAGGCCTG GCCCGCTATG CGCCCAAGAT CCGTGAACGG GCGGTCGAGA CCACCAGCAT GCCCCTTGGA AACGAGACTC ACATTACCGA TCAGGAACGC GCCCAGCTGG GCGCCTGGAT TGAGGCGGGA GCGAAGACGA AGTGA
|
Protein sequence | MDYDVTTWLN LALRWLHVIA GVAWIGASFY FVWLDNNLRA PEPPKDGVKG ELWAVHGGGF YHSQKYMTAP AHMPDHLHWF KWEAYTTWLS GFALLIVLYY VGAPVYLIDA SKHAFSQPGA IATGLAFIFG GLAVYEALCR SPLGGKPRLF GLVWFLALTG AAYALTHLFS DRGAFIHVGA IIGTAMVGNV FLVIIPNQRK IVADMLAGRK VDPRLGAMGK QRSVHNNYMT LPVIFIMISN HYPVVTGHQM AWLLLAMISL GGVSIRHFFN LRHHGIIKPD FLFIGAMLVF AVSLIASSRP KPAETVSNVP FPVALAIVQK HCVMCHAAVP THKGFTAPPN GAAFDTPEGL ARYAPKIRER AVETTSMPLG NETHITDQER AQLGAWIEAG AKTK
|
| |