Gene Information |
Locus tag | Caul_3783 |
Symbol | |
ID | 5901245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4100419 |
End bp | 4103382 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564306 |
Product | TonB-dependent receptor plug |
Protein accession | YP_001685408 |
Protein GI | 167647745 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
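The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 4100419
END_BP = 4103382
GENE_LENGTH_BP = 2964
PROTEIN_LENGTH_AA = 987


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1  # the stop codon adds no residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 4103382 - 4100419 + 1 = 2964 bp, and 2964 / 3 - 1 = 987 aa.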
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00272324 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTCAAGG CTGCAAGCTT CGTTTCAACG TCGCTGATCG CCCTGGCGGC GGCGACATCC GTCTCGGCGC AAACCGTGCA GACGGATAGC GGCGCATCGA CCAAGGCGGC GGAGTCTGTC TCGGTGGAAG AGGTGGTCGT CACCGGTTCA CGGGTGCGCA CGACCTACAA TTCGCCCACG CCGGTCAATG TGGTCGGCCA GGAGCGCATG CAGCAACTGG CCATCCCCGA CGTGGCCACC GCGCTCAACC AGATTCCGTC ATTCCGCGCG ACCACCTCGG CCTCGACGAT CCTGTTCCGG GTCTCGGGCG CGATCGGCGG CAACACGCCG GACCTTCGGG GCCTTGGCAC CAGCCGCACG CTGGTGCTGG TCGATGGACG GCGCTTCGTT CCCAGTCTGG ATTCCGGCGG CGTCGACCTC AACAGCGTGC CCAACGCGCT GGTCAAGCGC ACCGAGATCG TCACCGGCGG GGCCTCGGCC GCGTACGGCG CGGACGCCGT GGCCGGGGTG GTCAACCTGA TCCTCGACAC CAAGTTCAAC GGCGTCAGGA TCGACGCCAG CACCGGCGTC AGCGAACACG GCGACGGCAA GAACTACTTC ATCTCGGCGT CGGGCGGGCG CGGCTTCGCC AGCGACCGCG GCCACATCAT CGCCGGGATC GAATATCGCG ACGACAAGGG CGTCGGCAAT TGCTTCACCC GCGACTGGTG CGCCAAGCTG ACGAACTTCG TTCCCAACCC CGGCTATATC GGCGGCGTGA GCACGAACGG CCTGCCGGCG ACGCTCGTGC TCGACAACGT CAACTTCGTC TACAGTCCGA CCGGCGTCCT GCTGAGCGCG GTGCAGACGG TCGGCGGCGT CAAGACGACG TTGGGTCAGC AGGTGGGCAA TACGGGCGCG ACCGCGCTGC CCACCGCCCT GAGGGGGCTG CAGTTCAACG CCGCCGGTTC GGCGTTGACG CCCTTCCAGT TCGGCAACTA CCTCAGCGGC ACGTTCATGC AAGGCGGCGA TCCAGCCGCC AGCAACAACT GGGGCTGGGG CAACCCGCCG CTGGTGACGC CCACCTCGCA CGCCTCGGGC CTGGTGCATG TCGACTATGA CCTGACGCCC AAGACCCAGG CCTTCGGCGA GTTCATCGTC AGCCGCACGG AGGGCGGGCC GGTGCGCACC TCGGTGCTGC TGCAGGCGCC CGCCGGCGGC TCGGCGGGGC TCGACATCAA CAATCCCTTC ATCACCCCGG CGGTGCGCGC GCAGATCCTT GGCGCCAACC CCAACATCAC GGCGATCAAT GTCAACGTCG CCGTGGCGCA GGGCGGCGAC ACAGTGGTGG CCTCCAGCAC CAACGACATC TACCGGTTCG TGACGGGGCT GAAAGGTGAT CTCTTCGGCG ATTGGCGCTG GGACGCCTCG TACGAGTACG GCCGGACCAA CAGCGAGACG ATCGTCAAGA ACACCCGTCT GGCCGCGTTC GACACCCAGG CCACCAATGC GATCACGCCG CCGGCCGGCT ATACGGGGAC GATCTACACC ACGCCCGCCG GCGCGCCGGT GATCTGCGCC TCTTCGGTGG CCAATCCGTC AGACGGCTGT CTTCCGGTCG ACCTGCTGGG CTCGAACATC ACCCCCGCGG TGCTGAGCAA GTACTTCAAG GATGAGCGGC AGACCCGCAA GATCACCCAG AACGACGTGA TGGTGAACTT CCGCGGCACG TTGTTCAGCC TTCCAGCCGG GCCGATCCAG GCCGCGTTCG GCGCGGAATA CCGTCGCGAC AGCGTCTCCG GCGACGTCGA CGCCCTGACG 
GCGGCCGGAC GGTTCGCCGC GCCCCAGGTG ACGGCCTTGC CGGAGGTCGT GCAGAAGGTG ACGGAAGGCT ACGCGGAGGC GAACATCCCG CTGCTGGCCG ACCTGCCCTT CGCCAAGTCG CTGTCGGTCG ATGTGACAGG ACGCCTGACG CACTACAGCG GCTTTGGCAG CGCCAGGCCC TGGAAGATAG GGCTCGAATA CCAGCCCAAC GACCAGATCC TGGTCCGCGT GACGCGGTCG GCCGACATCC GCGCGCCCAG CGCGGCGGAA TCGAACCCCA ACACGGTCCA GACCTTCCTG CCCCTGAACG ATCCGTTCAG CGGCAGCAAC CACCTGATCG GCGCCCCGGC CGGCGGCAAT CCCAACCTGG AGCTGGAATC GGCCAAGACC AACACCGCCG GGATCGTGCT GAAGCCGAAC TTCCTTCCGG GCTTCCACGC CTCGGTCGAC TGGTATGACA TCACCGTAAA GAACGCGATC GACGCGGTGA CGGCGCCCAA CATCCTGTCG GCCTGCGCCA CCAAGAACCT GCTGTGCAAC CTGATCACCT TCAGCGGCGC GGCCAAGGCC AGCCCGGTGG TGTCGGTGCT CTCGAACTTC CAGAACGTCG CCCAGGTTCA CGCCGAGGGT TACGAATTCC AGTCCGACTA CACGATCCCG GACGTGTGGG ACGGCGCCGT CACCTTCCAG CTCAACGCCA ACTACGTCAA AGACCTGAAG TCGATCGGCG GCACGGGCCT GGTCACCCGG ATGAACGGCG TCACCGGCAA CGCCGGCTCG CTCGCCGGCA TCGCGGGGGT GCCGAAGTAC AAGATCGACG GCCTGGTCGC CTATACGCGG CCCAGCTGGA TGGTCGCGGC CCACATGCGC TACATCCCGG AGAGCATCCT GGATCCGACC AAGATCGGAC CCAAGCAGGC GGGCTACAAC ATCAACCTCC CGACCAGCAT CATGATCAAT TCGGTCAGCT CGCGTTTCTA TCTGGACCTT TCCGGCTCGG CCCACCTGCC GTCGATCTTC GGCAGCAGCA AGACGGAACT GTACGGCGGC GTCACCAACG TCTTCGACAA GGACCAGCCG CCCGAGCTGC GCCTGTTCGG CAACCCTCTG CAGTACGACA CGGTTGGCCG CGCCTTCCGA CTGGGCATCC GGGCCGCCTG GTAG
|
Protein sequence | MLKAASFVST SLIALAAATS VSAQTVQTDS GASTKAAESV SVEEVVVTGS RVRTTYNSPT PVNVVGQERM QQLAIPDVAT ALNQIPSFRA TTSASTILFR VSGAIGGNTP DLRGLGTSRT LVLVDGRRFV PSLDSGGVDL NSVPNALVKR TEIVTGGASA AYGADAVAGV VNLILDTKFN GVRIDASTGV SEHGDGKNYF ISASGGRGFA SDRGHIIAGI EYRDDKGVGN CFTRDWCAKL TNFVPNPGYI GGVSTNGLPA TLVLDNVNFV YSPTGVLLSA VQTVGGVKTT LGQQVGNTGA TALPTALRGL QFNAAGSALT PFQFGNYLSG TFMQGGDPAA SNNWGWGNPP LVTPTSHASG LVHVDYDLTP KTQAFGEFIV SRTEGGPVRT SVLLQAPAGG SAGLDINNPF ITPAVRAQIL GANPNITAIN VNVAVAQGGD TVVASSTNDI YRFVTGLKGD LFGDWRWDAS YEYGRTNSET IVKNTRLAAF DTQATNAITP PAGYTGTIYT TPAGAPVICA SSVANPSDGC LPVDLLGSNI TPAVLSKYFK DERQTRKITQ NDVMVNFRGT LFSLPAGPIQ AAFGAEYRRD SVSGDVDALT AAGRFAAPQV TALPEVVQKV TEGYAEANIP LLADLPFAKS LSVDVTGRLT HYSGFGSARP WKIGLEYQPN DQILVRVTRS ADIRAPSAAE SNPNTVQTFL PLNDPFSGSN HLIGAPAGGN PNLELESAKT NTAGIVLKPN FLPGFHASVD WYDITVKNAI DAVTAPNILS ACATKNLLCN LITFSGAAKA SPVVSVLSNF QNVAQVHAEG YEFQSDYTIP DVWDGAVTFQ LNANYVKDLK SIGGTGLVTR MNGVTGNAGS LAGIAGVPKY KIDGLVAYTR PSWMVAAHMR YIPESILDPT KIGPKQAGYN INLPTSIMIN SVSSRFYLDL SGSAHLPSIF GSSKTELYGG VTNVFDKDQP PELRLFGNPL QYDTVGRAFR LGIRAAW
|
| |
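The gene sequence starts with TTG while the protein sequence starts with M. Under translation table 11, TTG is an accepted initiation codon and the initiating codon is read as methionine. The sketch below illustrates this on the first ten and last four codons of the record. It assumes the usual TCAG ordering of the genetic code table, in which table 11 assigns the same amino acids as the standard code. The function name and the reduced start-codon set are illustrative assumptions, not taken from the source database.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Common bacterial initiation codons; table 11 lists a few more (subset assumed).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiating codon as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in the record.
    print(translate_cds("TTGCTCAAGG CTGCAAGCTT CGTTTCAACG"))
    # Last four codons of the gene sequence, ending in the TAG stop codon.
    print(translate_cds("GCC GCC TGG TAG"))
```

The first call yields the opening residues of the listed protein (MLKAASFVST). The second call is not at the start of a gene, so no initiator substitution applies, and it yields the closing residues AAW before stopping at TAG.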