Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0591 |
Symbol | |
ID | 5898046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 650054 |
End bp | 653152 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641561073 |
Product | TonB-dependent receptor |
Protein accession | YP_001682222 |
Protein GI | 167644559 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.880997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAAT ATTGGCTGTG CAGCGCGGCT GCAGTCTTCA TCGCGGGGGC GGCCATGGCA CAGACCGCGC CGATCGATTC GGTCGAGGAA GTCGTGGTGA CCGGTTCGCG GATCGCGCGC GCCGGTTTTG ACACGCTTGA GCCAGCAACT ACCCTTTCCG TCGAGCAGCT GCAGAACCGC AATGCGACGA ATGTCGGCGA AGTGCTTCTG CAGGTGCCTG GCTTTTCGAT GGGTCAGACG ACGACAGGCG CGCAGTCGGA ATTTTCGCCC GGCGCGTCCT ATGTCGATCG GTTCGGGCTC GGGTCTTCGC GCACCCTCAC GCTGGTGAAC GGTCGACGTT TCGTCTCGAC CAATCCGCCG TCGACCCAAA ACGGTCAGAC CCCCGGCAAT CAGGTCGACC TCAACGTCAT CTCGCCGCTG ATGGTCGAGC GGATCGAGAA CCTGGCGATC GGCGGCGCGC CAACCTATGG CACTGACGCC ATCGCCGGCG TGGTGAACAT CATCCTGCGC AAAAAGTACG ACGGCGCCAT GGCCAACGTT CAGGCCGGCG TCACCGAGCT GGGCGACAAC GAGCGAATTG CGTTTGCGGG CCTAATCGGC CGCAACTTCG CCGACGGCCG CGGCAACATC ATGCTGGCGG CATCCTATGA CAAGTCCGAT CCCGCCGACT ATCGCAAATG GCAGAGCCCT CAGAACTTCT TCGTCGGTAA CCCCCTGGCG ACGTCGGCGG CGGCCAACAT TCCCGGTCGT ACGCCCGGGA ACGATGGCCG GATCAACCCG AACGTGCCCT TCAACACCGG ACCGGCCGAC GGGATTCCCA ATTCCGTCCT GATCCACGAC TACACCATTC CCTCCGTGAG CCGCGGCGGG ATCATCCTGC CGGTGGGCAC GATCCAGCAG GCGAATTTCT ATCCGACGGG GTTTGGCGCC AACGGTCAGA CACTCATCCA GTTTGGCCCG AACGGCGATA TCGTACCCTT CAATCCCGGA TCGCCCTTCG CGCCGACCTT TGCCTCCGGG GGCGACGGCT ATCGTAACGC GATCGCCAAC GTGGTGGCCG GCACGAAGCG GAAGACCGTC AACCTCAACG CGTCCTACGA TGTCACCGAT AAGATCTCGG CCTTCTTCGA GGGGAGCTAC TTCAACGGCG GCGGCACGCT CAGCAACGCG ACGCCGAAAT ATTTCTCGCG CATCTTCGGC GGCTCGCAAG TCGTCATGGG CCCGTTGTTG GCCTCGATCA ATGACCCGCG TCTGACGCCC CAAGCCAAGG CGACGCTGCA GTCTCTGGGC GTCCAGAATT TCGAGATCTC GAAGGTCACG CAGGAAATCG GCCAAGCGTA TCCGTCGACC GACAACTCGG TCTATCGCGG CGTCGTCGGC CTGAGCGGCC AGTTTGCCGC GGTGGGCCGG ACCTTCCGGT TTGACGCGTC GCTAAATCGC GGCCGGACCG AGGGCCACGC GTTCAAGACG AGCGTGATCC AGCAGAACTT CGTCAACGCG ATGAACGTCA AGCTCGACGC GTCCGGCAAG ATCGTCTGCG ATCCCAATCC CACGCAACTG GCCACGGGCG GCGCCGTCAA ACCTATCGCC GATGCGGCCT GCGTGCCTCT AAACCTGTTC GGCGCCAATC AGGTGACCGA CGCCGCCCGC GCCTACGTCG AAGCCCGCGC CGAGGCCAAG TCCCAGCTTG ACCAAACCGA CGTCCTGGTC AACTTCGGCT CGTCGAATCT GTTCTCGCTC TGGGGGGCTG AGCCGGTCGG CTTCAGTGTC GGCCTGGAGT ACCGCAAGGA GTCTGGCGAG TTTAATCCGG ATCCGCTGCA GGCCTCCGGG CGGACGCAGG AGGCGCTCGT CAGGCCGGTC GCAGGCGAAT ACACGACCAA GGAGGTCTTC GGCGAAGTGC TTGTGCCCCT GGCCTCGCCT GGGCAGTCCA TTCCGCTGAT CGACACCCTC GAGTTCGAAG GCCGCATACG CTACGTCGAC AACTCGCTCA CCAACGGCTT CACCGCCTAC ACCTACGGCG GGCGTTATCG GCCGGTTCCA GACATCGAAT TGCGGGGCAA CTTCACCAAG TCGCTGCGCG CGCCGTCCAT CGCCGAACTG TTCACGCCGG CCTCGGTGGG CTCAGGCTTC TTCCCCGATC CCTGCGACGT CCGCAACATC ACCTCTGGTC CAAACCCGAC GGTCCGCCAG AAGAATTGCG CGGTGTTCTT CAAGGCCTAC GGGATCACCG ATCCGACGAG CTTTTTCTCA ACGACCGTCG GTGTCGCCAT CCCGATCCAG CTTGGCGGCA ATCCCAAGCT CCAGAATGAG ACCGCCAAGT CCTACACCTA CGGCGTCGTG CTGCGCCCGC GCTTCCTGCC GAAGTTCCAG GCGGCCATCG ACTGGAATCG TATCCTGGTG AACGGCAATA TCACCGCCCT GACCTCGCTG GACATCGCTC AAGGGTGTTA CGACGACCCG GACTTCAACG CGGCCAATCC GGACGCGGGC AACGCTTTCT GCTCGCTCTT CAGGCGGACG AAAGGCGGCC CACAGAACGG TCAGCTCGTG GTGGATCCGC AAAATCCCGG GCTCTCGAAC CAGTTCGTCA ACGGGGCTTC AATTCGCTTC CAGGGGCTGA CGGTCGACGC CGCCTATCGC GACATTCCGA TCAAGGCGGC GTTCGTGGAG GGCTCGCTTC GGATCGACGC CAGATTCTAC TACCTCGACA AGCTCTGTAC CTCCAACAAT GGGGTCACCA CCATCTGCCT GCAGGGCACG CACACGCAAC CGCGCTACAC GGCGCAAGTC GACGCTACCT ATGTTCAAGA GCGCTTCGCT CTGAACCTCC AGGCTAACTA TCGGCCCTCG ACGCAATACG ACCTGCTTTT CACCGAGGAG AATCAGGATG TCCTGAAGCG GGGTTCTCAG GTCCTGTTCA ATCTTGGGGC CAGCTACAGG CTCGGTGAGA ACACTCAGAT CCGGGGCGCC ATTCAGAACC TGCTGGATTC GTCTCCGCCC GGCCCGATCG CCGGTTTCAA CAACTCCTTC GGCAACACCA CGGCGGTCGG AGACATCCTG GGCCGGCGGT ACTCCGTGGC CGTGACCCAC ACCTTCTAA
|
Protein sequence | MRKYWLCSAA AVFIAGAAMA QTAPIDSVEE VVVTGSRIAR AGFDTLEPAT TLSVEQLQNR NATNVGEVLL QVPGFSMGQT TTGAQSEFSP GASYVDRFGL GSSRTLTLVN GRRFVSTNPP STQNGQTPGN QVDLNVISPL MVERIENLAI GGAPTYGTDA IAGVVNIILR KKYDGAMANV QAGVTELGDN ERIAFAGLIG RNFADGRGNI MLAASYDKSD PADYRKWQSP QNFFVGNPLA TSAAANIPGR TPGNDGRINP NVPFNTGPAD GIPNSVLIHD YTIPSVSRGG IILPVGTIQQ ANFYPTGFGA NGQTLIQFGP NGDIVPFNPG SPFAPTFASG GDGYRNAIAN VVAGTKRKTV NLNASYDVTD KISAFFEGSY FNGGGTLSNA TPKYFSRIFG GSQVVMGPLL ASINDPRLTP QAKATLQSLG VQNFEISKVT QEIGQAYPST DNSVYRGVVG LSGQFAAVGR TFRFDASLNR GRTEGHAFKT SVIQQNFVNA MNVKLDASGK IVCDPNPTQL ATGGAVKPIA DAACVPLNLF GANQVTDAAR AYVEARAEAK SQLDQTDVLV NFGSSNLFSL WGAEPVGFSV GLEYRKESGE FNPDPLQASG RTQEALVRPV AGEYTTKEVF GEVLVPLASP GQSIPLIDTL EFEGRIRYVD NSLTNGFTAY TYGGRYRPVP DIELRGNFTK SLRAPSIAEL FTPASVGSGF FPDPCDVRNI TSGPNPTVRQ KNCAVFFKAY GITDPTSFFS TTVGVAIPIQ LGGNPKLQNE TAKSYTYGVV LRPRFLPKFQ AAIDWNRILV NGNITALTSL DIAQGCYDDP DFNAANPDAG NAFCSLFRRT KGGPQNGQLV VDPQNPGLSN QFVNGASIRF QGLTVDAAYR DIPIKAAFVE GSLRIDARFY YLDKLCTSNN GVTTICLQGT HTQPRYTAQV DATYVQERFA LNLQANYRPS TQYDLLFTEE NQDVLKRGSQ VLFNLGASYR LGENTQIRGA IQNLLDSSPP GPIAGFNNSF GNTTAVGDIL GRRYSVAVTH TF
|
| |