Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2400 |
Symbol | |
ID | 5899855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2609677 |
End bp | 2612691 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562891 |
Product | TonB-dependent receptor |
Protein accession | YP_001684025 |
Protein GI | 167646362 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.204664 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCAA CGGGCAGTGA GCCTTCGCGC CCCAACGCCA AGGGCGCCGA GGTCTCCATG CAGGACGCCG CGGCGGCAAC GAACAGAGGA CGTTTAGTCC GGCAAATGAG ACGCGAGGCT TACGGCTCGG CCGGAGCGGC TGCTCTGCTC GTCAGCCTGA TCGCCACACC CGGTCTGGCG CAGACCGTTC CCGCGTCCAA GCCGGAAGCT GTCACGGTCG GCGAGATCGT CGTCACCGGA ACGCGCATTC AGACCACGGG CTTCACCGCG CCGACCCCGA CCTCCGTGAT CGGCGAATCG CAGATCCAGA ACAACGCCCA ACCCAACGTC TTCGCCACCA TCGTCCAGTT GCCGTCCTTG CAAGGCTCAA GCGGATCGGC GACGAACACC TTCAGCACCT CCAGCGGCCA ACAGGGACTC AGCTCGTTCT CGCTTCGCGG CCTGGGCACG ATCCGCACCC TGACCCTGCT CGACGGCCAG CGCGTGGTCG GGGCCTATTA CACCGGCGTC ACCGACGTCA GCCTGTTCCC GCAACTTCTG ATCCAGCGCG TGGACGTGGT GAGCGGCGGC GCCTCGGCCT CCTATGGGTC CGACGCGGTG GGCGGCGTCG TGAACTTCAT CACCGACACC CGCTTCAATG GGTTCAAGGG CAACGTGCAG GCCGGTGTCA CGACTTACGG CGACGACGAG CAAGGTCTCG TCCAACTGGC GGCGGGCCGC AGCTTCTTCA ATGATCGCCT GCACGTGGTC GGCAGCGGCG AATGGGCCAA GGAGGACGGC GTTGGTCCTG GCGGCTTTGG CCTGGACTTG GCCGGCGATC GCGACTGGTT CACCCAGACG ACGATGATAA ATCGCAACGT CAACAACGAC GGCGCGCCGC AGTACGTGAT GCGCGACTTC GCTCAACCCT ACAACTACAC CAAATACGGC CTGATCTCGG CGGGTCCGCT GCAGGGCACA GCCTTCGACC AGAGCGGCCA ACCGTTCCAA TTCCAGTACG GCTCCAACGG CGTCCCCACC AAGAACGCCT CGGGCGCCGT CACCGGTTGC TTCCCCGGCT TCTGCGTCGG CGGCGACCTG TCTGGCAATG TCGACAGCGG TCGCACGCTC CAATCGGCGA TCGAACGCCG AGTGGCTTAT GGGCGGGTCG GCTATGACTT CGCCGAGAAC AACGAGGCCT ATGTCTCCTT CAACCTGGGA CAGGTCAAAA CCAGCAATCA GCCCGTAAAT GGGGAAAATC GCCCGGGCCT CACCTTGCAG TGCGCCAACC CGTACGTGCC CGCCTCGGTG CAGGCCGCCT GCGCGACCGC GGGCGTCACC AGCTTCCAGT TCGGCACGAG CAACGCGCTG CTGCCCAATA CCGAGGTCCA CACCGACCGG CGCCAGTATC GCGTGGTCAC CGGGCTGAAG GGGAAGTTCG CCCTCCTCAA CTCCGACTGG ACCTACGACG CCTATTACGA GCACGGTGTG AACAAGACGG CGATCGACGT CGATCACATC CTTCTCACCC CGCACTACAA CCAAGCGATC CAGGCGATCA CGCTCAACGG CGTGATCGCT TGCGCCGATC CCGTGGCGCG GGCCAGCGGC TGCCAGCCGC TCAACATCAT TGGCGGCAAG CCCCCGTCAG CGGCGGCTCT GGCCTATGTC CAGCCGGAGA ATGGCCCGTT CCAGCGTCTG CGCATGACCC AGGACGTGGC CAGCCTCGCC TTCTCGGGCG CGCCGCTCAA CCTGTGGGCC GGTCCGCTGT CGGTGGCGTT CGGGGCCGAA TATCGCCGGG AGTTCTACAC CGTTCGCGCC GACGCCTACG GCGCTGGCGT GTCGGGCCGC AGCCCCAACA CCGCCGAGTT CCCGGCCGAC CCTGTGCTCC TGCCGGGCGG CAACAACTGG TACGCGGGCA ACTACAAGAA CGGCAACGGC GCCTACAACG TCAAGGAAGC CTTCCTCGAG CTTGATCTTC CGCTGTTCGA CTCCGACGCC CTGGGTCGCG CCAACCTCAA CGGCGCGGCG CGGGTGACCG ATTACAGCAC CTCGGGGACC ATCTGGACAT GGAAGGCGGG CGGCACCTGG GACACGCCGA TCAAGGGCCT GCGCCTGCGT GGCGTGACCT CGCGAGACGT GCGCGCGCCC AACCTGTCGG AACTGTTCGC CGCGCCGGTG ACGACCACGC TGCCCAACTT CCTCGACCCT GTCCGCAACG TGAACGTGGT GGCGATCCAG AACGCGGTCG GCAACCCCGA CCTGACGCCG GAAATCGCGC GCAACACCTC GTTCGGGGTG GTCCTGGCCA ACCCGTCGTG GCTGCCCGGC TTCAGCGCCT CGTTCGACTA CTACAAGATC AAGGTCGATG ACGTGATCTC CAGCCTGGGC GCGGCTCAGA TCGTCGACCT ATGCTACCGC AACATCCTGC CGGAAACCTG CGGGGCCTAT AACCTCAACA ACACCAGTGG CCCCAACTAC ATCAACGTCC AGGCGTTCAA CCTGGCTTCG ATCAAGACAA GCGGCTTCGA TATCGAGGCC AGCTATCGCT GGCGGCAGCC GCTGGGCCTG CCGGGCGCCT TCACCGTGCG CGCGCTGGCG ACGCATATCC GTGAATTCAT CACCGATACG GGTCTGCCGG GCACGGCTCC CACCGACTCG GCCGGCGTCA ACACCGGTGC GACGCCGGAC TGGAAGTGGC TGGCGATCCA GACCTATGAG GGCGACCGGT TCAGCCTGAC GGTGCAGGAA CGTTGGTTCA GCGATGGCAA TTACGGCAAC CAGTATGTCG TCTGCGCCGC GGGCAGTTGC CCCGTCTCGA CGGCGATCGC GCCCACCATC GACAGCAACT CCATGCCGGG GGCGTTCTAT CTGGATGTCG GCGGCACCTA TAATATCCGC AAGGACGTCA CGGCCTATTT CAAGGTCGAC AACGTCTTCG ATCACGACCC CGCCAAGTCG CCGCAGTACG CCAATCCGGC GCTCTACGAC ATCGTCGGCC GCATCTATCG CGGCGGCGTT CGCTTCCGCT TCTAG
|
Protein sequence | MFSTGSEPSR PNAKGAEVSM QDAAAATNRG RLVRQMRREA YGSAGAAALL VSLIATPGLA QTVPASKPEA VTVGEIVVTG TRIQTTGFTA PTPTSVIGES QIQNNAQPNV FATIVQLPSL QGSSGSATNT FSTSSGQQGL SSFSLRGLGT IRTLTLLDGQ RVVGAYYTGV TDVSLFPQLL IQRVDVVSGG ASASYGSDAV GGVVNFITDT RFNGFKGNVQ AGVTTYGDDE QGLVQLAAGR SFFNDRLHVV GSGEWAKEDG VGPGGFGLDL AGDRDWFTQT TMINRNVNND GAPQYVMRDF AQPYNYTKYG LISAGPLQGT AFDQSGQPFQ FQYGSNGVPT KNASGAVTGC FPGFCVGGDL SGNVDSGRTL QSAIERRVAY GRVGYDFAEN NEAYVSFNLG QVKTSNQPVN GENRPGLTLQ CANPYVPASV QAACATAGVT SFQFGTSNAL LPNTEVHTDR RQYRVVTGLK GKFALLNSDW TYDAYYEHGV NKTAIDVDHI LLTPHYNQAI QAITLNGVIA CADPVARASG CQPLNIIGGK PPSAAALAYV QPENGPFQRL RMTQDVASLA FSGAPLNLWA GPLSVAFGAE YRREFYTVRA DAYGAGVSGR SPNTAEFPAD PVLLPGGNNW YAGNYKNGNG AYNVKEAFLE LDLPLFDSDA LGRANLNGAA RVTDYSTSGT IWTWKAGGTW DTPIKGLRLR GVTSRDVRAP NLSELFAAPV TTTLPNFLDP VRNVNVVAIQ NAVGNPDLTP EIARNTSFGV VLANPSWLPG FSASFDYYKI KVDDVISSLG AAQIVDLCYR NILPETCGAY NLNNTSGPNY INVQAFNLAS IKTSGFDIEA SYRWRQPLGL PGAFTVRALA THIREFITDT GLPGTAPTDS AGVNTGATPD WKWLAIQTYE GDRFSLTVQE RWFSDGNYGN QYVVCAAGSC PVSTAIAPTI DSNSMPGAFY LDVGGTYNIR KDVTAYFKVD NVFDHDPAKS PQYANPALYD IVGRIYRGGV RFRF
|
| |