Gene Information |
Locus tag | Caul_2123 |
Symbol | |
ID | 5899578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2287471 |
End bp | 2290248 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562612 |
Product | TonB-dependent receptor |
Protein accession | YP_001683749 |
Protein GI | 167646086 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
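The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is excluded from the protein length; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2287471
end_bp = 2290248

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.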
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.154655 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC AGAACACGAA CGGCCGCCGC GCGCGCCGCA TGGCCTGGCT GATGACCGGG TGCGCCGCCA TCGGGCTCTC CGCCGCGACC GGCGCTCAAG CCCAGACGTC GCGGGCCGAG CCGAACGATA GCGTGGAGGA AGTCGTCGTC ACGGGCAGCT ACCGCCGCAG CCTTGAGAAG GCCGTGGATA TCAAGCGCGA CACTGTCGGC TTTTCGGACT CGATCGTGGC GACCGACGTC GCCAACTTCC CGGATCAAAA CCTGGCCGAA GCGCTGCAGC GCATTCCCGG CGTGACGATC GAGCGCAACA AGGGCCTGGG CGGCCGTGTC AGCGTTCGCG GCCTGCCCAG CGAGTTCACC TTCGTCACCA TCAACAATCT GGCGACCGCC TCGGGCAGCG GCGGTCGTGA CGTCGAGTTC GACATCTTCG CCTCGGAAAT CATCCAGCAG GTCACGGTCC AGAAGTCGCC CCGGGCGGCG GACGAAGAAG GCGGTATCGC CGGCGCGATC AACATCTCGA CAACCCGTCC GTTCGACTAC AGTGGCCGCA AGCTGATCGC CTCGACCGAG GGGGCCTATA ACTCGATCTC CAAGAAGACA GACCCCAAGG TCTCGTTTCT GGCCAGCGAC ACGTGGGGGG ACTGGGGCGG CCTGGTGTCG TTCTCGGCGG CGCGCCGCAC GAACCGCACC GACTCCAACT CGGGCATCAA CTTCCGCCCG ATGTTCCGCT TCCTCGAAGC GGGCGGTGCG CGCGCCTCGC AGGCGGCCGC CGTCCTGGCC CGCGACGCCG GCGTGATCGT CAAGAGCAAC ACCGACCGCA ACGAGACTGG CCGCATCATC TTCCAGGACA AGGTCGGCGA CCGCGCCTAC CTGAACACCC AGGACCAGTG GGGCGGCACC GCTTCGCTGC AGTACAAGCC CTCGGCCAAT TTCGACATCG CCTTCGACCT GATGCTGGGC GGCTATGACG CGACCGAGGA CCAGTACGAC GCGGCCGCTT ATTCGGCCTC AAGCAAGAGC ACGTTGGAGA CCATCCACAG CTACGACAAG ACCACCCTGG CCGACTACAA CATGGTCGTG CTGCGCGACG TCTCCTACAC CGCGACCCAG CACGAGATGC TCAGCAAGGA GCAGATCAAC AAGACCGACT ACGCCCAGTT CGGCTCGGAC CTGAACTGGC GCGGCGAGAC CTGGAAGCTG CACGCCCTGG CCGGCTATTC GGGCGCCAAG AAGACGCTCG ACTATTCGAA CCTGAAGCAC GTGGCCTACG CCCCGTCGCG CACCCGCTGG ACGGCCACCG GCGGCGAGAC GATCAAGAGC GCCAACCCGG CCTCGATCGA CATGTACAAC TCGCCTTCGA AATATCTGTT CGAGGCCTAT GAGACGACCC TCGAGAAGAT CACCGATGAC AAGTACGCGG CTCAGGTAGA CTTCACCAAG GACTTCGCCT TCGACTTCTT TCCCGCGCTC AAGACCATCC AGATCGGCGC TCGCCACACC GACAAGTCGA AGGAGCGCCA GTACGGCGCC CTGAACATCC AGGGGCCGGG TCCGGGCAGC ACCGCCTATC TCAACACCCG CACCATGGCC GACAGCCCGC TGACCCCGAT CGGCGATCTG GTGCCGGGCG GCGACTACAC GGTCCGCGAT ATCACCTGGA GCCAGATCTC GAACGATTAC GCGCGCAAGA CCTTCCGCTA CGCCGGCTTC ACCACGCCGT TCACGCCGGG CGACTACTAC AAGGTCGATG AGAAGGTCAC GGGCCTGTAC GCCATGGCCG ACCTGGGCTT CGACGTCGGT 
CCCGTGCCGG TGGCGGTGAA CGGCGGCGTT CGCTACGTCG ACACCTCGAT CACCTCGTCG GGCTATCATC AGATCCAGAA GCCGAATGGC TCGACGGGCT ACACCCAGGC GCCGGTGTCA AGCGACGGCA GTTATAACAA GTTGCTGCCC AGCCTCAACG TCACCGCCGA GCTGACTGAT AGCATCGTGC TGCGCGCCGC GGCGTCCAAG ACCCTGATGC GTCCGGCCCT GACGGACCTG GCCTACAAGC GTACGGCCAG CTTCAACTCG TTCCGCTTCA CCGACGGCAA CCCGAACCTC AAGCCGACCT TCGCCGAGCA GTATGAAGTC GGCCTTGAGA AGTACCTGCC GGAAGGCGGC CTGCTGGCCG TCTCGTACTT CAAGAAGAAG ATCGAGGGCG TCGTCCGCCA GGCCCTGACG GGCACGGTCA AGGGCGTCAC CAAGTACAAC GCCAACGGCA CGATCGACGG CGTCTACGAC TTCGACGTCT ACCAACCGAT CAACGCCGCG GGTTCGTACA ATGTCGACGG CGTCGAGCTG GTCGCCATAG TGCCGTTCGG CCTGCTGTGG GAGCCGGCCA AGGGCTTTGG CGTCAACGCC AACTACACGA TCCTGGACAG CTCGCTGAGC GGCCAATCGA TCATCGGCGT CCCGACCCCG CCGGTGGGCC TGGCCGACAA GGCTTACAAC TTCACGCTCT ACTACGAGAA CGACAAGTTC CAGGCCCGCG TGTCCTATAG CTACAAGGGC AAGTATGTCG AAGGTATCGG CTACGAGATG TATCCGATCT GGCGCTCGGG CTTCGGCCAG ACCGACATCT CGGTCAGCTA TAACATCAAC GAGCGCCTTC AGTTGAGCCT GGAAGGGATC AACGTCACCG ACGAGGTCAC CAAGGGCTAC ACGATGGATC CGTCGTTCCC GACCATGTAC GAGAAGTCCG GACGGCGCTT CTCGCTTGGC CTACGGATGA ACTTCTGA
|
Protein sequence | MTAQNTNGRR ARRMAWLMTG CAAIGLSAAT GAQAQTSRAE PNDSVEEVVV TGSYRRSLEK AVDIKRDTVG FSDSIVATDV ANFPDQNLAE ALQRIPGVTI ERNKGLGGRV SVRGLPSEFT FVTINNLATA SGSGGRDVEF DIFASEIIQQ VTVQKSPRAA DEEGGIAGAI NISTTRPFDY SGRKLIASTE GAYNSISKKT DPKVSFLASD TWGDWGGLVS FSAARRTNRT DSNSGINFRP MFRFLEAGGA RASQAAAVLA RDAGVIVKSN TDRNETGRII FQDKVGDRAY LNTQDQWGGT ASLQYKPSAN FDIAFDLMLG GYDATEDQYD AAAYSASSKS TLETIHSYDK TTLADYNMVV LRDVSYTATQ HEMLSKEQIN KTDYAQFGSD LNWRGETWKL HALAGYSGAK KTLDYSNLKH VAYAPSRTRW TATGGETIKS ANPASIDMYN SPSKYLFEAY ETTLEKITDD KYAAQVDFTK DFAFDFFPAL KTIQIGARHT DKSKERQYGA LNIQGPGPGS TAYLNTRTMA DSPLTPIGDL VPGGDYTVRD ITWSQISNDY ARKTFRYAGF TTPFTPGDYY KVDEKVTGLY AMADLGFDVG PVPVAVNGGV RYVDTSITSS GYHQIQKPNG STGYTQAPVS SDGSYNKLLP SLNVTAELTD SIVLRAAASK TLMRPALTDL AYKRTASFNS FRFTDGNPNL KPTFAEQYEV GLEKYLPEGG LLAVSYFKKK IEGVVRQALT GTVKGVTKYN ANGTIDGVYD FDVYQPINAA GSYNVDGVEL VAIVPFGLLW EPAKGFGVNA NYTILDSSLS GQSIIGVPTP PVGLADKAYN FTLYYENDKF QARVSYSYKG KYVEGIGYEM YPIWRSGFGQ TDISVSYNIN ERLQLSLEGI NVTDEVTKGY TMDPSFPTMY EKSGRRFSLG LRMNF
|
| |
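The relationship between the gene sequence and the protein sequence can be illustrated on short fragments of the record. The sketch below is not part of the database entry: it builds a codon table by hand, assuming that translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It strips the spaces that separate the 10-base blocks before translating.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the last four codons of the gene sequence above.
first_blocks = "ATGACCGCAC AGAACACGAA CGGCCGCCGC"
last_codons = "ATGA ACTTCTGA"

print(translate(first_blocks))
print(translate(last_codons))
print(round(gc_fraction(first_blocks), 2))
```

Applying `translate` and `gc_fraction` to the full gene sequence would be the natural way to compare against the Protein sequence and GC content fields; only short fragments are used here to keep the example compact.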