Gene Information |
Locus tag | Caul_2142 |
Symbol | |
ID | 5899597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2317924 |
End bp | 2321028 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562632 |
Product | TonB-dependent receptor |
Protein accession | YP_001683768 |
Protein GI | 167646105 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
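The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2317924
END_BP = 2321028
GENE_LENGTH_BP = 3105
PROTEIN_LENGTH_AA = 1034


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the reported gene length (3105 bp) and protein length (1034 aa) agree with the start and end positions.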
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000142845 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0159239 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCAATC CACGAAACAG GTCCGCCTTG GCGCGGGGCA TGACGCTGGC GCTGCTGGCG GGCGCCTCGT CCACGGCCTT GGCCACGGGC GCCTGGGCCC AGACCGCGCC GGACGCGGGG TCCCAGGTCG AGGAAGTCGT CGTCACCGGC ATCCGCGGGT CGCAGATGCG CTCGGTCGAC GTCAAGCGGC GCGAGGCGTC GATCGTCGAC GCGATCTCGT CGGAAGACAT CGGCAAGCTG CCCGACGTCA CCATCGCCGA CTCGCTGCAG CGCATTCCCG GCATCCAGAT CAAACGCGAC GCCGGCGAGG GCGCGACGGT CAATATCCGC GGCCTGGCCC AGGTCATCAC CCTGCTGAAC GGCGAGCAAT ATCTCAGCGC CGGCAACATG GGTTCGGCCC AGCCGAACCT GCTGGACGTG CCCTCGCAGC TGATGAACCA GGTGCTGGTC TACAAGTCGA CCGATCCGAA GAACGCGCTG TCGGGCATCA CCGGCACGAT CGACCTGCGC ACCCGCCGTC CGTTCCAGAT GGAGCAGGGC CTGCAGATCG CCGGCGGCGT CGAAGGTCAG CGCGGCGAGC GCACCAAGGA CAACGATTAT GTGATCAACG GCCTGGCCAG CTGGCGCAAT GAACGCGTCG GCCTGCTGGT CTCGGCCGCC GCCAGCAAGG CGAACCTCGG CAACAACTCG TCCGGCGTCG TCGGCCTGTC GGGCAATAAC GACTGGGGCG GCTCGGCCGC CAACAACTTC ATCTCGCCGC ATGGCTTCGA AAGCTTCCAC CGCGAAGTCG AGCGCAAGCG TCTTGGCGTC AATGTGGCCT TCGAAGCCGA CCTGGGCGAG GGTTTCACCC TGGTCGCCGA GGGCTTCTAC GCCAAGCTCG ACGAATATAA CCGCGCCGCC GGCATCAATA TCTCCAACCG TTGGGACGGC GGCGCCTTCG GCGTCTGGAC CACCCCGACG GTGTTCGAGA ACACCGGCCA GACCAGCGCC GGCAACGGCC GCCCGTGGGT GGCGGTCGAT GAGTACGACA TCAACGCCTG GTGGGTGAAC AGCTTCGCGG TGAACCGCAC CACCAAGTCG ACGACCAAGA ACTACAACCT CGAACTGAAG TACGACAACG GCGGCGCGTT CACGAGCGAA GTGCGGGCCA TCCGCGCCAA CGGCAACCGT CTCAGCATGA ACGGCCAGGC CCAGGGCGAC CTGAGCAACT GGCAGTACGC GCCCGGCCGC TTCAACCTGT TCCGGGATCC GGGCGACCGT ACGCGCGGTC CGTTCTATCC CGCTTCGATC TGCGCCCAGT ATCCGACCTC GCAACGCAGC AACGCCGTCG TCGGCAGCGC CGGCGGCTGC TACCTCAACC CCAACCCGCA AGGCTACGGC GCCAACCCGC AGCTGCATTA CAATATCGGG GGCAACAAGG CGATCTGGAG CGGCTTCGAC AATCCGCTGG CCGGCGGCCT GGGGGCGGGC AAGACCCTCA AGGACTACAT GGCCAACAAG GACAGCTACG CGATCGCGGC GTTCTCCTCG GAAGGCAATA ACGAAATCGA CTCGGACATG AACGTGTTCC GGGCCGAGGG TCACTACAAG TTCGACAACA AGTTCCTGGG CTTCATCACC AAGATTGACG CCGGTGTCCG CCAGAGCGAC CGCACGGTCA ATGTCGAGGC CTTCCACCTG TTCTCGCCGT TCTACGGCGG CACGCCGGGC GCGGTGCAGG CCAACGGCAC GCCGGTGCCG GCCGCGGGCT GCTTCGCCCA GTGGAAAGCC ATCGACGTGG TCATGAACCA GGACCAGTGT 
CAGGCTGGCG AGTTCGTGCC CAACCCCCTG GGCGGCGCTC CGGTGTTCCA GGGCTACACG GTCAATCGTC CGACCAAGAT CGACGCCTAT AACAACGTGA TCTTCGAGAC CAACCTGGGC AGCATCACCC AGGGCATCCC CGGCTTCTGG GTCGTCGATC CGCATGACTT CGACAACGTG GCCGCCTTCC AGACCAAGGT GTTCGGCGCG GCGGTCCGCT ACCAGATCCC CGGCGCGACC TACGACGTGA CCTTGAAGGA ACAAAGCGCC TACGGCGCGG CCAACTTCGA GATCGGCAAG CTGTCGGGCA ACGTTGGCCT GCACGTCATC CAGAGCCACT TCCTGGTGAA GCAGAACCTG ACGGGCGCCA CCCAGAACTA CGGCGACACC AACCTCGATG TCGGCGACAC GGTCACCAAC CGCAAATACA CCGACTGGCT GCCGTCCTTG AACGCCGTCT ACGACTTCGA CGACCACCTG AAGTTCCGCT TCGCCTATTC CAAGACCATG CAACCCTTGG ATCTGGGCAA CTACGGCGGC GGCCTCAGCA TCTCGACCGC CGACTGCGGC ACCAAGCCCG GCCGTTGCGT GACGGGCGCC AGCTCGTCGG GCAACCCCTA CCTGGATCCG TGGCGCTCGA CGAACTTCGA CGGCGCGATC GAATACTACT TCGGCGGCGC CTCGATGGTG AACCTGTCGG CCTTCAAGCT GAAGATCGAC AGCTTCGTCA CCGGCGGCAC CACCACCGGC ACGTTCAAGG ACGAGGACGG CACACCGCGC ACGGTCAATG TCAGCCAGCC CATCCAGGGC GACGGCGGTT CGGTGAAGGG CGTCGAGGTC GGCACGAAGC TGGCGTTCAG CGACCTGCTG ACGGATGGCG GCTTCTTCTC GAACTTCGGT GTCGACGCCA ACGCCACCTA TTCGCCCAGC TCGGAGTCGC GCCTTGGTCT GGACGGCAAG AAGCTGCCGT TCACCGACAA CTCCAAGTAC CAATACAATC TGATCGGCTG GTACCAGGAC GACAAGTGGC AGGCCCGCGT GGCCTACAAT TATCGCACTG ACCGCCTGTC TAGCGTCTCG GGAAGCTCGG GTAACAACCT GCCGATCTAT CAGGACGCCA CGGGCTTCGT GGACGTGAAC CTCAGCTACA ACGTGCGCGA CAACATCGTC GTCTACGCGA ACGGCTCGAA TGTCACCGGC GAGATCGAGA ATTACTACGT GAAGTTCGCG GACGGTAAGA CTCAGTACGC CAACCAGAAC CAGTTCGAAC CCCGCTACAC CATCGGCATC CGCGCCAAGT GGTGA
|
Protein sequence | MSNPRNRSAL ARGMTLALLA GASSTALATG AWAQTAPDAG SQVEEVVVTG IRGSQMRSVD VKRREASIVD AISSEDIGKL PDVTIADSLQ RIPGIQIKRD AGEGATVNIR GLAQVITLLN GEQYLSAGNM GSAQPNLLDV PSQLMNQVLV YKSTDPKNAL SGITGTIDLR TRRPFQMEQG LQIAGGVEGQ RGERTKDNDY VINGLASWRN ERVGLLVSAA ASKANLGNNS SGVVGLSGNN DWGGSAANNF ISPHGFESFH REVERKRLGV NVAFEADLGE GFTLVAEGFY AKLDEYNRAA GINISNRWDG GAFGVWTTPT VFENTGQTSA GNGRPWVAVD EYDINAWWVN SFAVNRTTKS TTKNYNLELK YDNGGAFTSE VRAIRANGNR LSMNGQAQGD LSNWQYAPGR FNLFRDPGDR TRGPFYPASI CAQYPTSQRS NAVVGSAGGC YLNPNPQGYG ANPQLHYNIG GNKAIWSGFD NPLAGGLGAG KTLKDYMANK DSYAIAAFSS EGNNEIDSDM NVFRAEGHYK FDNKFLGFIT KIDAGVRQSD RTVNVEAFHL FSPFYGGTPG AVQANGTPVP AAGCFAQWKA IDVVMNQDQC QAGEFVPNPL GGAPVFQGYT VNRPTKIDAY NNVIFETNLG SITQGIPGFW VVDPHDFDNV AAFQTKVFGA AVRYQIPGAT YDVTLKEQSA YGAANFEIGK LSGNVGLHVI QSHFLVKQNL TGATQNYGDT NLDVGDTVTN RKYTDWLPSL NAVYDFDDHL KFRFAYSKTM QPLDLGNYGG GLSISTADCG TKPGRCVTGA SSSGNPYLDP WRSTNFDGAI EYYFGGASMV NLSAFKLKID SFVTGGTTTG TFKDEDGTPR TVNVSQPIQG DGGSVKGVEV GTKLAFSDLL TDGGFFSNFG VDANATYSPS SESRLGLDGK KLPFTDNSKY QYNLIGWYQD DKWQARVAYN YRTDRLSSVS GSSGNNLPIY QDATGFVDVN LSYNVRDNIV VYANGSNVTG EIENYYVKFA DGKTQYANQN QFEPRYTIGI RAKW
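The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates that relationship on the first 30 bases of the gene sequence. It is an illustrative, standard-library-only translation, not the method used to produce this record, and the names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-residue assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS fragment; whitespace between base groups is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative initiation codon is still read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon, if present.
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First three 10-base groups of the gene sequence above.
print(translate_cds("TTGTCCAATC CACGAAACAG GTCCGCCTTG"))  # MSNPRNRSAL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the complete protein, although that result has not been verified here.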