Gene Information |
Locus tag | Caul_1080 |
Symbol | |
ID | 5898535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1142135 |
End bp | 1144948 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641561562 |
Product | TonB-dependent receptor |
Protein accession | YP_001682708 |
Protein GI | 167645045 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
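The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers written for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(1142135, 1144948)
print(length)                  # 2814
print(protein_length(length))  # 937
```

Both results agree with the Gene Length (2814 bp) and Protein Length (937 aa) fields in the table.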
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.72185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.584922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAAT CTCTGCGTAT GCGGAGCCTG CTGGCCATGG GCGCGTCCAT GACAGTCTTG GCCGCCGTCG CCGCCCCAGC CTTCGCCCAG ACCACGCCGG CGCCCCAGCA AGGCGGCGGC AATGTGCTCG AGGAACTGGT CGTCACCGCC CAGAAGAAGG AAGAAGCCCT TCAGGACGTG CCGATCGCGG TGTCGGCCTT CAGCCAGAAC AGCCTCGAGG CGCAGAAGAT CGACGGCGGT CCCAACCTGC AGCAGGCGAT CCCCAACGTC TCCTTCGCCA AGAGCAACTT CACCAACAGC TTCAACTTCG CGATCCGGGG CATCGGCAAC AAGGCCGTCG GCGTCTCGAC CGACGGCGGC GTCGGCGTCC ACGAGAACAA CGCCCCGCTG CAGTCGGGCA ACCTGTTCGA CGCCGAGTTC TTCGACGTGG AGCGCGTCGA AGTGCTGCGC GGGCCGCAAG GCACGCTGTA CGGCCGTAAC GCCACCGGCG GCGTGGTCAA TATCATCACC GCCAAGCCTG TCGACACCTT CGAGGCCAAT GTCCGAGCCG AATACGGCAA CTACAACTCG CAAAAGGTTC GCGGGATGAT CAACATCCCG ATCCTGGGCG ACAAGCTGGC GATCCGCGCG GCCGGCAACT ACCTCAAGAG AGACGGCTTC GTCACCAACA CCTTCAACAA CCACAAGGTC GACGACCGGG ATCTGTATTC GACCCGCGTT TCGGTGATGT TCAATCCCAT CGACTCGCTG CGCACCAACT TCATGTGGGA GCACTTCAAG GAAGACGACA GCCGGGCCCG CGTGGGCAAG CAGCTCTGCA CCAAGGACCT CGGCCCGGCC ACGGTGGGCG GCGTCGCCTC CGGTTCGGCC CGCAACTTCC TGACCCAGGG GTGCCTGTCC GCCTCGCTCT ACGGCGACAG CGCCTACGGC ACGGTCAACA CCTCGGGCAC CCTGACCGGC GAACTGGGCA ACCTGGTGGG CTTCACCAGC GGCGACGCCA ACGCCGGCGA CACCGCCAGC CATAACCTGC GCGAAATCGA GTCGCTGCTG GACCCGATCT ACCGGTCCAA GTCCAACATC TATCAGTTCA ACCTGGCCTA TGACCTGACC GAGAACCTGA CCCTCACGGC CATGACCTCG TACAGCAAGG GCGATGTCTA CACCAAGCTG GACTACAACC GGAACGTCTC GACGGTGCCG TTCAACAGCA CCCCCTTCAC GCCGGGCGGC TTCTTCGCCG ATCCGCAGGT GGGCGCCACC AACAAGTTCA CCACCCTGGA CGTCTCGTCG GGCTGGAGCA AGCAGTGGAG CCAGGAAGTT CGACTGCAGT CGAACTTCGA CGGCCCGCTG AACTTCAACG TCGGCGGCAT CTGGTTCGAC TACAAGACCG TGACCGACTA CTACGTGATC GGCAATTCGC TGACCCTCTC GGCCCTGGCC CTGAACTACC AGAACACCGG CAACCCCAGC TGCAACCCGG TGACCTTGCC GGCCAGCTGT ATCGGCATCG ACGCCAACGC CACGCCGGAC GGCAGCGGCC ATAACTACTA TGACAATCGC TCGCCCTATC ACCTGAAGTC CAACGCCATC TTCGGCGAAC TGTACTGGCA GGCCAACGAG AAGCTGAAGT TCACCCTGGG CCTGCGCCGC ACCCACGACG ACAAGATCCA GAAGAACTAC GAGACCGTGC TTCTGGCGCC AGGCATCGGC CTGGAGCAGG ACCCGACCAC GCCACAGAAC CGCACGGTGT TCAACGAGCT GACCGGCCGC TTTGGCTTCG ACTACAAGCT CAGCGACGAC 
AACCTGCTCT ACGCCTTCTA TTCCAAGGGC TATAAGGGCG GCGGCTCGAA CCCGCCGGCG GCGGCGGGTG GCGCTGGCAC GCAGGCCACC TTCGCGCCCG AGTTCGTCAA CGCGTTCGAG CTCGGTTCGA AGAACACCCT GATGGGCGGC AGCGTGATGC TGAACGCCAC CGGCTTCTTC TATGACTACC AGGGCTACCA GATCTCCAAG ATCGTCAACC GCACCTCGGT CAACGAGAAC GTCAACGCCA CGGTCTACGG CCTTGAGCTG GAATCGGTCT GGTCGCCGAT CCACAATCTG AAGCTCAACG CCAACATAGG CTATCTGCAC ACCAAGATCG GCAGCGGCGT CAGCTCGATC GACACCATGA ACCGCACCCA GAGCAACCCC GCTTACCAAG TGGTCAAGGC GGGTCCCAGC ATCCCCGGCG TTGCGGTCGG CTCCAACTGC GTGGTCACCG CGGCGGGCAT CGCCACCGTC CTCAGCATCA ACCCCGCCTT GGGGGCGACG GTGCCGTTCG CCTGCGGCGG CAAGGCCTTC TACCAGGGCT TCCTGCAACT TCAGGGCGTG CCAGCTCCGT TCGCGGCGGC CGCCGCCAAC GCCATGTTCA ACTACGGATC CGGCTACTCG ATCGAAGGCA AGGCCGCGGA CCTGTCGGGT AACGAACTAC CCAACTCGCC GCACCTGACC GCCTCGGTCG GCGCCCAATA CACCTGGGAT TTCGCGGACG GCTGGTCGGC CACTCTGCGC GGCGACTACT ATCGCCAGAG CAAGCAGTAC ACGCGGGTCT ACAACACCAC CTACGACCAG TTGAAGCCCT GGAACAACGC CAACATCACC CTGAAGATCG AAAAGCCCGA GTGGGGCCTG CAGATCGACG CCTACGTCAA GAACCTGTCG AACAAGACCC CGATCACCGA CGCCTACACG ACGGACGACA GCTCGGGCCT GTTCACCAAC CTGATCACCC TGGAACCGCG CCTCTACGGC GTCAGCATCC AGAAGTCGTT CTAA
|
Protein sequence | MSQSLRMRSL LAMGASMTVL AAVAAPAFAQ TTPAPQQGGG NVLEELVVTA QKKEEALQDV PIAVSAFSQN SLEAQKIDGG PNLQQAIPNV SFAKSNFTNS FNFAIRGIGN KAVGVSTDGG VGVHENNAPL QSGNLFDAEF FDVERVEVLR GPQGTLYGRN ATGGVVNIIT AKPVDTFEAN VRAEYGNYNS QKVRGMINIP ILGDKLAIRA AGNYLKRDGF VTNTFNNHKV DDRDLYSTRV SVMFNPIDSL RTNFMWEHFK EDDSRARVGK QLCTKDLGPA TVGGVASGSA RNFLTQGCLS ASLYGDSAYG TVNTSGTLTG ELGNLVGFTS GDANAGDTAS HNLREIESLL DPIYRSKSNI YQFNLAYDLT ENLTLTAMTS YSKGDVYTKL DYNRNVSTVP FNSTPFTPGG FFADPQVGAT NKFTTLDVSS GWSKQWSQEV RLQSNFDGPL NFNVGGIWFD YKTVTDYYVI GNSLTLSALA LNYQNTGNPS CNPVTLPASC IGIDANATPD GSGHNYYDNR SPYHLKSNAI FGELYWQANE KLKFTLGLRR THDDKIQKNY ETVLLAPGIG LEQDPTTPQN RTVFNELTGR FGFDYKLSDD NLLYAFYSKG YKGGGSNPPA AAGGAGTQAT FAPEFVNAFE LGSKNTLMGG SVMLNATGFF YDYQGYQISK IVNRTSVNEN VNATVYGLEL ESVWSPIHNL KLNANIGYLH TKIGSGVSSI DTMNRTQSNP AYQVVKAGPS IPGVAVGSNC VVTAAGIATV LSINPALGAT VPFACGGKAF YQGFLQLQGV PAPFAAAAAN AMFNYGSGYS IEGKAADLSG NELPNSPHLT ASVGAQYTWD FADGWSATLR GDYYRQSKQY TRVYNTTYDQ LKPWNNANIT LKIEKPEWGL QIDAYVKNLS NKTPITDAYT TDDSSGLFTN LITLEPRLYG VSIQKSF
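The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation such as GC content. The following is a minimal sketch of that cleaning step and a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
sample = "ATGTCGCAAT CTCTGCGTAT"
seq = clean_sequence(sample)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.45
```

Applying the same two functions to the full gene sequence would give a value that can be compared with the GC content field in the Gene Information table.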