Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0308 |
Symbol | |
ID | 5897582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 345170 |
End bp | 347545 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560792 |
Product | TonB-dependent receptor |
Protein accession | YP_001681943 |
Protein GI | 167644280 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.264582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGT CCATCTGGCT GGTTTCGAGC GCGCTGTGTT CGATCCTGGC CGCAGGCCCG GTCCTGGCCC AGGACAAGGC GACCGCCGCG ACGGGCGGCC AGACCCTCGA CGAGGTGGTG GTCACCGCCG AGCGCTTCGG CTCGGGCCTG GCCCGCGCCA CCTTCACCCT CGGCGCTGAG GACATCCAGG AGCGTCCGCT GGGCGCCGAG ATCACCCAGG CCCTGGTCAA GGTGCCGGGC GTGCAGGTCT CGACCGGCGA CGCGCGGGGC GGCAGCTTCT CGTTCGAGAT CTACATGCGC GGTCTCAGCG ACGAGCAGAT CGGCCTGACC CTGGACGGCG TCCCGACCGG CGACTCGCGC TTCAACGGCG GCTCGCCGCC CGCGCGGTTC ATCGAGTCCA GCAATATCGG CAAGATCACC GTTTCGCAGA GCGCCGGCGA CATCGGCGCG CCGTCGCGCT TCGCCCTGGG CGGCTTCATC GACTTCGCCA CCGACGCGCC GCGCCACGAC CTGGGCGCCA CGGTCGAGGC TGGGGTCGGC TCGTTCGACT TCCGCCGGAT CTATGGCCGC GTCGACAGCG GCGAGATCGC GCCGGGCCTT TCGGGCTACC TGACCTATTC GCACCAGGAG AACGACATCT GGGCTGGTCG CGAGAGCCGC GGCTCCGAGC GCGGCCACTA CGAGCTGAAG CTGGTCAAGG ACTTCGACAA CGGCTCGTTC CTGAAGGCCC GGGTCTCGTA CAACGACCAG ACCGACAATG ACTTCAACAT CGTCACCAAG GGCGAGTTCA AGGCCGCGCC GCGCAGCGAC CGGGCTCTGG ACGCCATCAC CGGCCTACCG GCCAAGGACA TCGACTTTGG CGGCGCCCTG GGCGGCTGGC GCAAGGACTG GCTGACCTAT CTGAACGGCC ACTTCAAGCT GAACGACGCG CTCAGCCTCG ACGTCAATCC GTACTACCAG ACCCTGAATG GGGAATCCTT CCGCTACCAG GACCGCCAGC GGATCCTGAC CGGCGGCGAT CCGCGCGCCG TGACCGGCTA CAACGCCAAC GGCGGCGCCA TCCGCCCGGC CCTGTCCACC CTGCGCAACA GCAATGTCGT GGGCGGCCCG GCCGACATGC GGGTCACCCC GCGCGAGCGC GACCGTTACG GCGTGACCGG CGAGATCAAG GCGTCGAACG TCTTCGGCTC GGGCCACAGC CTGCGGGTCG GCGGCTGGTG GGAAGGCGGC GAGTCCACCG AGAAGCGCAA CTTCTTCCCG ATCATCGACT CGTCCAGGAG CATCGCCTAC GACCGCTCCA AGCTGAACTA TGTCGAGTAC GAGCGCACGG CCTCGGTCGA GACGACCATG CTGTACGCCC AGGACGAGTT TCGGGCCTTG GACGACAAGC TCAAGGTCGA CCTGGGCCTG ACCTGGTACG ACGTCAAGTA CGACGCCAAG TCGCCGCTGG AGTACAAGGC CAACGTCAAG TTCTCGCAGC ATTCGGAGGT CAATCCGAAG CTCGGCGCGA CCTATCAGCT GGCGCCGGCC TGGGAACTGT TCGGCGGCTA CGCCAAGAAC TTCGCCGGCA TCCCGGAAGA CGCCTTCCTC GGCTCGACGG CAGTGATCAG CCCAAAGGAC CTGGACCCGG TCGAGACCGA GAACCTGGAC CTGGGGCTGC GCTATGTGAA GCCGAACATG GCCTTCTCGA TCCAGGCCTA TGACGTGGAC CTGAAGAACA ATGTCGGCAT CGTGCCGCGC GATCCGACCG CGGCCCTTGA CCCCGACGAA GTGGTCCGGG GGAATGTCGC GACCAAGGCG GTCAATATCG CCGGCATCAA GACCAAGGGC GTGGAGCTGA CCGGCTATTA CGACTTCGGC GCCTTTGACC TCTACGGCGC CTATTCGCGC CAGGACGCCA AGCACGACAA CCCGGCCGTC GGCAGCGCCG CGCGCAAGGC CCTGGCGGCG GTGGCGGTGA TCGGCGGGGC GGGCGTGCGA GACATCCCCA AGAACAGCTT CTATGGCCAG GTCGGCTGGA AGCCGCTGGA GGGGCTGAAG CTGGACGCCA ATGTCCGCTA TGTCGGCGAC CGTGTCGGCG GCCACATCGT CGCCCCGACC ACCTTCCAGG AGATCGGCGT CGAGATGATC GACGGCTACG CCCTGGTCGG GCTGACGGCG ACCTACGATC TCAAGCGGGC CGGCGTTCCC GACCTGCGGT TCCAGCTCAA CGTCGATAAC CTATTCGACG AGGAATACAT CGGCGCGGTC AGCGGCTCGA CCGCCACCCA ACCGGAGTTC GGCTACACGG TCGCGACGCC GAACGCCCGC ACCCTGGATC GCTACTTCAT CGGCGCGCCG CGCACCTACA CCCTTTCGGT GCGGACCCGC TTCTGA
|
Protein sequence | MKTSIWLVSS ALCSILAAGP VLAQDKATAA TGGQTLDEVV VTAERFGSGL ARATFTLGAE DIQERPLGAE ITQALVKVPG VQVSTGDARG GSFSFEIYMR GLSDEQIGLT LDGVPTGDSR FNGGSPPARF IESSNIGKIT VSQSAGDIGA PSRFALGGFI DFATDAPRHD LGATVEAGVG SFDFRRIYGR VDSGEIAPGL SGYLTYSHQE NDIWAGRESR GSERGHYELK LVKDFDNGSF LKARVSYNDQ TDNDFNIVTK GEFKAAPRSD RALDAITGLP AKDIDFGGAL GGWRKDWLTY LNGHFKLNDA LSLDVNPYYQ TLNGESFRYQ DRQRILTGGD PRAVTGYNAN GGAIRPALST LRNSNVVGGP ADMRVTPRER DRYGVTGEIK ASNVFGSGHS LRVGGWWEGG ESTEKRNFFP IIDSSRSIAY DRSKLNYVEY ERTASVETTM LYAQDEFRAL DDKLKVDLGL TWYDVKYDAK SPLEYKANVK FSQHSEVNPK LGATYQLAPA WELFGGYAKN FAGIPEDAFL GSTAVISPKD LDPVETENLD LGLRYVKPNM AFSIQAYDVD LKNNVGIVPR DPTAALDPDE VVRGNVATKA VNIAGIKTKG VELTGYYDFG AFDLYGAYSR QDAKHDNPAV GSAARKALAA VAVIGGAGVR DIPKNSFYGQ VGWKPLEGLK LDANVRYVGD RVGGHIVAPT TFQEIGVEMI DGYALVGLTA TYDLKRAGVP DLRFQLNVDN LFDEEYIGAV SGSTATQPEF GYTVATPNAR TLDRYFIGAP RTYTLSVRTR F
|
| |