Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1838 |
Symbol | |
ID | 5899293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1950541 |
End bp | 1953480 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562328 |
Product | TonB-dependent receptor |
Protein accession | YP_001683465 |
Protein GI | 167645802 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.534381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00298176 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCATAT CCATTCATTC AAACGCCCTG CGACTGGCGC TTATCGGCGC CAGCTGCCTG ACCGGCCTCG CCGCCGCGCC CGCCTTCGCC CAACAGACGC CCGCGCCAGC CGCCAGCGCG GATGCGGTCG AGGAAGTCGT GGTCACGGGC TTTCGCAAGA GCCTCGCGGA CGCCACCAAC GCCAAGCGCG ACAGCATCGC CTTCACCGAC TCGGTGTTCG CCGAAGACAT CGGCAAGTTC CCGGATCTGA ACATCGCCGA GTCGCTGAAC CGCATTCCCG GCATCCAGCT GACCCGCGAA ATCAACGGCG ACGGCCTGAA CATCGCCATC CGCGGCCTGG GCACCGACTT CACCAAGATC GTGCTGAACG GCGCCCAGAT CGGTGTGGCC TCCAGCGGCC GGACCGACGC CCAGAACCAG AACCGGCAGG TCGACCTCGA CCTGTTCCCG ACCGAACTGT TCACCCGCCT CGACGTCAGC AAGACGCCAA TGCCCAGCCA GCTCGAAGGC GGCGTCGCGG GCATCGTCAA CATGCGCAGC TCGCGACCGC TGGACCGGCC CGGCCAACAC CTCACCTACT CGCTGCAAGG CGCCTATCAG GACTCCGCCG GCAAGTGGAG CCCGCGCGGC GCCCTGATCG GCAGCAAGTC GTGGGATGTC GGCGACGGCG AGTTCGGCCT GCTGGTCGGC TACGCCGGCG CGCGCTCCAA GAGCCGCACG GACGGGTTCG AGACCATCGG CTGGACCAAC GCCAGCACGG GCGGCTCGAG CAACTTCGCC TGCGGCGGCT GCAACTCGAC CTTCGGCGGC AACGGCTTCA CCTGGGCGCC GACCGTTCCG GCCAACGCCG GCAACGGCCT GACCACGGGG GCGACCGTCA ATGACGCCTT CCTGCAGGCC AACAACCCCG GCACGACCCT GCAGCAACTG AGCGACGGCC TGCTGCCGCG CCTGGGCCGC CAGTCCTACA GCGCCGGTCA TCGCGACCGC GACTCGCTGC TGGTCTCGCT GCAGTACCAG CCCAACGACC ACGCCGACTT CTACATCGAC ACCCTGCTGG GCAAGACCAA CCGCGAGTTC AGCCGGATCG ACATGGACTG GGTGGTCCGC AATTCGAACT TCATGGTGCC GACCAACGTC AAGGTCGGCG CCAACAACGT GATCACCAGC GGTACGTTCG CCAACTCGCA GTTCTTCCTC GAGGCGCGCC CCTATCACGA GACCAACAAG TTCGTGAACG TGAACCCGGG CGGCAGCTGG CGCTTCAGCG ACACCCTCAA GCTGGACGGC CAGTTCAACT ACAGCCGCAG CGTCTTCTTC CGCGAGGCGC CGACCATCCT GATCAACACG CCGCTCAACA GCGGCCTGAC GGTCACCTAC GACAACACCG GCGGCGATTT CCCCAGCATC AAGACCAGCG CCAACCTCAA CGACCCGAGC CTCGGCTGGA CCTTCGTCGG CGGCCGGGTG AACATCCAGA ACGAGAAGCG CGTCACCTCG ACCAAGGGCA CGCACTGGGA CCTGACCTGG GGCGACGAGC GCAACTACAT CAAGGGCGGC GTGGCCTATG ACGAGGCCTC GCGCTCCATC ATGGCGCTCG ACAATAGCGA CCGCTGGCAG CAGATCACCT GCGGCGGCGG CGGGACCTAT CTGCCGCGTC CCAACACCCA ACCCGCCTGC ACCGGCGGCG CGGGCTCGGC GATCACCAAC GCCCAGTTGG CCTCGTACCT GAAGCCCGGT CCGCTCGGCT TCATCACCGT CGACTACGAC AAGTTCAAGG CCGCGACCAA CTATCAGGCC CTGAACGACA CCGCGCCGTT CAGCAGCTCG GCCGCCACGG CCGCCAACTC TGGCGAGATC GAGGAAAAGA ACACCGGCGC CTATATCGAG TTCGCCGGCG TCGCCACCAT CATGGACCGC GAACTGCGGA TCGCCGGCGG CTCACGCTAC GTCTCGACCG ACCAGGACGT CACCGGCCCG GTCTCGATCC CGTTCCCGAA CGTCGCCAAC TGCACGCCCA ACTGCGTGCC CAACACCCTG ACGTTCAAGA CGACCTCGCA GCGGTATGAC GCCTTCCTGC CGTCGTTCAA CGCCGTCTAC GCCGTGCGCG ACAACATCAA TCTGCGGATG TCCGCGTCGC GCACCCTCAC CCGTCCGGAC CCCAGCGCCA TGCTGCCGGG CACCACGTTC AGCGACCCGT CCGCCCAGAA CGCCAACCAA GGCAACCCGG CGCTGCGCCC ATACACCTCC AACAACTTCG ACGTGGGCGG CGAGTGGTAT ACCGGCGGCG CGGGCTATGT TGGCGTGGCC CTGTTCCAGA AGGTGGTCAC GGGCTTCACC GCCGTGGGCG CCACCACCCA GCCGTTCACC GCGCTGGGCA TCCCGTTCGA CAGCCTGACC GATCTGCAGA AGACCGCCAT CAATAACCGT GGCGGCCCCA GCGCCGCGAC GGTGACGGTC AGCCAACAGG TCAACACCGG CTCGGACCTG ACCATCCGCG GCTACGAGCT GAACTGGGTT CAGCCCCTGG ACTTCGTGTT GCAAGGCGCC GGCTTCACGG CCAACTACAC CCGCGTCAAC CAGACCGGCA CCGGCGGCGT CGTGGCGCTG GGCGTCTCGC CCTACACCTA CAACCTGACG GGCTACTACG AGAACCACGG CGTGACGCTG CGGGTGTCCT ACAACTACAA CGACGCCCAG ATCAGCTCGG GCTTGAACCA GAACAGCGTG CCGACGGCGC GGATCAAGAC CGACGCCTAC AAGCAGATGG ACCTCTCGGC CAGCTATACC CTGCCCATCC TGGGCGGCGC GCAGATCACC TTCAACGCCA TCAACATCAC CAGCGAAACC CAGCGGCAGA CCTTCCAATA CCCGAACGCC GCCTACACCT TCTACGATCC AGGTCCGACC TATCTGATCG GCATCCGCGG TCAGTTCTAG
|
Protein sequence | MAISIHSNAL RLALIGASCL TGLAAAPAFA QQTPAPAASA DAVEEVVVTG FRKSLADATN AKRDSIAFTD SVFAEDIGKF PDLNIAESLN RIPGIQLTRE INGDGLNIAI RGLGTDFTKI VLNGAQIGVA SSGRTDAQNQ NRQVDLDLFP TELFTRLDVS KTPMPSQLEG GVAGIVNMRS SRPLDRPGQH LTYSLQGAYQ DSAGKWSPRG ALIGSKSWDV GDGEFGLLVG YAGARSKSRT DGFETIGWTN ASTGGSSNFA CGGCNSTFGG NGFTWAPTVP ANAGNGLTTG ATVNDAFLQA NNPGTTLQQL SDGLLPRLGR QSYSAGHRDR DSLLVSLQYQ PNDHADFYID TLLGKTNREF SRIDMDWVVR NSNFMVPTNV KVGANNVITS GTFANSQFFL EARPYHETNK FVNVNPGGSW RFSDTLKLDG QFNYSRSVFF REAPTILINT PLNSGLTVTY DNTGGDFPSI KTSANLNDPS LGWTFVGGRV NIQNEKRVTS TKGTHWDLTW GDERNYIKGG VAYDEASRSI MALDNSDRWQ QITCGGGGTY LPRPNTQPAC TGGAGSAITN AQLASYLKPG PLGFITVDYD KFKAATNYQA LNDTAPFSSS AATAANSGEI EEKNTGAYIE FAGVATIMDR ELRIAGGSRY VSTDQDVTGP VSIPFPNVAN CTPNCVPNTL TFKTTSQRYD AFLPSFNAVY AVRDNINLRM SASRTLTRPD PSAMLPGTTF SDPSAQNANQ GNPALRPYTS NNFDVGGEWY TGGAGYVGVA LFQKVVTGFT AVGATTQPFT ALGIPFDSLT DLQKTAINNR GGPSAATVTV SQQVNTGSDL TIRGYELNWV QPLDFVLQGA GFTANYTRVN QTGTGGVVAL GVSPYTYNLT GYYENHGVTL RVSYNYNDAQ ISSGLNQNSV PTARIKTDAY KQMDLSASYT LPILGGAQIT FNAINITSET QRQTFQYPNA AYTFYDPGPT YLIGIRGQF
|
| |