Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4016 |
Symbol | |
ID | 5901478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4349536 |
End bp | 4352583 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564537 |
Product | TonB-dependent receptor |
Protein accession | YP_001685639 |
Protein GI | 167647976 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG GGCTTACGGG CGTATCGAGC GTCGCGCTGA TGATCGGGGT CGCCGGCGTG GCGCTGCCGG CATGGGGCCA GGACGCGCAA CGGCCGGCGG ACACGGCCAC GCAGGTCGAA GAGGTCGTCG TCACCGGTTC CAACATCCGC GGTTCGGCGC TCGACAACGC CCTCCCGGTG GAGGTCTATT CTCAACAGGA TCTGGAAAAG CAGGGCTCGC CCACGGCGCT GGAGTTCGCC AAGAGCCTGA CCATTTCGGG TCCGACGACG GGCGAGTCGT ACTATTTCGG CGGTCCGGCC CTGGTGGGTT CGGTGAACTA CAATCTGCGC GGCCTGGGCG CCGACAAGAC CCTGGTCCTG CTCAACGGCC GGCGGATGAA CCAGAACACC GCCAACGTGC CGTCGATGGC CCTGGCCCGC ACCGAGATCC TCAAGGACGG CGCGGCGGTG ATCTACGGCG CCGACGCCAC CGGCGGCGTC GTCAACTTCA TCACCCTGGA CCACTTCACG GGCCTGCAGG CTCAGGGCCA GTACAAGCAG ATCAAGGGCT CCAAGGGCGA CTATTCGGTC GGCGTCATGG CCGGGATCGG CGAGGATCGG GTCAACCTGC TGGTGTCGGC CGAGTACGAG CACCGCTCGC GGCTGGGCAC GCTGGAGCGC GACTTCACCA AGCCGTCGCT GACGCCGGGG GCGGGCTATA ATCCGGCGCC CTGGTCGACC CTGACCAACC TGACGGGCTG GCTGCCGCGC GGCGCCCTGC CCGCCGTTCC CAGCGCCACG GATGTCGGCG AGTGGGGCGC GGCGGTCGGC GGGATCGTCT CCGACTTCAC CGCCTCCAGC TGCGCGGCGG TCGGCGGTCG GCCCGACAAC GCCTTCACCT GCGCCTACAA CTACATTCCC TACTACCGGC TGGTGGAGAA CCAGGACACC TATCGGCTCT ACGCCCAGCT GAAGGCCGAC ATCACCGACA AGATGAAGTT CCACGCCGAC GCCTCCTACG GGCGCGTGAC GCTGCCGCAG GTGATGGGCT CGCCGGCCCA GCCCGTGACC CGCGGCCCCG CCCTGACCAC CGGGGCGGTG AACCAGTTCT ACGTGCCAAT CACCAACCCG TTCGCGGCCG AGTTCGCCGC CGCGAACGGC ATCGTCGGCG CCCAGGGCTT CACGCCGATC ACCTACCGCC TGTTCGGCCA CGGCGGTAAT CCCTACTATT CGGGCGGGGA CGGCTTCGGC GTCGCCGACC GGATCGACAA CAAGGTCTGG CGCATCTCGG GCGGGATCAC CGGCGACCTC GGCGACCTGG CGACCTTCGC CAAGAAGGTC GGCTACGACT TCGCCCTGAC CTATAACGAC GCCTACAACT ACAACACCCA TGCCGATACG ATCGGCTATC GCCTGGAAGA GGCGCTGAAC GGCTTCGGCG GCCCCAACTG CCATGCCGTC GACCTGGACC CGTCGCGCTT CGGAACCCAG AACGCCGCCG CCGCGGGCAA GAACGGCTGC ATGTGGTGGA ACCCCTTCTC CAGCTCGTTC AAGGGCCAGC CGGTCCGCGG CCTGGCCAAC CCCAACTACA TCGCCGGCCA CGAGAACCCG CAGGACCTGA GCCTGTGGAT GTTCGATCCG CGCGCCGTCG AGACCCGGAG CAACAACTTC ACCGCCGACC TGGTGTTCAA CGGCATGTCC GGCCTGACGC TGCCTGGCGG CGAAGTGGGC TGGGCCCTGG GCGCCCAGTA CCGGACGTTC AAGAGCCGCC AGACCGTCAC CAGCCTGTTC AACAACGGCA CGGTTCAGTG CGAATGGCCG CACGGCACCA CCAGCGCCAA CGGCGCGGGC TCGCCGAACC TGGAGGCCAA CCCTACGCCG ACGAATGACC CGAACTTCCG GGGCTGCACG CCCGACGCCC CGGGTCCGTT CGTCCTGTTC GCGCCCAGCA TTCCGGCCCA GGCCGACCAG AGCCAGTATT CGCTGTTTGG CGAACTGCAG GTGCCGGTGC TGTCGAACCT CAGCTTCCAG CTGGCCGCCC GCCGCGAGCG GTTCTCCAAC GATCTGGGCG CGACGGTCTA CAAGGTGTCG GGCAAGTGGA ACGTCTGGGG TCCGCTGACC TTGCGCGGGT CGTACGGCAC CAACTACCAG ACCCCGCCCC TGGGCGTGAC GCCCGGCGCC GTGACCATCG CCGCGCGCAC CTACACGGTG GCGGCCAGCA ACTGGCTGGC GGCTCAGTTC GTCACCGACG CCGACCTCAA GCCCGAGACC GCCAAGACCT CGAACCTCGG CGCCATCTGG CAAAGCCGGG GCTTGGCGGA TGATCACAAC TTCCGCCTGA TCATCGACTA TTTCGACATC CGGACGAAGG ACCAGATCGG CCAGGTCGCC GACCCCAACC AGATCGCCAG CCTGGTGTTC AACGGCGCGG GCGGCACGAT CACCACCTGC GACCCGGCCA AGCAGCCCCT GCTGGCCCGC ATCACCTTCA ACGCCGGCTG CGCGGTGGGG ATGAGCGGCG TCGGGACCTT CTCCGCCGTT TCCACGCGCT ACGGCAACGG GCCGGGCCAG ACGACCAAGG GCTTCGACAT CCAGGCCAAT TACGGCCTGC CGCTGGGTCC CGGCGATCTG GACGTCAACC TGACCGCCAC CCGGGTGACC GAGTTGCGCA CCGGGGCCAC CACCCTGGAC GGCGTCGTGA TCTCCACCGG CGACGACCGC CTGGGCACGC TGAACTTCGC GACCTTCGCG CAGGCGGCTC CGAAGTGGCG CGCCAACCTG GGCGTCAACT ATCGCCTGAA CCGCCAGAAC TTCCGCCTCG GCGTGAACTT CGTCTCGGCC GTCCAGGACG AGCGGGCCGG CGTCCAGTAC GGCGAGGACG GCGAGGACTG GGTCACCGCC GACTTCACCT ACCGGATCGA GCTGAACGGC GACATGGCCC TCACCGCCAC GGTCGCCAAC ATGTTCGATC GCGACCCGCC GCCGGCCCAG GAAGAGTTCG GCTACGATCC GTGGACCGGC AATCCGCTAG GCCGGACCTT CGAGATCGGC TTCAAGAAAT CGTTCTAA
|
Protein sequence | MKIGLTGVSS VALMIGVAGV ALPAWGQDAQ RPADTATQVE EVVVTGSNIR GSALDNALPV EVYSQQDLEK QGSPTALEFA KSLTISGPTT GESYYFGGPA LVGSVNYNLR GLGADKTLVL LNGRRMNQNT ANVPSMALAR TEILKDGAAV IYGADATGGV VNFITLDHFT GLQAQGQYKQ IKGSKGDYSV GVMAGIGEDR VNLLVSAEYE HRSRLGTLER DFTKPSLTPG AGYNPAPWST LTNLTGWLPR GALPAVPSAT DVGEWGAAVG GIVSDFTASS CAAVGGRPDN AFTCAYNYIP YYRLVENQDT YRLYAQLKAD ITDKMKFHAD ASYGRVTLPQ VMGSPAQPVT RGPALTTGAV NQFYVPITNP FAAEFAAANG IVGAQGFTPI TYRLFGHGGN PYYSGGDGFG VADRIDNKVW RISGGITGDL GDLATFAKKV GYDFALTYND AYNYNTHADT IGYRLEEALN GFGGPNCHAV DLDPSRFGTQ NAAAAGKNGC MWWNPFSSSF KGQPVRGLAN PNYIAGHENP QDLSLWMFDP RAVETRSNNF TADLVFNGMS GLTLPGGEVG WALGAQYRTF KSRQTVTSLF NNGTVQCEWP HGTTSANGAG SPNLEANPTP TNDPNFRGCT PDAPGPFVLF APSIPAQADQ SQYSLFGELQ VPVLSNLSFQ LAARRERFSN DLGATVYKVS GKWNVWGPLT LRGSYGTNYQ TPPLGVTPGA VTIAARTYTV AASNWLAAQF VTDADLKPET AKTSNLGAIW QSRGLADDHN FRLIIDYFDI RTKDQIGQVA DPNQIASLVF NGAGGTITTC DPAKQPLLAR ITFNAGCAVG MSGVGTFSAV STRYGNGPGQ TTKGFDIQAN YGLPLGPGDL DVNLTATRVT ELRTGATTLD GVVISTGDDR LGTLNFATFA QAAPKWRANL GVNYRLNRQN FRLGVNFVSA VQDERAGVQY GEDGEDWVTA DFTYRIELNG DMALTATVAN MFDRDPPPAQ EEFGYDPWTG NPLGRTFEIG FKKSF
|
| |