Gene Information |
Locus tag | Caul_3922 |
Symbol | |
ID | 5901384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4240152 |
End bp | 4242998 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564443 |
Product | TonB-dependent receptor |
Protein accession | YP_001685545 |
Protein GI | 167647882 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
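The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 4240152
END_BP = 4242998
REPORTED_GENE_LENGTH = 2847    # bp
REPORTED_PROTEIN_LENGTH = 948  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both comparisons agree with the reported 2847 bp and 948 aa.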
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.719329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0833572 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGATTGC AGATCACCCG CGGACGCCTT CTGGCGACGA CGATGATGGC CGGGGTCGCC ACCTTGGCCG CGTTCACCGC CAACGCCCAG ACCGCCCCGG CGCCCGCCCC TGCTCAGGCG GATAACGAAG TCGAAGCCGT CGTCGTCACC GGTTCGCTGC TGCGTCGCAC CGACACCGCC ACCCCGTCGC CGGTCACGGT TCAGACCACC GAGCAGCTGA AGGCCCAGGG CATCACCACG ATCGCCGACG CCATCCGCAG CCTGTCGGCC GACAACTCCG GTTCGATCCC CGCCGCCTTC GGCAACGGCT TCGCGGCCGG TTCGACCGGC GTGTCGCTGC GCGGCCTGTC GGTCAACTCG ACCCTGGTGA TGATCGACGG CCTGCGCAAC GCCAACTACC CGCTGGCCGA CGACGGCCAG AAGGCGTTCG TCGACCTCAA CTCGATCCCG TTCAACGCGG TCGAGCGGAT CGAGACCCTC AAGGACGGCG CCTCGTCGCT GTACGGCGCC GACGCCATCG GCGGCGTGGT CAACATCATC ATGAAGTCGA ACTACCAGGG CATGGGCGCC GACATCTCTT ACGGGTCCAG CCAGCACGGC GGCGGCGACC AGTACCGTTT CACCGGCGAC ATCGGCTACG GCGACCTGGA TACCGACAAG TACAACCTGT ATTTCGACGT CGAGTACCAG CTGGACAAGG CCATCCGCGG CGACCAGCGC GGCTTCCCGT ACAACACCAA CGACCTGACC TCGATCCCCG GCGGCCAGAA CAACAACCCG CAGACCGGCG GCAGCACCTT CTACGGCAAC GTCCTGCGGG CCAGCCTGGG CACGCCCGGC AACCTGGCCA CCGGCGTGGG CATCGACGGC GCCCTGTGGC AGCCGCTGCG GACTTGCGCC GCCAACGCCC CGCTGACCAC GCAGAAGGAT GAAGACGGCA ACGTGATGGG CGCCTACTGC GCCCAGAACG TCGCCAATTT CGGCGACATC CAGCCAAAGC AGTCGCGCGC CGCGGTCTAT GGCCGCTTCA CGATCAAGCC GACCGACCAT CTGGAAGCCT ATCTGAGCGG CAGCTTCGTG CAGAGCAAGA CCTCGGTCCG CGGCACCCCC GTGACCGTCT CGTCGAGCAC GCCCAACAAC CTGACCAACC TGGTCCTGCC GGTGTGGATC TGCGACACGG GCGTCAACTG CGCCACCGAC ACCACTGCGG TGAACCGTCG CCTGAACCCG AACAATCCGT TCGCGGCCGA CGGCGACTAC GCCCTGATCA AGTACCGCTT CAGCGACCTG CCGGCCTCGG CCCGCTACAA CAACCGCATG CTGCGCATGG TCGGCGGCGT GAAGGGCGAT GTGGCGGATT GGAACTACCA GGCTAACCTG GTGATCGCCC ATGACTCGCT GGAAAGCGTG CAGACCGGCT TCCTCAGCTA CAAGCAGCTG ATGAGCGACA TCCAGACGGG CGCCTACAAC TTCGTCGATC CGTCGAAGAA CAGCGCGGCT GTTCTCAAGG CCCTGTCGCC GGATCTGGCC AAGACCTCGA CCACCGACCT GGACTCGATC AACGTCCAGG CCTCGCGTCC GGTCTTCAGC CTGCCGGGCG GCGACGCTCA GTTGGGCCTG GGCGGCGAGT TCCGCTACGA AGCGACCAAC GACCCGGCCC TGAACCCGGT CAACGACGCC CAGGGCCTGG GCAACGCCCG CACCGAGGGC AACCGCACCG TCGCCTCGGC CTTCGCCGAG CTGGGCCTGC CGATCACCGA CAAGATCGAG GTCAACGTCT CGGGCCGCTA CGACCACTAT 
TCGGACTTCG GCGGCAACTT CTCGCCGAAG ATCGGCGTGA AGTTCACGCC CATCAAGACG GTCGCCCTGC GCGGCACCAT CTCAAAGGGC TTCCGGGCGC CGAGCTTCTC CGAAGCGGGC AACTCGGCCA GCCAGGGCTT CACGACGTTC TCGTTCAAGT CGGCCAAGTA CGCCGATTTC CGCGCCTCGC ACGGCAACAA CGAGTACACC AAGGACTACT CGCTCTCGTC GATCACCACG GCCAATCCGG ACCTGGATCC GGAAAAGTCG ACCAGCTACA CCCTCGGCGC GGTTTGGGCC CCGACCCGCA GCTTCAGCGT GTCGCTGGAC TACTACCACA TCAAGAAGAC CGACGTGATC GCGCAAGCCA GCGCCGGCGT GGCCCTGGCC GCCTACTACG CGGGCCAAGC CATCCCGGCT GGCTACGTCG TCACAGCCGA CGCGGCGGAC CCGCAGCATC CCGACGCCCT GGCGCGCGTG CTGTCGGTGG CCTCGCCCTA CATCAACGCC GACTCGCTGG TGACCAGCGG TCTGGACCTG AACGTCCAGG CCACGTTCTA CCTGCCGGCC GACGTGAAGT GGACCAGCAA CCTCGACGCC ACGACCCTGT TCGACTTCAA GTACACGCAG GACGGCACGA CCTATAACTA CGTCGGCAAG GAAGCGCCGT ACGTGCTGTC GTCGGGCGCC GGCACCCCGA AGAACCGCCT GTCGTGGACC AACACCTTCG AGCATGGCCC GATCCACGTG ACCGGCGTCA TGAGCTATGT CAGCGGCATG CGTGAAGTCG ACGACAGCTA CGACAGCGCC TGCCTCTATG GCGATGCGGC CTTCAACTGC CGCGTCAAAT CGTTCACCAC GGTCGACCTG ACCGGCGCCT ACGACGTGAC CCAAAAGGCC ACCGTCTACG CCGACGTCAT GAACCTGTTC GACACCGGCC CGATGTTCAA CCCGGCCAAC TACGCCGGCG TGAACTGGAA CCCGACCTAC TCGCAGTCGG GCATCGTGGG CCGCTACTTC CGCGTCGGGG TGCGCGTGAA GTACTAG
|
Protein sequence | MRLQITRGRL LATTMMAGVA TLAAFTANAQ TAPAPAPAQA DNEVEAVVVT GSLLRRTDTA TPSPVTVQTT EQLKAQGITT IADAIRSLSA DNSGSIPAAF GNGFAAGSTG VSLRGLSVNS TLVMIDGLRN ANYPLADDGQ KAFVDLNSIP FNAVERIETL KDGASSLYGA DAIGGVVNII MKSNYQGMGA DISYGSSQHG GGDQYRFTGD IGYGDLDTDK YNLYFDVEYQ LDKAIRGDQR GFPYNTNDLT SIPGGQNNNP QTGGSTFYGN VLRASLGTPG NLATGVGIDG ALWQPLRTCA ANAPLTTQKD EDGNVMGAYC AQNVANFGDI QPKQSRAAVY GRFTIKPTDH LEAYLSGSFV QSKTSVRGTP VTVSSSTPNN LTNLVLPVWI CDTGVNCATD TTAVNRRLNP NNPFAADGDY ALIKYRFSDL PASARYNNRM LRMVGGVKGD VADWNYQANL VIAHDSLESV QTGFLSYKQL MSDIQTGAYN FVDPSKNSAA VLKALSPDLA KTSTTDLDSI NVQASRPVFS LPGGDAQLGL GGEFRYEATN DPALNPVNDA QGLGNARTEG NRTVASAFAE LGLPITDKIE VNVSGRYDHY SDFGGNFSPK IGVKFTPIKT VALRGTISKG FRAPSFSEAG NSASQGFTTF SFKSAKYADF RASHGNNEYT KDYSLSSITT ANPDLDPEKS TSYTLGAVWA PTRSFSVSLD YYHIKKTDVI AQASAGVALA AYYAGQAIPA GYVVTADAAD PQHPDALARV LSVASPYINA DSLVTSGLDL NVQATFYLPA DVKWTSNLDA TTLFDFKYTQ DGTTYNYVGK EAPYVLSSGA GTPKNRLSWT NTFEHGPIHV TGVMSYVSGM REVDDSYDSA CLYGDAAFNC RVKSFTTVDL TGAYDVTQKA TVYADVMNLF DTGPMFNPAN YAGVNWNPTY SQSGIVGRYF RVGVRVKY
|
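The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized for GC content, and translated. It uses only the first 30 bases of the gene sequence as an excerpt. The helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons (the tables differ in permitted start codons, which this sketch ignores).

```python
# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence (an excerpt, not the full gene).
excerpt = clean_sequence("ATGAGATTGC AGATCACCCG CGGACGCCTT")
print(len(excerpt))          # 30
print(translate(excerpt))    # MRLQITRGRL
print(round(gc_fraction(excerpt), 3))
```

The translated excerpt matches the first ten residues of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the reported GC content.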