Gene Information |
Locus tag | Caul_1988 |
Symbol | |
ID | 5899443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2136193 |
End bp | 2139255 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562477 |
Product | TonB-dependent receptor |
Protein accession | YP_001683614 |
Protein GI | 167645951 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
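The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming exactly one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2136193, 2139255)
print(length)                  # 3063
print(protein_length(length))  # 1020
```

Both values agree with the Gene Length (3063 bp) and Protein Length (1020 aa) fields in this record.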
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGTCA TGGTTGCAGC GTTTCGTCGG GGCCTCGTCG GCCTTTCCCT TCTGGCCCTG ACCACCGCGA TCGCCGGCCC GGTCCAGGCC GCCCAGGCGC CCTCGGCGCT GGTGCCCGTC TCGGCGGTGC GAACGCCCAA GGGGCCGCTG AAGGCCGCCT TGCTCGCCCT GGGCCGCCAG ACCGGCGTGC AGATCATCTT CACCAGCCGG GCGGTCGAAG GCCGCCAGGC CCCGGCCCTG GACGGCCAAT TCAGCGTCGA CGAAGCGCTC GACCGCCTGC TGGCCGGCAG CGACCTGGAG GCCCAGCGCG TGGGCCCGAC GGTCCTGGTC GTCCGGCCGC GCGCCGTCCT CGTTCCCGCC TCGGCGACCT CGCCGACGGC GCCCGATCCG ATGGCCTCGC CGCCGACCGA TCCGATTGAG ACGGCCCCGC CCGTCCAGGC CGTCAACGAC GCCACCATGC TGTCGGAGGT CGTGGTCGGC AGCCACATCC GGGGCGCCCG TGACAGCGCC TCTCCGGTCG TCATCCTGGA CCGCGAGACT CTGGACCGCG CCGGCCGCAC GAGCGTCGCG GAGGCCATGT CCAACCTGCC GCAAGCATTC AACGGCTCGG GCAGCGAAGA CACCACCTCG ACCGGCGCCG ATCCTTTGGG CACCAATTCC AGCCGCGGCG TCGGGGTCAA TCTGCGCGGC TTGGGAACCG ACGCGACCCT GGTGCTGGTC AATGGCCGCC GCCTGGCAGG CACGGGATTG AAGGGCGATT TCGCCGACGT CTCGTCGATC CCGATGGCGG CGGTGGACAG GATCGAGGTG CTGCTCGACG GGGCCTCGGC GCTCTATGGC TCCGACGCGG TCGGCGGCGT GGTCAACATC GTGCTTCGCA AGCGCTACGA AGGCGCGGAA ACCCGGGCTC TGGTCGGCGG CGCGACGCGC GGCGGGGCCA CGCAGTGGCA GTTTGGCCAG ACGGTCGGCC ACGCCTGGGA CAGCGGAAAC CTGGTGGTCA GCTACGAGCA CAGCGCCCGC GACCGACTGC GCGGCCGCGA CCGCGACTTC ACCGGCAACG CCGACCTGCG CGACCAGGGC GGGACCGACC ACCGCCGCTA CTACAGTCAG CCCGGCAATA TCCTGCGCGC CAACGGCAGC GGGGTCCTGG TCCCGACCTA CGCCATTCCT GGCGGACAGA ACGGCGTGGG CCTTTCGCCA GCCAGCTTCG CGGCCGGCCA GACCAACCTG GAAAACCAGC AGCTGGCGTT CGACGTCCTG CCGCGCCAGC GCCGCGACAG CGTCTACCTC GCGTTCGCCC AGGACCTGAC CCCCGCCATC GAACTGTCGG CCGACGCCCG GGCCACCCGC CGTGACTTCA CCAGCCGGGG CGGCGCCTCG ATCACCACCC TGACGGTCAA CGCCAGCAAT CCCTATTTCG CCTCTCCGAC CGGGGCGGCG TCCGAGCGCA TCGCCTATTC GTTCCTCAAC GAGCTGGGCG GCCAACGGGT CAAGGGCGTG GCCGACACCC TGGGCCTGTC GCTTGGCGGG ACGGCGCGCC TGCCGGCCGG CTGGCGGCTG GAGACCTACG GGGCCTATGG TCTTGAGACG ATGCACTCCC TGACCAACAA CCTGGTCAAT TCGTCCGCCC TGGCGGAGGC CCTCGGCGCC ACGCCCGACA ATCCCGCCAC CGGTTTCAGC ACCGCCGCGT CCGGCTATTT CAATCCGTTC ATCGGCGCGG GCTCCAATCC ACGCTCGATC CTGGATTTCA TCAACACGGC CTTCGTCGAT CGCAAGACCC GCAGCGACAC ACGCTCGATC 
AACCTGAAGC TCGATGGGAC CTTGTGGTCC CTGCCCGCCG GACCGGTGGG ACTGGCGGTC GGCGGCCAGA TCCGCCGCGA GGGGCTCAAG AGCGGCGGGC AGAGCCTGGC CTCGGGCGTT TCGCCGATCC CGATCGCCCG AAAGGACACC GACCGCACGG TCGACGCGGC CTTCGCGGAG GTCCGACTGC CATTGTTTGG CGGAGCGTTC ACGCGGCCGG GCCTGCGGCG TCTGGAACTG TCGGCGGCGG TTCGTCACGA GGACTATGGC GGCGCGCTCA AGAGCACGGA CCCCAAGCTT GGAGTGATCT GGTCGCCGGT GGCGGGCGCC ACCCTCAAGG CCTCCTACGG CACGTCGTTT CGCGCGCCAG CCCTCACCGA GCTGAACGAT CCGCAGATCT TCGCCCCGAC CACGATCAAC ACCAGCGGTC GCGAGACCAT CGTGATGATC CTCTATGGCG GCAATCCCAA TCTGAAGCCC GAGACGGCGA CCTCCAAGAC CCTGACCCTG GAGCTCGCGC CGCCCGACTG GTCGCGATTC AAGGCCTCGC TGACCCTGTT CGACACCCGG TTCACCGACC GGATCGGCCA GCCGGGCAAC GAATATATCG ACAGGGTCCT GACCTCGGCG GAGTTCGCGC CGTTCGTCAC CCTGGTGTCG CCCGCGACCA ACACCGCGGA CCGCGCGCGC ATCCAGGCCC TGATCGACGA CCCGCGCTCC TACGCCCAGG GGGTCTTTCC GGCGGAAGCC TATGGCGCGA TCGTCGACGG CCGTTACGTC AACACGGGTC AGCTTCGGGT GCGCGGTCTG GACGTCTCGG CCCAGTATCA GGCCAGGCTC GGCGGCGATC CTCTGGTGCT GTCGGCGGAC CTGTCCTGGA TGATGGACTA CAGCCGCAAG ATCACGCCCG GCACACCCAG CGTGGATCGC GCCGGCTTCG TGGGCGAGCC CGCCGACCTG CGCGCGCGCT ATGCGGCCAG CTGGACGCAC GGGTCCCTGA CGACCACGGC TTCGATCAGT CAGGTGGGGG ATCTATCGAC CGACGGCGGC GGCCGCATCA AGGGCTGGAC CACCGCCGAT CTCAACCTGA GCTACCGTTT TGGCAACGGC AGGCTGGAGG GCTCGGGCCT GTCCCTGAAT ATGCAGAACC TCTTCGACAG CGATCCGCCC TTCTACGACT CGCCGCTCGG GGTGGGCTAT GACCCGGCCA ACGCCGACCC GCTGGGTCGC GTCGTGACGC TGCAGCTGAC CCGGACCTGG TAG
Protein sequence | MVVMVAAFRR GLVGLSLLAL TTAIAGPVQA AQAPSALVPV SAVRTPKGPL KAALLALGRQ TGVQIIFTSR AVEGRQAPAL DGQFSVDEAL DRLLAGSDLE AQRVGPTVLV VRPRAVLVPA SATSPTAPDP MASPPTDPIE TAPPVQAVND ATMLSEVVVG SHIRGARDSA SPVVILDRET LDRAGRTSVA EAMSNLPQAF NGSGSEDTTS TGADPLGTNS SRGVGVNLRG LGTDATLVLV NGRRLAGTGL KGDFADVSSI PMAAVDRIEV LLDGASALYG SDAVGGVVNI VLRKRYEGAE TRALVGGATR GGATQWQFGQ TVGHAWDSGN LVVSYEHSAR DRLRGRDRDF TGNADLRDQG GTDHRRYYSQ PGNILRANGS GVLVPTYAIP GGQNGVGLSP ASFAAGQTNL ENQQLAFDVL PRQRRDSVYL AFAQDLTPAI ELSADARATR RDFTSRGGAS ITTLTVNASN PYFASPTGAA SERIAYSFLN ELGGQRVKGV ADTLGLSLGG TARLPAGWRL ETYGAYGLET MHSLTNNLVN SSALAEALGA TPDNPATGFS TAASGYFNPF IGAGSNPRSI LDFINTAFVD RKTRSDTRSI NLKLDGTLWS LPAGPVGLAV GGQIRREGLK SGGQSLASGV SPIPIARKDT DRTVDAAFAE VRLPLFGGAF TRPGLRRLEL SAAVRHEDYG GALKSTDPKL GVIWSPVAGA TLKASYGTSF RAPALTELND PQIFAPTTIN TSGRETIVMI LYGGNPNLKP ETATSKTLTL ELAPPDWSRF KASLTLFDTR FTDRIGQPGN EYIDRVLTSA EFAPFVTLVS PATNTADRAR IQALIDDPRS YAQGVFPAEA YGAIVDGRYV NTGQLRVRGL DVSAQYQARL GGDPLVLSAD LSWMMDYSRK ITPGTPSVDR AGFVGEPADL RARYAASWTH GSLTTTASIS QVGDLSTDGG GRIKGWTTAD LNLSYRFGNG RLEGSGLSLN MQNLFDSDPP FYDSPLGVGY DPANADPLGR VVTLQLTRTW
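The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so the protein sequence can be reproduced by translating it codon by codon. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard compact encoding, whose amino-acid assignments are shared by translation table 11. Alternative start codons, which table 11 permits, are not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTCGTCA TGGTTGCAGC GTTTCGTCGG"))  # MVVMVAAFRR
# Last 21 bases (in frame), ending with the TAG stop codon.
print(translate("CAGCTGAC CCGGACCTGG TAG"))           # QLTRTW
```

The two fragments match the first ten and last six residues of the protein sequence given above.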