Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1120 |
Symbol | |
ID | 5898575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1186824 |
End bp | 1189652 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561602 |
Product | TonB-dependent receptor |
Protein accession | YP_001682748 |
Protein GI | 167645085 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00264761 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATGC ACCTGCTCCG GGCCAGCGCC CTGGCCGGCG CGGCGGGCCT GATGATCGCC GGACAGGCGT TGGCCCAGAC GACGCCAACG CCGGCTCCGG CCAGCCCAAC CGATGAGACC GCGGCCGTCG AGGCCCTGGT CGTCACCGGC TCGCGCATCC CCCGCATCGC CACCGAGGGT CCCGCCCCGG TGACCGTGAT CACCAGCGAC ACGATCAAGG CCGCCGGCTT CACCAGCGTG CCCGACGTGC TGCGCAGCCT GACCCAGAAC GGCGGCGAGA CCCAGACCCA GCAATCGTCC AGCGGCGCCG ACTGGACGCC CGGCGCCCAG CAGGTCGATC TGCGCGGTCT TGGCCCCAAC CACACCCTGG TGTTGGTCAA CGGCCGCCGT ATCGCCGACT TCCCCCTGCC CTTCAACGGC AAGAGCGCCT TCACCGACAC CTCCTCCATT CCCCTGGGCA TGATCGACCG GATCGAGGTG CTCAGCGGCA GCGCCTCGGC CGTCTACGGC TCGGACGCCA TCTCGGGCGT GGTCAACTTC AATCTCAAGA AGAAGGTCGA CGGCACGACG GTCGACATGA CCTTCGGCGG ACTGGAGCAT GGCGGCGCGG CCAGCCAGCG GCTCAATGTC TCGACCGGCT ATTCCAAGGG CGACTTCGAC CTGGTGATCG GCGGCGAGTT CGTCAATCAA AAGCCCCTCT GGGCCTATGA CCGCGACATC CAGGACTCGA CCAAGGACGC CCCCACCGCC GGCGCCCGGA TCGCCCGCCG CGACTTCCTG CGCATGGATC CCGCGGAGGA CGTCTATGTC GATCCCGGCA AGACCACCTG CGACAGCCTC AAGACTTTGA ATGGCGGCTC GGTCGAATAC GCCAGCCGTC CGCGCTGGGG CGCCTATGAC CCCGACACCG ACGACTACGG CCCGGGCTAC TACTGCGGCA GCTACAGCTC GATCGGCTAC GGCACGATCA TCAGCGAGCG CAAGAGCGCC AATGTCGTGG CCTCGCTGAA CTACGCCAAG ACCGACACCC TGGCCTTCTT CGCCGACATC TCGGCCGGCT ACAGCCGCAC CCGCCAGTTC CAGGACGTGC TGTCGTGGAA CTACCAGGAC GCCAACGGCA GCGAAGACGG GATCTTCTAC AACCAGTTCA CCGGCGCCCT GGATTTCTGG CAGCGCAACT TCACGCCCGA AGAGATGGGC GGCCTGCACA AGGGCTACAT CACCAACACC TCGCGCACCT TCAGCATCAC CCCGGGCGTC AAGGGCTCGC TGGGCGACGG CTGGGACTAT GAGGCCTTCT ACAATTTCAG CCAGTACAAG TCGTCGATCA GCTGGCCCAA GGTGGTCAAT TCCAAGGCCA CGGCCCTGTT CCTGGGTCCG CAACTGGGCG TGGACGCCGA CAGCGGCTAC GCGATCTTCA ACGCCGATCC GGCCCGGCTC TACAAGCCTC TGACCACGGC CGAATACGAC TCGATCACCG CCCGCACCAC CTACAAGCCG GTGGCCCGCC AGCAGGGCGT GTCGTTCCAG GTCAACAAGG CCGATCTGTT CACCCTGCCC GCCGGTCCGG TCGGCTTCGC CGCCGTCGCC GAATACGGCA AGCAGTCCTA CAAGCTGGGG CTCGATCCCC TGGCCACTCA GAACTACTAT TACCACCTGC GCGACGCCGA CGGTTCAGGC TCGCGCGATC ACTGGGGCGC CGGCTACGAG TTCCGCGCCC CGCTGCTGAA GAGCCTCGAG CTTTCGACCG CCGGCCGCTA CGACAGCTAC AAGTACGGCG GCAACACCAT CGACAAGTTC ACCTATAACG GCGGACTGGA GTGGCGGCCG GTCAAGTCGC TGCTGGTGCG CGGCGCCTAT GGCACCGGCT TCCGCGCCCC TGACCTTCAC TACGTGTTCG CCAAGGAGGG CATCAGCCAC CCGTCGGGCA CCGACTACTA CCGCTGCCGC ACCGAGGAGC CGGACGAGGA TATCGGCGAC TGCTCCTATG CCGACGAGGG CCTGGTCAAG GTTCGCCTGG GCAACGCCAA GCTGAAGCCC GAGACCTCGA AGTCGCTGAA CTACGGCGTC GTCTGGTCGC CGGTGCGCAA CTTCGACATC TCGGTCGACT ACTTCCGGGT TCAGCTGAAC AACGCCGTTC TGGACATGAG CCTGGACAGC ATTCTGCGCC AGGAAGGCGA CTGCCGCGTC GGCACGACCG CCGCCGGCAC TCCGGTCAGC ATCACCTCGC CGACCTGCGT CGACGCCCTG GCCCGCGTGG TCCGCAACCC GCTGACCGCG GCGATCGATC CCGGCGGCAT CTCGACCGTC ACGATCAACC CGATCAACGT CGCCACCGAG AAGACCAACG GCATTGATGT GGCCGCCCAC TATCGGCTGG CCACCGACGG ACTGGGAACC TTCGACTTCA GCCTGGCCCA TACCTGGGTC GACAAGCACA CCAGCCGGCA ATATCCGGGC GACCCGATCG AGAACCAGCT GGCCTATGAC AGCGGCTACG ACGTCCCGCG CACCAAGAGC AGCGCGGCCA TCAACTGGAA CAAGGACGCC CTGTCGATCG GCCTGCACGG CCAGCGGCTG GAGCGCCTGC CCAACTACGC CGAGGACGGC TGGATCAAGG CCACCTACCT GGTCAACGCC ACGATCCAAT ACGAGATCGA TCCGCGCACG CGGGTCAGCC TGGCGATCGA CAACCTGCTG GACAAGGCCC CGCCCCGCGA CCCGACCTAT TCGGGCTATC CGTACTACGA CACCTCGTGG TTCGACTCGA CGGGCCGCAG CTACTACCTG CAACTGACGC ACAAGTTCGG CGGCAATAGC GGGCTGTAG
|
Protein sequence | MKMHLLRASA LAGAAGLMIA GQALAQTTPT PAPASPTDET AAVEALVVTG SRIPRIATEG PAPVTVITSD TIKAAGFTSV PDVLRSLTQN GGETQTQQSS SGADWTPGAQ QVDLRGLGPN HTLVLVNGRR IADFPLPFNG KSAFTDTSSI PLGMIDRIEV LSGSASAVYG SDAISGVVNF NLKKKVDGTT VDMTFGGLEH GGAASQRLNV STGYSKGDFD LVIGGEFVNQ KPLWAYDRDI QDSTKDAPTA GARIARRDFL RMDPAEDVYV DPGKTTCDSL KTLNGGSVEY ASRPRWGAYD PDTDDYGPGY YCGSYSSIGY GTIISERKSA NVVASLNYAK TDTLAFFADI SAGYSRTRQF QDVLSWNYQD ANGSEDGIFY NQFTGALDFW QRNFTPEEMG GLHKGYITNT SRTFSITPGV KGSLGDGWDY EAFYNFSQYK SSISWPKVVN SKATALFLGP QLGVDADSGY AIFNADPARL YKPLTTAEYD SITARTTYKP VARQQGVSFQ VNKADLFTLP AGPVGFAAVA EYGKQSYKLG LDPLATQNYY YHLRDADGSG SRDHWGAGYE FRAPLLKSLE LSTAGRYDSY KYGGNTIDKF TYNGGLEWRP VKSLLVRGAY GTGFRAPDLH YVFAKEGISH PSGTDYYRCR TEEPDEDIGD CSYADEGLVK VRLGNAKLKP ETSKSLNYGV VWSPVRNFDI SVDYFRVQLN NAVLDMSLDS ILRQEGDCRV GTTAAGTPVS ITSPTCVDAL ARVVRNPLTA AIDPGGISTV TINPINVATE KTNGIDVAAH YRLATDGLGT FDFSLAHTWV DKHTSRQYPG DPIENQLAYD SGYDVPRTKS SAAINWNKDA LSIGLHGQRL ERLPNYAEDG WIKATYLVNA TIQYEIDPRT RVSLAIDNLL DKAPPRDPTY SGYPYYDTSW FDSTGRSYYL QLTHKFGGNS GL
|
| |