Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4091 |
Symbol | |
ID | 5901553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4440617 |
End bp | 4443376 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564611 |
Product | outer membrane insertion C-terminal signal |
Protein accession | YP_001685713 |
Protein GI | 167648050 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.798393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC AATTCATGCC GCGCCGGGCG CGATTGATGA GCGGCGCGGC CACCGGCCTG GCGCTGCTGC TGACGACCGC GGGCCTCGCC CACGCTCAGG ACAAGGCCGC TCCCGCCAAC GACACCGTCG AGGAAATCAC GGTCACCGGC ATCCGCGCGG GCATCGAGAA CGCCATCGCG CTGAAGAAGT CCTCCAGCTC GATCGTCGAG GCCGTGTCGG CCGAAGACAT CGGCAAGCTG CCCGACACCT CGATCGCCGA GTCCCTGGCC CGCCTGCCCG GCCTGACCGC CCAGCGCCTG GACGGTCGCG CCCAGTCGAT CTCGATCCGC GGCCTGGGCC CCGACTACAA CACCGTGCTG CTGAACGGCC GCGAGCAGGT CTCGACCGGC GACAACCGCG GCGTCGAGTT CGACCAGTAT CCTTCGGAAA TCCTCAGCGG CGTGCTGGTC TACAAGACCC CGGACGCGGC CCTGATCGGC CAAGGCCTGG CCGGCACCGC CGACCTGCAG ACCATCCGTC CGCTGAAGTA CGGCAAGAAG GTGCTGTCGG CCAACGCCCG CTACGAGTTC AACAGCGAAG ACAAGCTGAA CCCGGACGCC AAGGACAAGG GCCACCGCTT CAGCGCCACC TATGTCGACC AGTTCATGAA CGACACGCTG GGCGTCGCCA TCGCGGTCTC CGACATCTCG ACCCCGACCC AGAGCCGCCG CTTCAACGCC TGGGGCTATC CGACCACCGG CACGGGCGAC CTGGTCATCG GCGGCGCCAA GCCTTACGTC CAGTCCAACA ACCTCAAGCG CACCGGCGTG ATCGGCGTGC TGGAATACAG CCCGACCGAC AAGTTCCACA CCTCGCTCGA CCTCTACTAT TCGAAGTTCC GCGAGAAGCA GATCCTGCGC GGCATCGAGC TGCCGCTGTT CTGGAGCTCG GCCACGCTGC AGCCCGGCTC AACCGCCAAC GGCGGCCTGA TCACCAACGG CGTCTATTCC GGCGTGAAGG GGGTCATGCG CAACGACCTC AACACCCGCC ACACCACCCT GAAGTCAGCC GGCTGGAACA TCGCCTACGA CACCGACAAC GGCTGGACGC TGGCGGCCGA CCTCAGCCAC TCCGAAGCCA AGCGCACCGA CGTGATCTTC GAGTCCTACG CCGGCACGGG TCCGTCCGGC GTCGGCGCCA CCGACACCAT GACCTTCCAC ACGACGCCCG GCGCGGGCAC CACCTTCGGC TCGACGCTCG ACTACACCAA TTCGACCCTG TTCATGCTGA CCGACCCGCA GGGCTGGGGC GCGGGCGCGG CCGGCGGGGC CCTGACCCAG GCCGGCTTCT ACAACACCCC CTCGATCAAG GACGAACTGA ACGCCGTCCG TCTTAGCGCC AAGCGCGACC TGGCCTGGGG TCCGATCAAC AAGGTCGAGT TCGGCTACAA CGCCAGCCGC CGCGAGAAGT CCAAGCAGGT GCACGAAAGC TTCCTGACCT TCGGCGGCCG CATCGCCGAC GGCGCGCCTC AGAGCCGCGC CATCCCGAAG GAAGCCCTGC TGGGCACGGT CAGCCTCGAG CTGATCGGCA TCAAGGCCAT GCTGGCCTAT GACCCGACCT ACCTGCTCGA CAACGGCTAC TACACGCTGA TCGCCGACCA GAACCCGGCC GTTCAGACCC GCAACTGGGC GGTGAAGGAA GACGTCCAGA TCGCCTACGC CAAGTTCAAC ATCGACAGCA CGGTCGGTTC GATCCCCGTC ACCGGCAACA CCGGCCTGCA GGTTGTCCAT ACCGACCAGT CCTCGACCGG CACGCGCATC AACCCCGCCG ACACCGCCCA CCCGGCCAAC AACGACGGCG GCGCCAAGTA CACCTATGTG CTGCCCAGCC TGAACCTAAC GTTCGACCTG AGCAACGAGA CCTTCCTGCG GTTCGGCGCC GCCCGCACCC TGGCGCGGGC CCGGATGGAC GAACTGCGCG CCAGCCAGTC GTTCAACATG AACGCCGGCA ACCTCACCTC GACCGACCCG AACAACGCGT ACTTCAGCAC CGACGGCGGC AATCCGCAGT TGCGCCCCTA CATCGCCGAC GGCGTCGACG TCTCGCTCGA GAAGTACTTC GGCCGCTCGG CCTATATCTC GGCGGCCGGC TACTACAAGA AGATGTCGAA CTTCGTGAAC TCCAGCTCTT CGCACCTGGA GGACTTCTCG GCCTTCAAGC CGCTGCTCAG CCCGGCGCAA CAGGCGGCCC TAGGCACGAC CCAAGGCGTG GCCAAGGGTC CCGAGAACGG CAAGGGCGGC TACATCCGCG GGATCGAACT GTCGGCCTCG ATCCAGGGCG ACATCTTCTA CGAGCCCCTG CGCAACTTCG GCCTGATCAT CAGCGGCTCG TACACCGACA GCTCGGTCAA GCTGGACGAC AACTTGCCGG CGATCGACAT GCCCGGCCTG TCGAAGAAGG TGATCAACAC CACCTTCTAC TACGAGAACA ACGGCTTCAA CGCCCGGATC AGCAACCGCT ATCGCAGCAA ATTCCTGGGC GAAGTGGCCG GCCTCAGCGC CGCGCGGATC TATCGGACCG TCGACACCGA GTCGGTGCTC GACGCCCAGA TCGGCTACGA GTTCCGCCAG GGACCGCTCG AGGGCCTGTC GATCCTGCTG CAGGCCAACA ACATCACCGA CGAGCCGTTC AAGACCTACG AGAACGGCGA TCCCCGCCGG ACCATCGACT ACCAGAAGTA CGGCTCCACC TACATGGTCG GGGCGTCCTA CCGGTTCTAG
|
Protein sequence | MTNQFMPRRA RLMSGAATGL ALLLTTAGLA HAQDKAAPAN DTVEEITVTG IRAGIENAIA LKKSSSSIVE AVSAEDIGKL PDTSIAESLA RLPGLTAQRL DGRAQSISIR GLGPDYNTVL LNGREQVSTG DNRGVEFDQY PSEILSGVLV YKTPDAALIG QGLAGTADLQ TIRPLKYGKK VLSANARYEF NSEDKLNPDA KDKGHRFSAT YVDQFMNDTL GVAIAVSDIS TPTQSRRFNA WGYPTTGTGD LVIGGAKPYV QSNNLKRTGV IGVLEYSPTD KFHTSLDLYY SKFREKQILR GIELPLFWSS ATLQPGSTAN GGLITNGVYS GVKGVMRNDL NTRHTTLKSA GWNIAYDTDN GWTLAADLSH SEAKRTDVIF ESYAGTGPSG VGATDTMTFH TTPGAGTTFG STLDYTNSTL FMLTDPQGWG AGAAGGALTQ AGFYNTPSIK DELNAVRLSA KRDLAWGPIN KVEFGYNASR REKSKQVHES FLTFGGRIAD GAPQSRAIPK EALLGTVSLE LIGIKAMLAY DPTYLLDNGY YTLIADQNPA VQTRNWAVKE DVQIAYAKFN IDSTVGSIPV TGNTGLQVVH TDQSSTGTRI NPADTAHPAN NDGGAKYTYV LPSLNLTFDL SNETFLRFGA ARTLARARMD ELRASQSFNM NAGNLTSTDP NNAYFSTDGG NPQLRPYIAD GVDVSLEKYF GRSAYISAAG YYKKMSNFVN SSSSHLEDFS AFKPLLSPAQ QAALGTTQGV AKGPENGKGG YIRGIELSAS IQGDIFYEPL RNFGLIISGS YTDSSVKLDD NLPAIDMPGL SKKVINTTFY YENNGFNARI SNRYRSKFLG EVAGLSAARI YRTVDTESVL DAQIGYEFRQ GPLEGLSILL QANNITDEPF KTYENGDPRR TIDYQKYGST YMVGASYRF
|
| |