Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2137 |
Symbol | |
ID | 5899592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2308396 |
End bp | 2311437 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562626 |
Product | TonB-dependent receptor |
Protein accession | YP_001683763 |
Protein GI | 167646100 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.255977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.143212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCAAT TCGCACACGC GCGGCGCGGC ATGTTGCTGG CGGCCGTTTC GGGACTGGCG ATCGCCGGGC CAAGCTTTGC CCAGGCTCAG GACGCGGTGG CGTCGGGCCA GACCACCGAG AGCGCCTCGA CGGTCGACGA GATCATCGTG ACCGGCATCC GTGCTTCGCA ACAGCGCGCG GTGAGCATCA AGCGCGAGGC CGCCTCGGTC GTCGACGCCA TCTCGGCCGA AGACATCGGC AAGCTGCCCG ACAACACCAT CTCCGACTCG CTGCAACGGA TCCCGGGCGT TCAGATCCTG CGCAGCGCCG GCGAGGGCTC GACCGTCAAT ATTCGCGGTC TGCCGCAGGT CTCCACGCTG CTGAACGGCG AAGTCTATCT GGGCGCCCAG TCGATCACGA CCGTCCAGCC CAACTTCAAC GACATCCCCT CGCAGCTGTT TTCGGGTGCG GACGTCATCA AATCGACGAC CGGCGACCAA CTGAACGCCG GGATCAGCGG TACGATCAAC CTGCGCACGC GCCGGCCCAT GGACCTGAAG GAGGGCCTAA CCCTCGCGGC CGCGGCCGAA GGCTCCTACG GCGACAAGAC CCAGAAGTTC GATCCGAACG TCAACGGCCT GATCTCCTTT CACAACGACC GGTTCGGCGC CCTGTTGTCG GCCGCCTATT CCGACGTGCG CCTGTCGAAC TCGCACAACG GCATCCAGGA AACCTACGGC GCGACACTGC ATAACGAGAG CACGGCCGAC GCGACCTCCA GCGGCGGCTT CTCGCCGACC AAACGCCCGC ACGGCACGCC GGTGGCCGGC GGCATCGACG TCAACGGCGA CGGCGACGCC AACGACGCCT TCATTGTTCC CCAGGGCTTC ACCGGCTGGA ACAAGATCAA CCAGCGCGAA CGCCTGGGCG TCAACGCCTC GGCCCAGTGG AAGATCAACG ACGCCCTGGA GCTGACCGGC GACGCCTTCT TCACCAAGCA GGACGAGCAC GACCGCACGG CCGGCTTCCA GATGCAGGAC GTCAACTGGC AGGCCGCGGA ATTCACGCCG GGCCAGTCGC GTGACACCGG GACGATCGTC AACAGCTATC ACTTCAACAC CACCCAGATT TACAACTACG ACCTGGGCAA CTTCGACTCC TACGCCCAGA CCGACCGCTA TCAATCCAAG TCGCAGAACT ACAATCTGGA ACTGAAGTAC GATAACGGCG GCAAGTTCAC CGGGTCGGTG CGCGGCATCT ACGGGAAGGC CCACCAGGAC TACGATCAGA GCTATCTGCA GTTCAGCCTG TCGAACGGCG CCCAATGGCA GCCGGGCGGC GTGGGCCACT ATCCCGCTTC GCTGGGCGGC GATCGGCCGT TCAACACCGG CGGCTATGCG GTCAACACCA TCGCCGGCGC GGCCTCCTTG CCCGCCAAGG TGGACTATAC CGGCAACAAG CCGGTCTTCA CCCTGCCCAG CCAATTGCTG ACTGAACTGG GCGACATCAA CAGTTATGCG CTCAAGACCA TTTCCTCGGA AGGCAACTAC CGCCGCGAAG GCGACCTGAA GGTCATCCGA GCCGACGGCA AGTACGAGTT CAACGACAGC TTCAAGCTGT CGGCCGGCGC GCGCTATTCC GAGCGCTCGG TCGACGACTT CGAGTTCGAT CGCGCCGCCC CGCTCTACGG CAGCGCCGCA TCGAACGGCA CGGGCTGCCT GGTCAAGTGG AAGGCCTTCG ACGTCCCGCT CAGCGACAGC AGCTGCAGCG CCGGCAACGC CGCGGGCTTC TACACCGCCG GTCTGACCCG CAAGGCCAAT GACCCGACCC TGAACGGTGA AGTCAAGCTG TTCAACCCCG GCGTCGCGGG CGTGCCGTCG ATGTACGTGC TCGACCCGAA GGCCATGGAC CACGCCCTGG CGTTCCAGAA CCGCTTCTAT CCGGGCAATG TCGAGATCAT GAACCCGGGC GCCTCGTTCA ATGTCGGCGT CAAGCAGACC TCGGCCTATC TGCAAGCCGA CTTCAAGGGT GAAGTCTTCG GCCTGGGCTT CACCGGCAAC GCCGGCGTCA AGGTCATCCA GACCAAGCTC GACATCACCC AGTACGTCAC CGGCAGCCCG CGCCCCTACG GCGTGGCCAA CCTGCTGGCC GGCAGCGTCG AGACCAACCG CAAGTTCACC GACGTCCTGC CGGCGATGAA CGTCGCCTTC GATGTCGCCG AGAACGTCAA GCTGCGCTTC GCCGCTTCCG AGACCATGAC GCTGCTGGAT CTGAACCAGT GGGGCGGCGG TCTGAACCCG ACCTACGCCA TCGACACCAC CAATCCCGGT TCGCCGGTGT TCCGCGTCAC CGGCGGCAGC CAGAACGGCA ACCCCGCGCT CGATCCCTGG CGCGCCAAGA ACTTCGAAGG CTCGCTGGAG TATTATCTCG GCAGCGCCAG CATGCTGAGC GTCGGCGCCT TTTACATGAA GGTCGACAGC TTCATTCAGA ACGGCTCGAT CGTCCGCACC GACCTGCCCG ACAACGACGG GGTGGTGCGC AACCGCACCG TCTCGATCAG CACCCAGGTG CAAGGCGACG GCGGTACGCT GAAGGGTCTG GAAGCCGGCG CCAAGCTGGC CTTCAACGAC CTGTCGTTCA TGCCCGCGAT GCTGTCGAAC TTCGGCGTCG ACACCAACTT CACCTACGCG CCGTCGAAGT CGGGCAAGAA AGATCTGGCC GGGGCCTCGA TCCCCTTCCA GGACAACTCG AAGTACCAGG CCAACCTCGC GGCCTACTAT CAGGACGACA GGCTGCAGGC CCGGATCGCC TGGAACTACC GCTCCCGCCG CGCCGTGTCT CAAGACTTCG GCGGAACCAC GGGACTGGAA ATGTACCAGG CCTCGACCAA CTATCTCGAC GCCTCGGTCA GCTACGACGT CAAGCCGAAC CTGACCGTCT ACGTCCAGGG CACCAACCTG ACCAGCGAGT ACGAGAAGTA CTACCTCACC TGGAAGGACG AGCACGCCTA CAACAACGTG TTCGAGGCCC GCTACGTGGC TGGCGTCCGC TTCAAGTATT GA
|
Protein sequence | MNQFAHARRG MLLAAVSGLA IAGPSFAQAQ DAVASGQTTE SASTVDEIIV TGIRASQQRA VSIKREAASV VDAISAEDIG KLPDNTISDS LQRIPGVQIL RSAGEGSTVN IRGLPQVSTL LNGEVYLGAQ SITTVQPNFN DIPSQLFSGA DVIKSTTGDQ LNAGISGTIN LRTRRPMDLK EGLTLAAAAE GSYGDKTQKF DPNVNGLISF HNDRFGALLS AAYSDVRLSN SHNGIQETYG ATLHNESTAD ATSSGGFSPT KRPHGTPVAG GIDVNGDGDA NDAFIVPQGF TGWNKINQRE RLGVNASAQW KINDALELTG DAFFTKQDEH DRTAGFQMQD VNWQAAEFTP GQSRDTGTIV NSYHFNTTQI YNYDLGNFDS YAQTDRYQSK SQNYNLELKY DNGGKFTGSV RGIYGKAHQD YDQSYLQFSL SNGAQWQPGG VGHYPASLGG DRPFNTGGYA VNTIAGAASL PAKVDYTGNK PVFTLPSQLL TELGDINSYA LKTISSEGNY RREGDLKVIR ADGKYEFNDS FKLSAGARYS ERSVDDFEFD RAAPLYGSAA SNGTGCLVKW KAFDVPLSDS SCSAGNAAGF YTAGLTRKAN DPTLNGEVKL FNPGVAGVPS MYVLDPKAMD HALAFQNRFY PGNVEIMNPG ASFNVGVKQT SAYLQADFKG EVFGLGFTGN AGVKVIQTKL DITQYVTGSP RPYGVANLLA GSVETNRKFT DVLPAMNVAF DVAENVKLRF AASETMTLLD LNQWGGGLNP TYAIDTTNPG SPVFRVTGGS QNGNPALDPW RAKNFEGSLE YYLGSASMLS VGAFYMKVDS FIQNGSIVRT DLPDNDGVVR NRTVSISTQV QGDGGTLKGL EAGAKLAFND LSFMPAMLSN FGVDTNFTYA PSKSGKKDLA GASIPFQDNS KYQANLAAYY QDDRLQARIA WNYRSRRAVS QDFGGTTGLE MYQASTNYLD ASVSYDVKPN LTVYVQGTNL TSEYEKYYLT WKDEHAYNNV FEARYVAGVR FKY
|
| |