Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1859 |
Symbol | |
ID | 5899314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1991688 |
End bp | 1994354 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562349 |
Product | TonB-dependent receptor |
Protein accession | YP_001683486 |
Protein GI | 167645823 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCT CCAAGAATTC GTCGGCGCGC TGCGCCGGCG CCGTCGCCAT TCTCGCCTTC GCCGCCCTGG CCTCCAGCCA GGCCGCCCAA GCCGCCGAAC CCGCCGCCGC GCCGGTCGCC GAGCCCGCCG AAACGGCGGA CACCGTCGAC ACCATCGTCG TCACCTCCTA CGGCAAGAGC CTGCAGGCCG CCGCCGCCCT GAAGCGCTCG GCCGCCTACG GCCTGGACGC CATCAACGCC GAGGACATCG GCAAATTCCC GACCCGCAAC GCCGCCGAGG CCCTGCAACT GGTCACCGGC GTGACCATCG ACCGCCAGCG CGGCGCGGGC CTCTATGTCA GCGTCCGGGG CTTGGGTCCG CAGTTTCAGT ATGTCGAGCT GAACGGCCGC TCGATCGCGG TCAACGAGCT GATCGAGAAC GGCGGCGCCA AGGGCCGCCA GTTCCGGTTC GAGGTGCTGC CGGCCGAACT GATCTCGCAG ATCGAGGTGG TCAAGACCCC GACCGCCGAC ATGGACGAAG GCGCCCTGGG CGGCAACATC AACATCAAGA CCTTCAAGCC GCTGGACGTG GGCGACAAGG CAGCCTTCTC GATCCGCGAG ACCTATGGCG ACGTGCGTAA GAAGGCTGAT CCCTCGGCCT CGGGCCTGCT CAGCTGGACC AATGCCGACA ACACCTTGGG CCTGCTGGCC TCGGGCCTCT ACGACGAACG CCACGTGCGC AACGACCGCC TGTACCAGGT CGGCTGGAAC CTCAACAAGT TCACCTCGAC CCTGGGCGCG GGCCGCTACA CCCCCTCGCG GACCCGCCCG ACCATCGAAA CCGAACACCG CAAGCTCTAT TCGGGCGCGG TGACCGGCCA GTGGCGGCCC AATAGCGACT TCCAGACCGA TGTGGACGTG CTCGTGACCC GCCTGGACGT CGACTATGAC GAGTTCGGCC TCGACATCTA CCCCGACGAC ACCACCTTCC AGAAGCCGAT CTTCGTGGCC GGCAGCGAGA AGCTCATCGG CGACACCGTG GTCGGCGGCA CGATCAACAA CGTGCGCTGG ATGGCCTCGC GCGAAACCAG CCTCAACCGC CATGACCTGA CCGCCGTGGG CCTCAAGCAG GCCTGGACGC CGGGGGCGTG GACTTTCAAC GGCGAATACG CCTATTCCCG AGCGCGCAGC TATCACCCCG ACGGCGAAGC CACCAAGCGC AACCGCCTGG CGTTCTTTGG CCCGCTGACC TACGATTTCT CGCGCGGCTA CAAGTCGATC CCGCAACTGA CCACCACGGT GGACTACACC AACCCGGCCA ACTTCGTTGG TCAGGCCTTC GACTATACTT GGAAGGACTC GCGCGACACC GACGAGACCA TCCGTGCCGA CGGCGGGCGG GTGTTCGACG GCTGGTTCAG CAAGATCGCC TTCGGGGCCG AGCGCCACAA CCTGAAGCGC GACTACCGTC GCCGCGACTG GGTGCTCAAC AACGACCTGA ACGTGCCGAC CAGCACGCTC GGCTCTGCCT ACTACGAGCC CCTGCCCTAT TCGGGCTTCC TTAAGGACTT CGACGGCAAT ACCCCGCGCA ACTGGGTTAC GCCGACCCGC GACGCCTTCT ACAATGCGCT GTTCACGCCG ACCGTCGCCG GCCAGCCGAT CTCGGCGGCG GATGCGCGCA ATTCGTTCGT GGTCGACCAG CACATCACCA GCGCCTATGT GCGCGGCGAT TTCGCGTTCG CCACCGCCAT GCCGATCAGC GGCAATGTCG GCGTTCGCTA CGCTCGCACC GAACAGGTGG CCAGCGGCAC GTTGACCAGC ACCGACGCCA GCAACAATCC CGTGCTGACC CCGGTGTCCT ATAAGCAGGA CTATGGCAAC TGGCTGCCCA GCCTGAACGT GAAGATCGAA CTGCGGGACG ACCTGATCGG CCGCTTCGCC GCTTCGCGCG TGGTCAACCG GCCCAACGTC ACCGACAGCG CCCCGCGCAT CAGCGTGGCG CGGGACACGC CCAGCGCCTC GGGCGGCAAC CCGGACCTGA AGCCGTTCCT CGCCGACCAG TTGGACGCCT CGCTGGAATG GTACCCGGCC CCGACCACGG CCCTGACCGG GGCGGTGTTC TACAAGAAGA TGGACAACTA CATCACCCAG CAGAACACCA CGATCCAAGT CCCGGGTCGC GGCGACGTGC TGCTGTCGAC CAGCGTCAAC GGCGGCGACG CCAAGCTGAC CGGCGTCGAG GTGGCGTACA ACCAGAGCCT GGCCTTCCTG CCGGGTCCGT TCGACGGTCT GGGCGTCCAG GCCTCGGTCA CGCTCGTGGA CAGCAAGGCC AGCTACTTCG CCGGCAATCG CCAGATCAAG GACGATCTGA TCGGCCTGTC CAAGACCAGC TACAACCTGG TCGGCTACTA CGAGAAGGGT CCGCTGGCCG CGCGCCTGGG CTGGTTCTGG CGCTCGCGCT ACCTGTCGGG CACGGGCAGC ACCACCACAG CCGAGTCCTA CATCGACGCC TACGGCTCGC TGGACGGCTC GATCTCCTAC GACCTGACCA AGAACTACGC CCTGACGCTG GAAGGCTCGA ACCTCACGGA CGAGATCCGC TACGTCTACG GCAAGACCAA GGACCAGCCG ATGGAGACCT ATCACTGGGG CCGCACGGTC TCGCTGACCC TGCGCGGCAA GTTCTAA
|
Protein sequence | MIRSKNSSAR CAGAVAILAF AALASSQAAQ AAEPAAAPVA EPAETADTVD TIVVTSYGKS LQAAAALKRS AAYGLDAINA EDIGKFPTRN AAEALQLVTG VTIDRQRGAG LYVSVRGLGP QFQYVELNGR SIAVNELIEN GGAKGRQFRF EVLPAELISQ IEVVKTPTAD MDEGALGGNI NIKTFKPLDV GDKAAFSIRE TYGDVRKKAD PSASGLLSWT NADNTLGLLA SGLYDERHVR NDRLYQVGWN LNKFTSTLGA GRYTPSRTRP TIETEHRKLY SGAVTGQWRP NSDFQTDVDV LVTRLDVDYD EFGLDIYPDD TTFQKPIFVA GSEKLIGDTV VGGTINNVRW MASRETSLNR HDLTAVGLKQ AWTPGAWTFN GEYAYSRARS YHPDGEATKR NRLAFFGPLT YDFSRGYKSI PQLTTTVDYT NPANFVGQAF DYTWKDSRDT DETIRADGGR VFDGWFSKIA FGAERHNLKR DYRRRDWVLN NDLNVPTSTL GSAYYEPLPY SGFLKDFDGN TPRNWVTPTR DAFYNALFTP TVAGQPISAA DARNSFVVDQ HITSAYVRGD FAFATAMPIS GNVGVRYART EQVASGTLTS TDASNNPVLT PVSYKQDYGN WLPSLNVKIE LRDDLIGRFA ASRVVNRPNV TDSAPRISVA RDTPSASGGN PDLKPFLADQ LDASLEWYPA PTTALTGAVF YKKMDNYITQ QNTTIQVPGR GDVLLSTSVN GGDAKLTGVE VAYNQSLAFL PGPFDGLGVQ ASVTLVDSKA SYFAGNRQIK DDLIGLSKTS YNLVGYYEKG PLAARLGWFW RSRYLSGTGS TTTAESYIDA YGSLDGSISY DLTKNYALTL EGSNLTDEIR YVYGKTKDQP METYHWGRTV SLTLRGKF
|
| |