Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5256 |
Symbol | |
ID | 5897386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | - |
Start bp | 189990 |
End bp | 192185 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641555359 |
Product | TonB-dependent receptor |
Protein accession | YP_001676690 |
Protein GI | 167621905 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.650389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000349677 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAAAGA TGGATCTTCT ACGCCTGAGC AGCCTTGCGG CCCTAGCCGC CGCACTGGGA ACGGCCTCGG CCTCGGCCCA GGCCCGGTCG CCGTCCGAAA GCAACACGGT GGCCGATATC ATCGTGACCG CCCAGCGCCG CTCGGAGAAC CTACAGTCCG TGCCGGTCGC GGCCACCGCC TTTGGGGCCG AGGCGCTGGA AAAGAGCGGC GCGGTCAACA TCCTGGACGT CGCCGCCCAG ACCCCCGGGG TGACCATGAC CGAGTATAAT ATCGGTGAGC CGCAAGTCTA TGTGCGCGGG GTCGGCTCGC AAAGCGACAG CGCCAGCTCC GAGCCGTCGG TGACGGTGTC GGTCGACGAA GTGCCGATCA GTCGCGGGGG CGCGACGGGC GCAGCCTTCC TCGACACCCA GCGCGTGGAG GTGCTGCGCG GGCCGCAAGG CACGCTCTAC GGCCGCAACG CCTCGGCCGG GGCCATCAAC CTCTACACCA ACCGACCGAC CTTCGATTTC AGCGGTCGTG TCGAGGCCAG CGCCGGGCGG TTTGGGGGCT ACGGCGCCAA GGGCGTGGTC AATGCGCCGC TGTCGGAGGC AGTGGCCCTG CGCGTGGCTG CGCAGTATTC CGACAGCGAC GGCTATGCCC GCACCACCCC AAGCGGCGAG CGGCTGCAGG GCGGCGAGCG CTACGCCGCC CGGGTCCAGC TTCTGACCCG ACAGGGCGAC TGGGACGTTC TGGCTAGCCT GGACCATTCG TTCGATGATC TGGCGGGCGA CGCCCGCTAC GCCCTGGTCA CACCCTTGGC TCCCGCGCCG CTGGCGGCGA TCGTCAACAG CGCCCAGGCC GGGCGCGACG TCTGGACCAC CTACGGCCGG CCCGACGCCT ACCAGAAGCG CGCCAACACC GGCGGATTTC TGCGCCTGGA GCATACGGGG GAGGCCTTCA ACTTCGTCTC ACTGACCGCC TACCGCGACA ACGAGTACAG CTTCCTGGCC GACCTTTCGG GCCTGCCCGA CGCGACCTTC CCGTTCCAAC CCGACGACTA TGTCGACGAA GACTCCCACC AGTTCAGCCA GGAGTTGCGC CTCACCTCCA GCGACGCGGC CAGGATCAAA TGGGTCGGCG GCCTATTCTA TTTCCGCGAA AGCATCGACA GGGTCGAGCG GATCGTCACC GACAGCCGCG CACCCTTGCC CGCGGCCCTG TCGGGCGATG TCAGCATGGG CCAGGACGCG ACAGCCGAAA GCTATGCGGC GTTTGGCCAG GCGACGATCC CGTTCGCGCG GATCTGGGAG CTGACCCTGG GCGCGCGTTT GACCCACGAT AAGCGTGACG TCTTTCAGAG CCTGATCAAC AACCGACCCA GCGACACCAA CCTGGCTTTC CCGGTGTTTC CTGGTTCGCT CTATGCGGTT CCGGCCAAGG CCGACTTCAC CAAGCCGACC TGGCGCATCA ACCTCGCCGT CGAGCCGAGC CCGGGCAAGC ACTTCTACGC CAGCTACGAC CGTGGCTATA AGTCCGGCTC GTTCACCAGC CAGGCCCAGA ACGCCGGCCA AGCCACCACC TTGGTCAAGC CCGAACAGCT GGACAATTTC AACCTCGGGG CCAAGACCCA ATGGCTGTCC AACCGGCTGC GGCTGAACGC CGACGCCTTC TATCTCGACT ACAAGGACCT GCAGGTCTTC GAGTTCGGCA GTAGCCTCAA CTTCGTGGTC TCCAACGCCG ACGCCAAGGT CAAGGGCCTC GAACTCCAGG CCTTGGCGGC GGTCTCACAC GACATCACCG TGGGCGCTAA CTACGCCTAT CTCGACACCG AATTCACCAG CAATCCGGCC TATGCCGGGG CGACCTTGCC TTACAGGGGC AATGTTCTGC CGCGAGCGCC CAAGCAGCAG TATTCGGTTT ATGTCGAAAC CAACCACCAG ATCCTGGGCG GCGAGCTTAC GGCCCGCTTA GCCTACGATT GGCGCGATGA TTTCTACTAT AACCCCTCCA ACGACATCGC CAGCAAGCAG AAGGCCTATG GCGTCATCGG CGGCTACTTG TCGTTTGAAA CCGCCGACCA GTGGAAAGTC GCGCTCTCGG GGGAGAACTT CGGCAACGAG CGCTACTCGG TGCACAACAT CTCGTTCCAG AATATGGGGT TTCGGCTCTA CGCCCCGCCG CGAACCTGGA CGCTGATCTT GTCCAAGGCC TTCTGA
|
Protein sequence | MRKMDLLRLS SLAALAAALG TASASAQARS PSESNTVADI IVTAQRRSEN LQSVPVAATA FGAEALEKSG AVNILDVAAQ TPGVTMTEYN IGEPQVYVRG VGSQSDSASS EPSVTVSVDE VPISRGGATG AAFLDTQRVE VLRGPQGTLY GRNASAGAIN LYTNRPTFDF SGRVEASAGR FGGYGAKGVV NAPLSEAVAL RVAAQYSDSD GYARTTPSGE RLQGGERYAA RVQLLTRQGD WDVLASLDHS FDDLAGDARY ALVTPLAPAP LAAIVNSAQA GRDVWTTYGR PDAYQKRANT GGFLRLEHTG EAFNFVSLTA YRDNEYSFLA DLSGLPDATF PFQPDDYVDE DSHQFSQELR LTSSDAARIK WVGGLFYFRE SIDRVERIVT DSRAPLPAAL SGDVSMGQDA TAESYAAFGQ ATIPFARIWE LTLGARLTHD KRDVFQSLIN NRPSDTNLAF PVFPGSLYAV PAKADFTKPT WRINLAVEPS PGKHFYASYD RGYKSGSFTS QAQNAGQATT LVKPEQLDNF NLGAKTQWLS NRLRLNADAF YLDYKDLQVF EFGSSLNFVV SNADAKVKGL ELQALAAVSH DITVGANYAY LDTEFTSNPA YAGATLPYRG NVLPRAPKQQ YSVYVETNHQ ILGGELTARL AYDWRDDFYY NPSNDIASKQ KAYGVIGGYL SFETADQWKV ALSGENFGNE RYSVHNISFQ NMGFRLYAPP RTWTLILSKA F
|
| |