Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2063 |
Symbol | |
ID | 5899518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2201013 |
End bp | 2203850 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562552 |
Product | TonB-dependent receptor |
Protein accession | YP_001683689 |
Protein GI | 167646026 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.154815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGGC GCAACACGGG CGCGATCCGT CCACAATCCA ACCGACTCTA TGTCACCGCT TCGGCCATCG CGATCACGGT GGCGATGGCC AGCCCCGGCG TCGTCTGGGC CCAAGCCACG ACGCCGCCGC CGTCCGACAC CACGGTCGAC GAACTGGTGG TCACCGGCAT TCGCGCCGGC ATCCAGAACT CGATCAACAT CAAGAAGAAC GAGACCTCGA TCGTGGAGGC GGTGTCGGCT GAAGACATCG GCAAGCTGCC GGACGTCTCG ATCGCCGAAT CGATCGCGCG CCTCCCGGGC CTCGCCGCTC AGCGCGTGAA CGGCCGCGCC CAGGTGATCT CGATCCGGGG CCTGGCGCCC GACTTCACCA CCACCTTGCT CAACGGCCGC CAGCAGGCCA GCTCGGGCGA CAACCGCGCC GTCGAGTTCG ACCAGTATCC GTCCGAGCTG CTGTCCAGCG TGGTGATCTA CAAGACGCCC GACGCCCAGG TCTCGGGCAT GGGCCTGTCG GGCACCGCCG ACCTGCGCAC CGTTCGGCCC CTGGCGTTCG GCAAGCGCGC CGTGGCCGTC AATATTCGCG GCGAGAAGAC CGAAGGCCGC AAGCTCAACG ACGACGCCAG CAACTGGGGC GGCCGGTTCA GCGCCAGCTA TATCAACCAG TTCGCCGACG GCACGTTCGG CGTGGCGCTC GGCTACGCCC ACCTGGATTC GCCGTCGCAG GTGAAGCACT ACAAGGCCTA CGGCTATGAG GCCTTCGACT TCGCCGTCAC TCCCGACAGC GCCGACAACG CGCTCATGCT CAATGGCCAG GAGGTGTTCG CCACCTCGCG CAACAACAAG CGCGACGCCT TCATCGGCAT TTTCGAATAT CAGCCCAACG ACAGCATCCA CTCGACCCTG GATCTCTACT ACTCCAAGTT CGACCAGCAG GAGACGACGC GCGGGGCGCA ATGGTTCTCG AACGGTTGGG CCGACAACGC CACCTTCACC GGGGTCACGA CTCGCGACGT GGGCGGCTCG GCGATGGCCG CCAGCGGCGT CCTGCACAAC GGCGTGCCGA TCCTTCGCAA CGACTACAAC ACCCGCAAGG ACGAGCTGTT CTCGGCCGGC CTGAACAACG AATTCAGGCT GGGCGAGCGC ACCAAGCTGT TCGCCGACCT GTCCTATTCG TCGAACAATC GTCGCGAACA GATCATGGAG ACCTATGCGG GCTACGGTCT GGGCGTCGGG GGCGTGACGC CCGCCACGTC GGATGTCGGG CGCACCTTCG ACACCATCGG CTTCAAGGTC GCCGACGACG ACTTCTCGCA ATATGACGAG GGCCTGAACT ACGCCGACGC CAGCAAGGTG TCACTGGGCG ACCGCGCGCC GTGGGGCGGC TGGGGTCACG ACGGCGCGAT CCGCTACCCG CACGTCAAGG AAGAGGTCTC GGCGATCGAC CTGCGCCTGG AGCACGAGTT CGACGGGTTC ATCACCAGCG TCGACTTGGG CGCTAACGCC ACCCACCGCG AGAAGACCAA GACCGTCTCG GACAACGACT TGTTCCTGAA GAACGGCCGC CAGCAGATCC TCGTCGATGC GGCCGACCTT GAAAAGCCGA CTTCGCTGGG CTTCGCCGGC TTCGGCTCGG TGCTCAGCGT CGACATCGGC GACGTCTGGC AGAAGTACTA CAACTCAGCG CCGATCCTCG ACGCCAACTA CTTCGACAAA AACTGGGAGA TCACCGAGGA CGTCCAGACC TTCTTCGCCA AGGCCAATTT CCGGACCGGC GATCTGCGCG GCAACGTCGG CGTCCAGGTC GTGCACCAGT CGCAGGAATC GAGTGGCGTG GTGATCCTGG GCCAGCCGGT CGTGCCAACC CAGATCAACG CCAAGGAAGA CTACACCGAC GTCCTGCCCA GCCTGAACAT GATCTACGAT CTGGGGCATG GGCAAAGGCT GCGCTTCGCG TTGTCGAAGA CGATGGCCCG ACCCCGGATG GACGAGATGC GGGCCAACCT CACGCCGGGA TTCAACTCGC TGGTCTGCAG CGGCCAGCCC TGCGCGCCGG GCACTGTGGT CAATCCGTGG TCGGCCAGCG GCGGCAACCC GCACCTGCGG CCGTGGGAGG CCAAGGCCGC CGACGTCGCC TACGAGTGGT ACATCGGCTC CGCGACCTAC GTGTCGGTCG CCGGCTTCTA CAAGAAGCTC GACACCTACA TCTATAACCA GACTAGAACC TTCGACTTCA CCGGCATCCC GCTGCCGTCA TCGGCTTCGG CGATCCCGCC GGGCGTGATC ATCAGTAATA TCGGCCAGAT CACCCAGCCG GCCAACGGCA AGGGCGGCGT GGTCAAAGGG CTTGAGTTCA GCGGCGCGCT CGATCTTGGC AAGGTTCTTG AGGCGCTGTC AGGCTTCGGC GTGCAGGGCA GCCTGTCGCT GACCAAGTCC AATCTCAACC CGACTCCCGA TACGACCCAG AAGGTCCGTA TCGCCGGCCT ATCGGGCACG GTCTACAACC TGACGGGCTA TTACGAGAAG GGGGGCTTCC AGGCGCGGAT CAGCCAGCGC TATCGCTCGG GCTTCAAGGG CGACGTGGTG CAACTGTTCG CCACCCGGGG GGCGACCGAG ATCCTCGCCG ACAAGCAGGT CGACGCGCAG ATCGGTTACA CCTTCCAAGA AGGCCGGCTG GAAGGCCTGG GCTTCCTGCT GCAGGTCAAC AACCTGACCA ACTCGCCGTA CCGCACGCGT CTGGGGCTAG ACGGGGGCGG CACCAAGACG GCTAGCGGGG ATTCCCTGCC CGAGACCTAT GAGGAATACG GCCGCCAGTT CCTGTTCGGG GTGAACTACC GGTTCTAA
|
Protein sequence | MRRRNTGAIR PQSNRLYVTA SAIAITVAMA SPGVVWAQAT TPPPSDTTVD ELVVTGIRAG IQNSINIKKN ETSIVEAVSA EDIGKLPDVS IAESIARLPG LAAQRVNGRA QVISIRGLAP DFTTTLLNGR QQASSGDNRA VEFDQYPSEL LSSVVIYKTP DAQVSGMGLS GTADLRTVRP LAFGKRAVAV NIRGEKTEGR KLNDDASNWG GRFSASYINQ FADGTFGVAL GYAHLDSPSQ VKHYKAYGYE AFDFAVTPDS ADNALMLNGQ EVFATSRNNK RDAFIGIFEY QPNDSIHSTL DLYYSKFDQQ ETTRGAQWFS NGWADNATFT GVTTRDVGGS AMAASGVLHN GVPILRNDYN TRKDELFSAG LNNEFRLGER TKLFADLSYS SNNRREQIME TYAGYGLGVG GVTPATSDVG RTFDTIGFKV ADDDFSQYDE GLNYADASKV SLGDRAPWGG WGHDGAIRYP HVKEEVSAID LRLEHEFDGF ITSVDLGANA THREKTKTVS DNDLFLKNGR QQILVDAADL EKPTSLGFAG FGSVLSVDIG DVWQKYYNSA PILDANYFDK NWEITEDVQT FFAKANFRTG DLRGNVGVQV VHQSQESSGV VILGQPVVPT QINAKEDYTD VLPSLNMIYD LGHGQRLRFA LSKTMARPRM DEMRANLTPG FNSLVCSGQP CAPGTVVNPW SASGGNPHLR PWEAKAADVA YEWYIGSATY VSVAGFYKKL DTYIYNQTRT FDFTGIPLPS SASAIPPGVI ISNIGQITQP ANGKGGVVKG LEFSGALDLG KVLEALSGFG VQGSLSLTKS NLNPTPDTTQ KVRIAGLSGT VYNLTGYYEK GGFQARISQR YRSGFKGDVV QLFATRGATE ILADKQVDAQ IGYTFQEGRL EGLGFLLQVN NLTNSPYRTR LGLDGGGTKT ASGDSLPETY EEYGRQFLFG VNYRF
|
| |