Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1289 |
Symbol | |
ID | 5898744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1355602 |
End bp | 1358538 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641561774 |
Product | TonB-dependent receptor |
Protein accession | YP_001682917 |
Protein GI | 167645254 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.384181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0181603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTTC ATAATTCTCG TCGCAACGCC ATGCTCATGG GCGCGTCAGC GCTCGTCATC TGCGGTGCTC TGCCCGCGAT CGCTCAGGCG CAAACAACCG CGCCGGCGTC GTCGGCCGAC ACCGTCGAGG AAGTCGTCGT CACCGGCCAG CGCGCCGCCC TGCAATCGGC CCAGAAGCTG AAGCAGAACG CCGAGCAACT GGTCGATTCG ATAACCGCCA CCGATATCGG CGCCTTGCCG GACCGCAGCG TCACCGAAGC CCTTCAGCGT GTCGCGGGGG TCACCATCGG CCGCACGGCC GACGGCCGCG ACGCCGACCG CATTTCGGTC GAAGGCAGCG GCGTCCAGGT TCGCGGTCTC AGTTGGGTCC GCGGCGAAGT GAACGGCCGC GACAGCTTCT CGGCCAAGGC CGGCCGCACC CTGAGCTTCG AGGACGTTCC GCCCGAACTG ATGTCCGGCG TCGACGTTTA CAAGAACCCT TCGGCCGACA TCATCGAAGG CGGCGTCGGC GGCACCGTCA ACCTGCGCAC CCGCCTGCCG TTCGACAGCG GCAAGCGCAA TCTGGCCTAT TCGGCCGACT CCAGCTGGGG CGACCTGTCC AGGAAATGGG AGCCCAGCGG CTCGGTTCTC TACAGCAACC GGTGGGACAC CAAGATCGGC GAGCTCGGCT TCCTGATCGA CCTGTCAGAC TCCAAGCTGA GCAGCCGCAC CGACACCATC TCGGTCGACC CCTATTTCGC GCGCACCAAC ACCCTGGTCC CCGGCAAGAC CGTCTATGTC CCGGGCGGTT TCGGCTATCG CAGCCTGGAC TTCGAGCGCG AACGCAAGGG CATCGCCGCC GCCCTGCAAT GGCGCCCGAA CGAGCAATGG GACGCCAGCC TGCAGTTCCT GCGCTCGTCG GCCTCGCAGG CCTCGACGGA GCACGCGGTC GGTTTCAATC CCGGATCGAC CAACGGGCCG GCGACCGGCA CGGACTTCAC CTACGATTCG GACGGTCATT TCCTGAAGGG CACGCTTGCC CAGACGCCGG GCGGCTCGAG CCTGGGATCG TCGACCATCG ACACCCGGTA CTCGGATCGT AGCTCGGTGA CCTCCGACTA CGCCCTGAAG GTGAAATACA CTCCCAACGA CAAGTGGGCG TTCAGCGGCG ACATCCAGTA CGTCTACGCC AAGACCAAGA CCGTCGATAA CACCGCGTTC AACGCGCTGA ACAGCGACGC CGCGCCCGCC AGCCTGGACC TCACCGGCAG CCTGCCGGTG ATCACCATGA ACAACGACAC GGCCTACACG TCCAACGCCG CCAACTACTA CCTGCAGGCG GCGATGGATC ACCACGACCG CAACGAGGCG GCCCAGTGGG CCGAGCGCTT CGACGGCGAC TACAGCTTCG ACGATGGCGG CTGGCTGAAG TCGTTCCGCT TCGGCGTTCG CCACACCTTC CGCCAGGCGA CCACCCGCGA AACCAACTAC CGTTGGGACA CGGTGGCGCC GAGCTGGTCG GCGACCGCTC CGATCAGCAC GCTGGACGGC TACCAAGGCT ATTACGGGCT CTACGAATTC GACAACTACT TCCGCGGCAA GGCCCACCTG CCCGCGACCT TCGTCATGCC GAGCGCGCAG TTCGTGAACA ACTACGGCGA CACCTCGCTG GTGCTCTCCA AGATCGCGCA GCAGAACGGC GGCGGCTGGC GGCCCTTCAA CGGTGTCTTT GACGACCAAG GTCAGGCCGG CGGCAAGGGC AGCATCAACC ACCAGAAGGA AGAGACCCTG GCGGCCTACG GCCTGCTGCG GTTCGGCCAT GACGTCTCGC TGTGGGGCGA TCAGCGCGAG ATCGACGGCA ACTTCGGCCT CCGCGTCGTC AAGACCGAAA GCCAGAGCCT GGGCATGCAG GTGTTCACCC CGAACACCAC CAGCACCGAC ATTCCGGCCG CCGATCAGGC GTTCTCGAAC GGCGCCAAGA GCCCCTACAA GGGCGGTCGC GACTATGTGA GCGTCCTGCC CAGCCTGAAC GTCCGCCTGA AGATCACGCC GGACATGTTC ATCCGCTTCG CGGCGGCCAA GGCCATCGTC CGTCCGGACT TCCAGCAACT GCAGCCGAAC TACACGATCT CGGCCACCAA CGGCTTCATC ACCGGCGGGA CCTGCTCCAG CACCATCCCG GGCGGCTCTC AGGCCAACTG CGTCTATCAG TACACGGCCA ATGCCGGTAA TCCGGACCTG AAGCCGACCC GTTCGACCCA GTTCGACGTG TCATACGAGT GGTACTTCAA CTCGACCGGC AACGTGACGG CGACGGCGTT CTACAAGGAC ATCTACAACT TCGTCACCAA CGGCTCGACG AACCTCAACT TCACCAACAA CGGCGTGACC CGCACCGTTC AGGTGGTCCA GCCGTACAAC GCCGGCCACG GCACGATCAA AGGCTTCGAA GTCGCCTACC AGCAATATTA CGACTTCCTG CCAGGCGTCC TGCGCGGCCT GGGGACGCAA GCCAACTTCA CCTATGTCGA CAGCAAGGGC TCGCGCAACG CCGCGTCCAA CCCATACGAC ACCAACCAGG TCGGCAATAT CAGAACCAAT GGGGAGGAGC TGCCGCTCGA GGGCCTGTCG AAGAAGAGCT ACAACGTCGC GGCGCTATAC GACCTGGGCA AGGTCTCGGC GCGCCTCGCC TACAACTGGC GTGAGCGCTA CCTGCTGACC ACCACGGCGG CGAACATCAA CATTCCGGCC TGGTACGGCT CCTACGGCCA GCTGGACGGC TCGGTGTTCT ACACGGTCAA CGACGCGCTG AAGATCGGCT TCCAGGCGGC CAACCTCACC AACACCCGGA CCAAGATCCT GGTCAGCTAT CCGGGCAAGC CCGAGGAAGG TTTGACGAAC CACAACTGGG TTGTCGCCGA TCGCCGCTAC TCGATCGTGC TCCGCGGCAC GTTCTAA
|
Protein sequence | MSLHNSRRNA MLMGASALVI CGALPAIAQA QTTAPASSAD TVEEVVVTGQ RAALQSAQKL KQNAEQLVDS ITATDIGALP DRSVTEALQR VAGVTIGRTA DGRDADRISV EGSGVQVRGL SWVRGEVNGR DSFSAKAGRT LSFEDVPPEL MSGVDVYKNP SADIIEGGVG GTVNLRTRLP FDSGKRNLAY SADSSWGDLS RKWEPSGSVL YSNRWDTKIG ELGFLIDLSD SKLSSRTDTI SVDPYFARTN TLVPGKTVYV PGGFGYRSLD FERERKGIAA ALQWRPNEQW DASLQFLRSS ASQASTEHAV GFNPGSTNGP ATGTDFTYDS DGHFLKGTLA QTPGGSSLGS STIDTRYSDR SSVTSDYALK VKYTPNDKWA FSGDIQYVYA KTKTVDNTAF NALNSDAAPA SLDLTGSLPV ITMNNDTAYT SNAANYYLQA AMDHHDRNEA AQWAERFDGD YSFDDGGWLK SFRFGVRHTF RQATTRETNY RWDTVAPSWS ATAPISTLDG YQGYYGLYEF DNYFRGKAHL PATFVMPSAQ FVNNYGDTSL VLSKIAQQNG GGWRPFNGVF DDQGQAGGKG SINHQKEETL AAYGLLRFGH DVSLWGDQRE IDGNFGLRVV KTESQSLGMQ VFTPNTTSTD IPAADQAFSN GAKSPYKGGR DYVSVLPSLN VRLKITPDMF IRFAAAKAIV RPDFQQLQPN YTISATNGFI TGGTCSSTIP GGSQANCVYQ YTANAGNPDL KPTRSTQFDV SYEWYFNSTG NVTATAFYKD IYNFVTNGST NLNFTNNGVT RTVQVVQPYN AGHGTIKGFE VAYQQYYDFL PGVLRGLGTQ ANFTYVDSKG SRNAASNPYD TNQVGNIRTN GEELPLEGLS KKSYNVAALY DLGKVSARLA YNWRERYLLT TTAANINIPA WYGSYGQLDG SVFYTVNDAL KIGFQAANLT NTRTKILVSY PGKPEEGLTN HNWVVADRRY SIVLRGTF
|
| |