Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0970 |
Symbol | |
ID | 5898425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1024674 |
End bp | 1027649 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641561452 |
Product | TonB-dependent receptor |
Protein accession | YP_001682598 |
Protein GI | 167644935 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGC GGGCGCTTGC CCTGATCAGC ACGAGCATTC CCGCGGCCGC GCTGATGGCG ACGCCCGCGG TTGCCGAGGA CTATACGAAC GTGCAGGCCT CGGGCCGCGT GGAGTCGACC GACGGCAAGC CGGTCGCCGG CGCGGTGGTC CGGATCACGT CGGAAACCCT GGGCGTGAAG CGCTCGGCGG TCACCAGCGC CAGCGGCGCT TACGTCATTC CGCAGCTCGC GCCGGGCGTC TACGCGGTCT CGGTGACGGC GGCGGGCTTC GACACCTATG CCGAGAAGGG CGTGCTGATC AGCCGTTCGG GCGCATCGAA CCTCTTCACC CTGGCCCCGG TCGGCTCGGT CGAGGCCATC GAGGTCAAGG CTGGCCGCAC CCGCACGCCG GACTTCGAAA TCACCACCAC GGGCGCGGCC ATCGAGATCG GCGAACTGGC CGAACGCGTG CCGGTTGGCC GCTCGCTGCG TGACATCGCC CTGATGTCGC CGGGCGTGGT CCAGGGGTCG TCGTCGGCCA ACGGGACCTT CGCCAACCAG ATCTCGATCT CCGGCGCCAG CTTCATCGAG AACGCCTTCT TCGTGAACGG CCTGAACATC ACCAACTTCC GCATGGGCCT GCTGCCCGTC GAGGTGCCGT TCGACTTCTA CAAGTCGGTG GAAGTGAAGA CCGGCGGCTA TCCGGCCGAG TTCGGCCGCT CGACCGGCGG CTTCGTCAAC GCCATCACCA AGTCGGGCGC CAACGCGTTC CATGGCGGCG TGACGGCGAC GTGGGAGCCG GGCGACCTGC GCGATTCCGC GCCCAACACG TTCAAGAACA ACTTCAAGGA CGCCACCTCC AGTCGCGAGG AATGGGTGGC CGAGCTGGGC GGGCCGATCA TCAAGGATCG CCTGTTCTTC TACGGCCTCT ACAATCAGCG CAAACTCGAG TCCTTCAGCC CGTCGGCCAG CCAGGACAAC GCCACCCGCA CCCGCGACGA CGCGCCGTTC TGGGGCGGCA AGCTGGATGG CTGGCTGACC AGCAAACAGC ACCTTGAGTT TACCTATTTC GACAGCACCA AGGACGTCCG CAATCGCAGC CTGAACTACA ACCGCGCCAC CAAGCAGGTC GGCGCGGAAA CCGGCGGCAC CAACCAGCGC TCCGGCGGCG AGAACTACAT CGCCCGCTAC ACCGGCACCT TCACGCCCTG GTTCACCCTG TCGGGCGCCT ATGGCGTCAA CAAGTTCCGC GATGGCCAAC TGCCGCTCGA CACGACCCAC GAGCGCGTCA TCGACTACCG GACCAATTCG GCCGGTGTCG ATATCGGCGT CAACAAGGTG ACCGACGCGA TGTCGTTCAA CGACGACGAG CGCACGTTCT ATCGCGCCGA CGCCGACTTC AACTTCGACC TGTTCGGCGC CCACCACCTG CGGGTCGGCT ACGATCACGA GGAGAACACC GCCACCCAGG TGTTCGAGAC CATCGGCTCG GGTTTCCTGA AGGTCTTCCG CGCCACCGGC GCCGACCAGA CCGCCCTGCC GGCGGGCACG GACTATGTGA CCACCCGCGT CTACCGCAAC AGCGGCTCGT TCAGCACGGT CAACCAGGCT TTCTACGCCC AGGACCAGTG GTCGCTGTTC AATGATCGAC TGCAGTTCGA GGCGGGCGTC CGCAACGACC GGTTCGACAA CCGCAACGCC GACGGCTCGA CCTTCTACGA TCCGGGCAAC CAATGGGCGC CGCGCATCTC GGTCTCGGGC GACCCCTTCG GCGACGGCAA GTCCAAGCTC TACGGCTACT TTGGGCGCTA CTACCTGCCG CTGCCCAGCG ACCTGTCGCT GAAGTTCGCC GGGTCGCTGG TGACCTATAC CCGCTACAAC CTGCTGACCG GCGTCGCCGC CGACGGTACG CCGAGCCTGG GCGCGCCGGT CACCAGCGTG GCGGGCATGG CGCCCTGTCC GGACACCGGG ACGGCCAATT GCCTGGTCTC GGCCTCGGGC CTGGTGGCCG ACACCGCCGA GTCGATCGCC CACAACCTCA AGCCCCAGTC AGCCGACGAG TTCGTGCTGG GCTACGAGCG GCGCTTTGGC GACCTGATCA AGGTGGGGGC CTATTTCACC CACCGCGAGC TGAACAACAT CGTTGAGGAC GTGGCCATCG ACACCGGCGC GCGCGCCTAT TGCGTGAAGG CCGGCTTCAC CGCGGCCCAG TGCCGGTCCA GCTTCGGCGG CGGTCGCCAA TGGGTGATCG TCAATCCCGG CGAGGACGTG ACGATCCGCC TCAACAGCCT GCCGGACGGC TCCAAGCCGG TCGTCACCCT GGCGGCCGCG GACCTGACCT ATCCCAAGCC CAAGCGCGAC TACAACGCCC TGACCGCCAC CTTCGACCGG ACCTTCGACG GCAAGTGGTC GCTGTCGGGC TCCTACACCT GGGCCAGCCT GAAAGGGAAC TACGAGGGCG GCGTGCGCTC CGAGAACGGC CAGCTGGCCA TCAACACCAC GGCCGACTTC GACTCGCCGG GCTTCCTGGA CGGCGCCGAC GGCTACCTGC CCAATCACCG CCGCCACACC TTCAAGGGCT ATGGCAGCTA CCAGGTCAAC AAGTGGCTGA CCCTGGGCGG CAACGGCTCG GTGCAGTCGC CGCGAAAGTA CAGCTGCATC GGGATCGTGC CCAATGCGGT CGATCCGGTC GCCTTCGGCT ACCAGGGCTA TGGTTTCTAC TGCCAGGGCA AGGTCGTCGA ACGCGGCTCG GCCTTCGAGG GCGACTGGGT GACCCAGTTC AACACCAGCG CCGTGCTGAC CCTGCCGACG CCGGGCGACC GCTTCGACGC CTCGCTGCGA CTGGACGTCT TCAACCTGTT CAATTCGCAC GCGGTGACGG CCTATCACGA GTTCGGCGAC GTGGGCGGCA GCGGCGTCGC GGACGTCAAT TTCCGCAAGC CGGTCGACTA CCAGACACCG CGCTATGCGC GGATCCAGCT GCGCGTCGCC TTCTAG
|
Protein sequence | MALRALALIS TSIPAAALMA TPAVAEDYTN VQASGRVEST DGKPVAGAVV RITSETLGVK RSAVTSASGA YVIPQLAPGV YAVSVTAAGF DTYAEKGVLI SRSGASNLFT LAPVGSVEAI EVKAGRTRTP DFEITTTGAA IEIGELAERV PVGRSLRDIA LMSPGVVQGS SSANGTFANQ ISISGASFIE NAFFVNGLNI TNFRMGLLPV EVPFDFYKSV EVKTGGYPAE FGRSTGGFVN AITKSGANAF HGGVTATWEP GDLRDSAPNT FKNNFKDATS SREEWVAELG GPIIKDRLFF YGLYNQRKLE SFSPSASQDN ATRTRDDAPF WGGKLDGWLT SKQHLEFTYF DSTKDVRNRS LNYNRATKQV GAETGGTNQR SGGENYIARY TGTFTPWFTL SGAYGVNKFR DGQLPLDTTH ERVIDYRTNS AGVDIGVNKV TDAMSFNDDE RTFYRADADF NFDLFGAHHL RVGYDHEENT ATQVFETIGS GFLKVFRATG ADQTALPAGT DYVTTRVYRN SGSFSTVNQA FYAQDQWSLF NDRLQFEAGV RNDRFDNRNA DGSTFYDPGN QWAPRISVSG DPFGDGKSKL YGYFGRYYLP LPSDLSLKFA GSLVTYTRYN LLTGVAADGT PSLGAPVTSV AGMAPCPDTG TANCLVSASG LVADTAESIA HNLKPQSADE FVLGYERRFG DLIKVGAYFT HRELNNIVED VAIDTGARAY CVKAGFTAAQ CRSSFGGGRQ WVIVNPGEDV TIRLNSLPDG SKPVVTLAAA DLTYPKPKRD YNALTATFDR TFDGKWSLSG SYTWASLKGN YEGGVRSENG QLAINTTADF DSPGFLDGAD GYLPNHRRHT FKGYGSYQVN KWLTLGGNGS VQSPRKYSCI GIVPNAVDPV AFGYQGYGFY CQGKVVERGS AFEGDWVTQF NTSAVLTLPT PGDRFDASLR LDVFNLFNSH AVTAYHEFGD VGGSGVADVN FRKPVDYQTP RYARIQLRVA F
|
| |