Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4007 |
Symbol | |
ID | 5901469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4337537 |
End bp | 4338706 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564528 |
Product | TonB-dependent receptor |
Protein accession | YP_001685630 |
Protein GI | 167647967 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.413023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGGCG GCTTCTACCT GCACCGCGAC ACCGACCTCA ACGGCGAGGA CCGGTCCAGC TCGGCCTTCC TCACCACCCG CCGCCTCACC GGTCTGCCGG GCGCGACCTT CGCCAAGTTC GGCGCCGACA CGCGCACCTA CGAACTGGCC GGCTTCGGTG AGCTCACCTA CCACCTGACC GACAAGCTGT CGGCCACCGG CGGTCTGCGC TACGGCAAGT ACGGCGGCAC GGTGGACACC TATGCCGGCT TCAACACGGC GTATTTCACC TACGCGCTGC TCGGCTTTTC CGGGCCGCTG GCGCTCACCC CGTCTCCGGC CTCGACCACG AAATACCCCT CGGCCGAAAA GGCGTCGTGG AAAGCCAGCC TGACCTACAA GCCGTCGCGC GACCTGACGA CCTACGCCAC CGTCTCCACC GGCTACCGGA CGCCCGTCTA CAACGGGCGC GCCGGCAGCG TCAGCACGGT CAATCCGAGC GACCTGGTCA TTCCGGCCGG CGCGGGCTCG GACAATCTCA TCAACTACGA GGTCGGCCTG AAGGGGCGCT GGCTGGACGG GAAGCTGAAC GCGAACCTGG CGGCCTATTA TATCGACTGG AAGAACATCC AGGTTCAGGC CAACCGCCAG TCGGATTCGA TCCAGTTCGC CACCAATGTC GGGCGCGCCG CCAGCAAGGG GCTGGAGGCG GAAGTCACGC TCGCGCCGGT CCGCGGGCTG GTGTTGGGGC TGAACGGCTC GCTCAACGAC GCCAAGGTGA CCGAGCTCTC TCAGCAGGAG GCCGTGATCT CCGGGGCGGT GGATGGCGCG AGGCTGGCGT CGCCGCACGT GCAGGGGGCG TTGTTCGGCA CGTACAGCTA CGCCGTGGGC GACGGGGCGA CGGGCTTCAC CAGCGTCCAG ATCCAGCACG TTGGTTCATT CCCCAACGGC TTCCCCAACA AGCCGGGCAC GCCGGGAACG CTTAGCCCGC TGTACGGACA CACCGACAGC TACACCTACG TCAACCTGCA GACGGGCCTG ACGTTCGGCA AGCTGAGCAC GACCCTCTAC GCCGAGAACC TCGGCAACAG CCGGGCGACG GTCTACATTC ACCCCGAAGC CTTCGTTTAC AGTCGCAACG CGATCGTCCG GCCGCGCACG TTCGGCGTCC GGGTGGGCTA CGACTTTTGA
|
Protein sequence | MAGGFYLHRD TDLNGEDRSS SAFLTTRRLT GLPGATFAKF GADTRTYELA GFGELTYHLT DKLSATGGLR YGKYGGTVDT YAGFNTAYFT YALLGFSGPL ALTPSPASTT KYPSAEKASW KASLTYKPSR DLTTYATVST GYRTPVYNGR AGSVSTVNPS DLVIPAGAGS DNLINYEVGL KGRWLDGKLN ANLAAYYIDW KNIQVQANRQ SDSIQFATNV GRAASKGLEA EVTLAPVRGL VLGLNGSLND AKVTELSQQE AVISGAVDGA RLASPHVQGA LFGTYSYAVG DGATGFTSVQ IQHVGSFPNG FPNKPGTPGT LSPLYGHTDS YTYVNLQTGL TFGKLSTTLY AENLGNSRAT VYIHPEAFVY SRNAIVRPRT FGVRVGYDF
|
| |