Gene Information |
Locus tag | Caul_2663 |
Symbol | |
ID | 5900118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2895087 |
End bp | 2897273 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641563154 |
Product | TonB-dependent receptor |
Protein accession | YP_001684288 |
Protein GI | 167646625 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
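The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2895087
end_bp = 2897273
gene_length_bp = 2187
protein_length_aa = 728

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # True
print(span % 3 == 0)                  # True: the CDS is a whole number of codons
print(residues == protein_length_aa)  # True
```

Under these assumptions the reported gene length (2187 bp) and protein length (728 aa) agree with the coordinates.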
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0150828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.481934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCGAC GCCTTACTTC CCTTTACTGG ATTTCCGGAA CTTCCGCGCT GGCCCTGGCC GGCGCCGCCC AGGGCGCCAC GCCAGCCGCG CCCGCCGAGA TCGACGCCAA CGCCGCCCCG GCCGTCGAGC AGGTGGTGGT CACCGCCGAG CGCCGGGTCA CCGACCTGCA GAAGACCGCC ATCGCCATTT CCGCCTTCAG CGGCCAACTG CTGGCTGATC GCAAGATCGA CAGCATCCGC GACCTGTCGG GCCAGATCCC CAACTTCAGC ATCAGCCGCG TCACCATCAG CCACACCACC CAGACCTACG CCCTGCGCGG CGTGGGCGAG AGCGATCCGA TCCAGGAGCC GGTGCTGGCG GTCTATGTCG ACGACGTCTA CATCCCGCGC CAGATCGGCT CGATGGTCGA GTTCAACGAC CTGGAGCGGG TCGAGGTGCT GCGCGGCCCG CAAGGCACGC TTTACGGTCG CAACTCCAGC GCCGGCGCCC TGCGGATCAT CACCCGCGAC CCGGGCGAGG CCTTCCGCGC CAAGGCCGAG GTCGGGCTTG GAAACTACGG CGCGGTCGAT GTGCGCGGCC TGATCGAGGG GCCGCTGGTC GAGGGCAAGG TGTCGGGCAG CCTGTCCTAT ATCCACCACA GCCGCGACGG CGTCACCTTC GACCCAACCC TGAACCACGA CGTCAATCGC ATCGATCTCG ACGCCTACCG GGCCAAGCTG CGCTGGACGC CGACCGACAG GCTGGACGTG CTGCTGACGC TCAACGCCCT GAAGGATCGC AGCGACACGC GCAGCTACGT GCCCGTCAAG CAGCCGGGCG GGGGGTTCCG CAACGACCGC TCCTATTCCG AGGTCGAGCC GACCCAGGAC CTGGACCAGG TCAGCGGCGC CCTGCGCGTG CAGTACACGC TGAACGACAA TCTGAAGCTC AAGTCAGTGA GTTCGTACGG CGGCTTCAAC CTCAACCCCG TCGCCTATGA CAACGACGGC GAGGCGGCGC TAATCCAGAA GAACCTGATC CACTACAACG ATCAGTACGT CACCCAGGAA ATCCAGCTGA ACGGCGACTA CGGCAAGCTG ACCTTCACCA GCGGCGCGTT CTATTTGCAC GAGCGGTTCT TCGTTCAGCG TGACGGCTAC AGCCGGCGCA ACGCCGTGCC GAGCGACCCC GTCACGACCC CGGGAAGTTA CGGCTTCGCC CGGGCCCACA ACATCACCAA CACCGACGCC TACGCGCTGT TCGGCGAGGC GACCTACGCG CTCAGTGACA GGCTCAGTGT CACCGGCGGC CTGCGCTGGA CCAACGAAAA GAAGGATTTC CTGTTCGACA ACAAGGTGCT GAACCTGGCG GGTCAGGTGA CCGGCCAGTC GATCGCCGGC CACGCCGACA AGATCTTCTC CGCCGTCACG CCCAAGCTGT CGGCCCAGTT CCAGTGGACG CCGGACGTCC TGCAGTACGT GACCTATTCG CGAGGCTTCA AGTCGGGCGG GTTCGACAAC CGCGCCACGC GCCTGGACCT GGCGACGCGT CCGTTCGCGC CGGAGAAGGT CGATACCTAC GAGACCGGCC TGAAGACCGA ACTGCTGAAC CACCGCGCGC GGTTGAACCT GGCGGTGTTC TACAACGACT ACAAGGACCT GCAGGTCAGC TACAGCGATC CGGCCTATCC CGGCAATTCG GTGCGCGGCA ACGCCGGCAA GGCCCACACC TACGGCGTCG AGCTGGAGAG CGACGTGCGG GCCACCGAGC GCCTCTCGCT GCAAGCTTCG GCCGGCTACC TGTTCGCGGT TTATGACAAG 
TACAAGAACG CCGGCGGCCT GGGCGTCGAC GCCGACGGCC ACCGCCTGCT GAACTCGCCG CGCTGGAGCG TCTCGGGCGG CGTCACCTAC GACGTGCCGG TCGGTATTCC GGGCTCGATC CGGGTGGGTT TGAACGCCCA GTTCCAGACC AAGACCTATT TCAGCGCCCT GCAACGTCCG CAGGACCAGG CCCCGGCCCA GACCTTCGTC AACGGCACGG TGACCTGGCA GTCGCCCGAT CCGCGCTGGA GCGTCCAGCT GTCGGGTCGC AACCTGCTGG ATTCCGACGA GCCGGTCAGC GCGACCTACA CGCCCTCGAC AGGGGTCTAC TACAAGAACT ATCCCGATCC GCGGACCTGG CTGGTCACGC TGAAATACGC GCTGTGA
|
Protein sequence | MSRRLTSLYW ISGTSALALA GAAQGATPAA PAEIDANAAP AVEQVVVTAE RRVTDLQKTA IAISAFSGQL LADRKIDSIR DLSGQIPNFS ISRVTISHTT QTYALRGVGE SDPIQEPVLA VYVDDVYIPR QIGSMVEFND LERVEVLRGP QGTLYGRNSS AGALRIITRD PGEAFRAKAE VGLGNYGAVD VRGLIEGPLV EGKVSGSLSY IHHSRDGVTF DPTLNHDVNR IDLDAYRAKL RWTPTDRLDV LLTLNALKDR SDTRSYVPVK QPGGGFRNDR SYSEVEPTQD LDQVSGALRV QYTLNDNLKL KSVSSYGGFN LNPVAYDNDG EAALIQKNLI HYNDQYVTQE IQLNGDYGKL TFTSGAFYLH ERFFVQRDGY SRRNAVPSDP VTTPGSYGFA RAHNITNTDA YALFGEATYA LSDRLSVTGG LRWTNEKKDF LFDNKVLNLA GQVTGQSIAG HADKIFSAVT PKLSAQFQWT PDVLQYVTYS RGFKSGGFDN RATRLDLATR PFAPEKVDTY ETGLKTELLN HRARLNLAVF YNDYKDLQVS YSDPAYPGNS VRGNAGKAHT YGVELESDVR ATERLSLQAS AGYLFAVYDK YKNAGGLGVD ADGHRLLNSP RWSVSGGVTY DVPVGIPGSI RVGLNAQFQT KTYFSALQRP QDQAPAQTFV NGTVTWQSPD PRWSVQLSGR NLLDSDEPVS ATYTPSTGVY YKNYPDPRTW LVTLKYAL
|
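The gene and protein sequences above are printed in space-separated blocks of 10 letters. As a minimal sketch of how the two relate, the code below strips the block spacing, computes a GC fraction, and translates DNA codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and chosen for illustration. The codon table is the standard one, which NCBI translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in the start codons it permits). Only the first three blocks of the gene sequence are used here.

```python
# Standard codon table laid out in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks that separate the 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGTCGCGAC GCCTTACTTC CCTTTACTGG")
print(translate(fragment))  # MSRRLTSLYW
```

The translated fragment matches the first block of the protein sequence (MSRRLTSLYW). Passing the full cleaned gene sequence to `gc_fraction` would be one way to compare against the reported GC content field.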