Gene Information |
Locus tag | Caul_1862 |
Symbol | |
ID | 5899317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1996051 |
End bp | 1998894 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562352 |
Product | TonB-dependent receptor plug |
Protein accession | YP_001683489 |
Protein GI | 167645826 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
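The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon of the CDS is a stop codon that does not appear in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1996051
end_bp = 1998894

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2844 947
```

Both values agree with the Gene Length (2844 bp) and Protein Length (947 aa) fields.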
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.248067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGCC ACCTCTGTCG CGCCACGCTG CTGGCGCTCA TCTCGGTCGG CTCGGTCTCG ACCGCCGGCC GCGCCCAAGG CCGGACAGAA TTGGGACCCG CCCTCACCCG GCTCGCCGTC GAACGCAATG TCCAGATCCT TTTCCAGCCG CATCTTGTCG AAGGCCTGGT CGCCAATCCG GTTCGGCGCG GGACCAGCCT GGACCAGGCC ATGACCATGA TCATTGGCCG CCAGGGCCTC AGGATCCGCA AGGTGCGCGC GGGGATCTAT GCGGTCGAGC CAGAAATCAG GAGCATTTCT CCACCGGCTT CCCTCTCCAA GGAAGACAGC CCGAGCGTTG TCGCCCCGCT GATCGTCACC GCCGTCTACG CCGCCAGCCT CGAGCGGACC CTTGCCCTCA AGCGGGACGC GACCCACGGC CTGGACGCGG TCAGCGCCGA GGACATCGCC CGCCTGCCCG CCGCCAACGC CGCCGAGGCC CTGCAACTGG CGCCGGGCGT GAGCCTGGAG CGCCATCGCG GCGTGGGCCT CTATGTCAGC GTCCGGGGGC TTGGGCCGCA GTTCCAGAAC GTCCTGCTGA ACGGCCGGTC GATCGCCATC AACGACCTGG TCGAGAACGG GGGGTTTCGC GGCCGGCAGT TCCGCTTCGA GGTCCTGCCC TCCGACGTCA TCGACCGCAT CGAGGTCATC AAGACCACCA CCGCCGACAT GGACGAAGGG GCGCTGGGCG GCAATATCGA CGTGCGGACC TTCAAGCCGC TGGAGCGCGG CCCCCGCGCG GTGCTGTCGG CCCGGGCCTC GCAGGGCCAG GCCGGCAAGC CGGACCCCGC AGTGTCGGGG GTCTGGAGCT GGGTCTCCCC CGACGGTCAT CTGGGCCTGC TGGCCGCGGG CATGGCCGAG CGCCGCCAGA TCCGTAACGA CCGCCTCTAC CAGACGGGCT GGAACCTCGA CCGCTTCACC AATGTCCTGC CCGCCGGCCT GTACACGCCA ACGCGCACGC GGCCGACCAT CGAGCTGGAA GACCGCCGTC TGATGTCGGG CGACTTCGCC CTGCAGTGGC GGCCCTCGCC CGACTGGCGG ACCGATATCG ACCTGCTGGT GACACGCCTG GACGCCCACT ACGACGAATT CGGCCTGGAC ATCTATCCGG ACGACACCAC CTTCGCCCAC CCCGCCTTCG TGGCTGGCAG CCAGAGGGTG GTCGGCGACA CCGTTCAGGC CGGCCAGATC GACAACGTGC GCTGGATGGC GTCGCGCGAG ACGAGCCTCA ATCGCCACGA CTTGGCGGCC TTCGGCGTCC GGCAAAGCTG GACGCCGGGC GCGTGGGCGC TGGACATCGA CTACGCCTAC TCCCGAGCCC GCAGCTATCA TCCCGACGGC CAGGGCACTG TGCGGGCGCG CGCCGCCTTC TTCGCCCCGT TGATTTACGA CTTCGGCGGC GGCCTTCACA GCGCGCCGAC GCTGAAGACC ACGATCGACT ACACCGACCC CGCGCGGTTC GTCGGCCAGG CGTTCGACTA CACCTGGAAG GACTCGCGCG ACACCGACGA GGCGCTGAAA GCCGACCTCG CCCGGTCGTT GGGGGCTGGC AAGCTCAGCC TTGGGGTCGA GGGTCATCGG CGGGTGCGCG ACTATCGCCG GCGCGACTGG ATTCTCAACA CGGTCGTCGG CGCGCCCCTG ACTAGCCTGG GCGGTGAGTA TTACGGCCAA ACTCCCGTCT CGGACTATCT GGCCGGCACG CGGGGCGAGC TGCCGCGCCA CTGGGTGGCC CTGGACGCCC GCGCCTTCTA CGAGCAACTG 
TTCACGGAAG AGATCGCAGC CTTGCCCCCG ACGGTGTCTG ATCGGCGCAA TTCGTTCGTG GTCGAGGAGA AGATCCTCTC GGCCTATGCG CGCGGCGACT TCTCGGCCCG CTGGTTCGGC CTGCCGGTCG ATGGCGACGT GGGCGTTCGC TACGCGAGCA CGCGGCAGAT CTCGACGGGC GTGCTGTCCA GCGGCGCCGA ACCGATCCCC GCCCAGTGGC GCAAGGCCTA TGGCAACTGG CTGCCCAGCG CCAATTTGCG CGTCACCCTG ACGCCGGACC TGCTGCTGCG GCTGGCGGCC TCGCGGGTGG TCAACCGCCC CAACGTCGTC GACAACGCCC CGCGCATCAC CCTGGCCCGC GACACGCCGA CCGCCAACGG CGGCAACCCG GACCTCGACC CGTTCCTGGC CACCCAGTTG GACGCCTCGC TGGAGTGGTA CTTCCCGTCC GGCGGCGCCC TGACCGGCGC GGTGTTCGAC CGGCGGCTCG ACAACTACAT CACTGCCCAG AACACTTTCA TCCAGGTTCC CGGGCGCGGC GAAATCCTGC TGTCGACCAA CGTCAACGGC GGCGACGCCC GCATCCAGGG TCTGGAGCTG GCCTACAGCC GAACCTTCAA AAGCCTGCCC GCGCCGCTGA ATGGCCTGGG CATGCAGGGA TCGCTGACCC TGGTGCGCAG CCAGGCCAAC TATTTCGCCG GCGACCGGGT GATCCGCAAC GCCCTGCTGG GCCTGTCACG CACCAACTAC AGCCTGCTGG CCTTCTACGA GCGCGGCCGC GCCTCCGTGC GACTGGGCTA CAATTGGCGC GGCGCATACC TGACCACGAT CGGTAGCTCG ATCACCGCCC CGGCCACCAC GGCGGCCTTC GGGTCGTTGG ACGGCGCGGC GTCCTGGCGG GTCAATCGGC GGGCGACGAT CACCTTCGAG GGCGTGAACC TGGCCGACGC GCGGCGCTTC GTCTATGGCG AGAGCCGCGA CCAGCCGATG GAAATTCATC ACTGGGGCCG ATACCTGTCC ACGAGGCTGC GATGGGCGTT CTGA
|
Protein sequence | MPRHLCRATL LALISVGSVS TAGRAQGRTE LGPALTRLAV ERNVQILFQP HLVEGLVANP VRRGTSLDQA MTMIIGRQGL RIRKVRAGIY AVEPEIRSIS PPASLSKEDS PSVVAPLIVT AVYAASLERT LALKRDATHG LDAVSAEDIA RLPAANAAEA LQLAPGVSLE RHRGVGLYVS VRGLGPQFQN VLLNGRSIAI NDLVENGGFR GRQFRFEVLP SDVIDRIEVI KTTTADMDEG ALGGNIDVRT FKPLERGPRA VLSARASQGQ AGKPDPAVSG VWSWVSPDGH LGLLAAGMAE RRQIRNDRLY QTGWNLDRFT NVLPAGLYTP TRTRPTIELE DRRLMSGDFA LQWRPSPDWR TDIDLLVTRL DAHYDEFGLD IYPDDTTFAH PAFVAGSQRV VGDTVQAGQI DNVRWMASRE TSLNRHDLAA FGVRQSWTPG AWALDIDYAY SRARSYHPDG QGTVRARAAF FAPLIYDFGG GLHSAPTLKT TIDYTDPARF VGQAFDYTWK DSRDTDEALK ADLARSLGAG KLSLGVEGHR RVRDYRRRDW ILNTVVGAPL TSLGGEYYGQ TPVSDYLAGT RGELPRHWVA LDARAFYEQL FTEEIAALPP TVSDRRNSFV VEEKILSAYA RGDFSARWFG LPVDGDVGVR YASTRQISTG VLSSGAEPIP AQWRKAYGNW LPSANLRVTL TPDLLLRLAA SRVVNRPNVV DNAPRITLAR DTPTANGGNP DLDPFLATQL DASLEWYFPS GGALTGAVFD RRLDNYITAQ NTFIQVPGRG EILLSTNVNG GDARIQGLEL AYSRTFKSLP APLNGLGMQG SLTLVRSQAN YFAGDRVIRN ALLGLSRTNY SLLAFYERGR ASVRLGYNWR GAYLTTIGSS ITAPATTAAF GSLDGAASWR VNRRATITFE GVNLADARRF VYGESRDQPM EIHHWGRYLS TRLRWAF
|
| |
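The listed gene sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the space-separated blocks could be cleaned, translated, and scored for GC content follows. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares; only the first 30 bases of the record are used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three 10-base blocks of the gene sequence in this record.
fragment = clean("ATGCCACGCC ACCTCTGTCG CGCCACGCTG")
print(translate(fragment))    # MPRHLCRATL
print(gc_fraction(fragment))  # 0.7 for this 30-base fragment only
```

The translated fragment matches the first block of the protein sequence (MPRHLCRATL). Running the same functions on the full gene sequence would be the way to check the 947 aa length and the reported 69% GC content.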