Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1450 |
Symbol | |
ID | 5898905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1541929 |
End bp | 1544922 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561937 |
Product | peptidoglycan binding domain-containing protein |
Protein accession | YP_001683078 |
Protein GI | 167645415 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0762725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCG CGGCGCCATG GAGCGTTAAG GGGATAGACC CCAAGGCACG GGAGGTCGCG AAAGACCTCG CGCGTCGCTC CGGCATGACG CTGGGCGAGT GGCTCAATCG CATGATCATC GAGGGCGAGG GCGTCGATGT CGCCGCCCTG GAAGCCTTCG GCGAACCCAG CCGTCCATCC CTCGCCGTCA ACAACGAACG CCCCAATACT TCCTACTATG AGACCGCTCG GGGCGCCGCG CCGTCGCGTA TCGAAGTGCG CGAGCATCCG GCCGACGAGG TCGGTCGCGT CGCCATCGCC CTGGACCGGC TGACCGACCG CATCGAGTCC GCCGAGAAGC GCTCGGCCCA GGCCATTTCC GGAATCGACG AATCCGTGCG CGGCGCCTTG CAGCGGCTGG CCACGGCCGA GCGCGAGCAG GTCGCCGTCG CCGCCCGCTT CGAGGGCGCG GTCGACGAGA TCAAGACCGA GCAGATCCGC GGCGCCGAGC GCCTGCGCCG CATCGAGAAC GAGGCCGCCG GCCCGCGCTC GGCCGAGGCG CTGCGGGCTC TGGAAGGCGC GCTCGGCAAG GTCGCCGGCC ATCTCTATGA AGGCGAAGCC CGCACCCGCG AGACGATCGC TCATCTCGAA CAGCGTCTCG ACAATCAAGC CGGCGTGGGC GCGGGCGATC CGTCGGCCCT GGTCGACGAA GTCGTGGCCC GACTGGGTGA GCGTCTGGAA GCCGCCGAAT CCCGCACCGC CGAGGCGCTG CAGGCCCTGG GCGCCTCTTT CACCGCCCTG GACGGCCGCC TGCGGACCGT CGAGACCAGC AATCCTGGCG AAGGCGTGCA GAAGCGCCTG GACGAGCTGT CCAACGGCCT GACCCAGCGC ATGGAGGCCG CCCGCATGGA AATGGCGGCC AAGCTGCGCC AGTCCGCCGA CGGTCGCTTC GACCGCATGG AGCTCAAGCT CGGCGAAATG ACCGCCCACG TGCAGGCCGC CGAGCAGCGC TCGGCCCAGG CCATCGAGCG CATGGGCCGG GAAGTGGTCG GCATGGCCGA CGCCCTGAAC CGCCGCGTCC AGACCTCGGA AGCCCGCAAC ACCGCCGCCA TCGAGCAAGT CGCCGCCAGC ATCGACCAGA AGCTGAACCG CGCCGACAGC GTTCAGGCCC AGGCTCTCGA GAAGCTGGGC ACGGAAATCG CCCGGATCAC CGAGAAGCTG GCCGAGCGGA TCAGCAACGC CGAGCGCCGT AACGCCCTGG CCATCGACGA CGTCGGCGAC CAGGTCACCC GCGTCACCGA TCGCCTGAAC CAGCGTCACG AGCGCACCTC GCAGGAACTG GTCGACCGCA TCCGCCAGAG CGAGGAGCGC ACCGCGCGCA TGCTCGACGA AGCGCGCGAG AAGATCGACG CCCGGCTTGC CGAAGCTCAA CGCAAGCTGG CCGAACAGGC CGCCGCGACC GCGGCCCCAG TCACGCGGTC CGACCCGTCG CCGTTCGACG GCGAGCGCTA TTCGTTCGGG GGCGAGGCCG TCCCCGAGGA CGTGTTCGAC CAACCCGCCG CCTTCATCCC GGCTCATGCA TCGACGCCGG CCCGCACCGC CGCGCCCGCC GCCTTCGAAG CCCCGGTCTA CGAGGCCCCG ACCTTCCCGG CCTCCGAGCC CGCCGAGCCC GAGTTCGGCG AAGAGGACTT CGAGGCCGCC GACGGCTTCG TCTCCGCGCC CGAGCCGGAG ACGGCGTTCG AGCCGGACGC CGACTACGAT ACCGATTTCG CGGCCCCCGA CTTCGCAGCC TCCGAGTTCT CCGCCCCCGA AGAGCCGGCC GCGCCGTCGC GTCCACTGTC GACGCGCGAA GTGATCGAGC AGGCCCGGGC CGCCGCCCGC CAAGCCGCCT TGGCCGGCGA GACCAAGGGC AAGGACAAGC CGAAGACCAA GAAGGTCGGA ACCTCGCTGT TCTCCGGCTT CGGCGCCAAG AACGCCGAGA AGAAGGCCAA GAAGGGCGCT ATGAAGACCG CCCTGATGGT CTCGGCCACC GCCGCCTTCA TGGGTGTCGG CGCGGCCGGC GTGATCATGA GCATGGGTCA CGGCGGCGGA CCGCTGCCGG AGCGTGTCGC TCAGGGCACG GCCAACAAGG CGACCGGCGA AATCCGCGCC ATCGAAACCG ACACCAGCCC GGCCGGCGCC CGCGCCGCCG TCGCCCTGAC CTCGCAGGTC CCGCTCGACG CCACCGCCCT GCCGGTCGAG GCGTCCACCG TCACGCCGCC CTCCGAGAAC GCCCAGGCCC TCTACGACGA CGGCGTGCGC CGCATCGAGG CCAAGGACCG CACCGGCCTG GAGCCGATGC GCAAGGCCGC CAACCTCGGC CTGCCGCGCG CCCAGTTCTA CCTGGCCAAG ATGTACGAAG TCGGCGAGGG CGGGGTGAAG AAGGACCTCG TTGAAGCCCG CCGCTGGACC GAGCGCGCCG CCACGGCCGG CGAAGCCCGC GCCATGCACA ATCTGGGCCT CTACTACTAC AAGGGCGACG GCGGCGAACG GAACTCGACC AAGGCCGCCA GCTGGTTCCG CAAGGCCGCG GACCTGGGTC TGGTCGACAG CCAGTTCAAC CTGGCTCAGC TCTACGAATT CGGGCGCGGC GCCGACCAGA ATCCGACCGA AGCCTACAAA TGGTACCTGA TCGCCGCCAA GAACGGCGAC ACCAGCGCCG GCGCTCGGGC CATGGCCCTG CGCAGCCAAC TGACCCCCGA GGGACAGCGG ACGGCCGAAC GCTCGGCCTA TGGCTTCCGC TCGCAGGCCG CCGCGGCGCC GCTGCAGACC GCCAGCGTCT CGGCCGCCAG CGCCGGCCTG GCCACCGCCC AAAAGGCCCT GTCCAGGCTC GGCTACTACC AGGGACCGCA GGACGGCGTC GCCTCTCCGG CCTTGCGGAT GGCCATCGCG GCTTACCAGC GTGATCAGAG TCTGCCGACC TCGGGAGCGC TCGATGGCGA GACCCTCAGC CGCCTGGCGA CCTGGGTCCG CTAG
|
Protein sequence | MTAAAPWSVK GIDPKAREVA KDLARRSGMT LGEWLNRMII EGEGVDVAAL EAFGEPSRPS LAVNNERPNT SYYETARGAA PSRIEVREHP ADEVGRVAIA LDRLTDRIES AEKRSAQAIS GIDESVRGAL QRLATAEREQ VAVAARFEGA VDEIKTEQIR GAERLRRIEN EAAGPRSAEA LRALEGALGK VAGHLYEGEA RTRETIAHLE QRLDNQAGVG AGDPSALVDE VVARLGERLE AAESRTAEAL QALGASFTAL DGRLRTVETS NPGEGVQKRL DELSNGLTQR MEAARMEMAA KLRQSADGRF DRMELKLGEM TAHVQAAEQR SAQAIERMGR EVVGMADALN RRVQTSEARN TAAIEQVAAS IDQKLNRADS VQAQALEKLG TEIARITEKL AERISNAERR NALAIDDVGD QVTRVTDRLN QRHERTSQEL VDRIRQSEER TARMLDEARE KIDARLAEAQ RKLAEQAAAT AAPVTRSDPS PFDGERYSFG GEAVPEDVFD QPAAFIPAHA STPARTAAPA AFEAPVYEAP TFPASEPAEP EFGEEDFEAA DGFVSAPEPE TAFEPDADYD TDFAAPDFAA SEFSAPEEPA APSRPLSTRE VIEQARAAAR QAALAGETKG KDKPKTKKVG TSLFSGFGAK NAEKKAKKGA MKTALMVSAT AAFMGVGAAG VIMSMGHGGG PLPERVAQGT ANKATGEIRA IETDTSPAGA RAAVALTSQV PLDATALPVE ASTVTPPSEN AQALYDDGVR RIEAKDRTGL EPMRKAANLG LPRAQFYLAK MYEVGEGGVK KDLVEARRWT ERAATAGEAR AMHNLGLYYY KGDGGERNST KAASWFRKAA DLGLVDSQFN LAQLYEFGRG ADQNPTEAYK WYLIAAKNGD TSAGARAMAL RSQLTPEGQR TAERSAYGFR SQAAAAPLQT ASVSAASAGL ATAQKALSRL GYYQGPQDGV ASPALRMAIA AYQRDQSLPT SGALDGETLS RLATWVR
|
| |