Gene Information |
Locus tag | Caul_0067 |
Symbol | |
ID | 5897779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 75991 |
End bp | 81039 |
Gene Length | 5049 bp |
Protein Length | 1682 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560550 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001681703 |
Protein GI | 167644040 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
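The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names `span_length` and `coding_residues` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 75991
END_BP = 81039
GENE_LENGTH_BP = 5049
PROTEIN_LENGTH_AA = 1682


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_residues(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    computed_bp = span_length(START_BP, END_BP)
    computed_aa = coding_residues(computed_bp)
    print(computed_bp == GENE_LENGTH_BP)    # True
    print(computed_aa == PROTEIN_LENGTH_AA)  # True
```

Both checks agree with the record: 81039 - 75991 + 1 = 5049 bp, and 5049 / 3 - 1 = 1682 aa.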
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGACGG ACGAGACTCC GCCCGAAGGC GGCCGCGGCG CTCCCTCCAC GCCCTGGGCC GATCGCTGGG AAGCCTTCAA GGGCAGGATT CCGCCGAGCC TGAAGTCGCC GCTGTTCGCG GTGGCGGTCG GCGCCTTGGT GGTGGGATTC GGCGGCGGCT TCGCGGTCGG CAAGGTCGCC GATTTCGGCT GGTTCGGCGG CAAGTCGGCC GCCACGGCCG AGGCGCCCAA GGGCCAGTCC TGGTCGCTGT TCGGCAAGCC CCGCTCGGCC AACGCGCCCC GGCGCGGCGT TCCCAAGCCC GAGGGCTTCG CCGTCTGGCG CAGCCGGATC GACAGCTCCG GCGCGGAGCC CATGGCCTGC GTCCAGATGA GCAAGCCGCT CGACCCGTCC AAGGCCTATG CCGACTTCGT GCTGATCTCG CCCGACCTGG GCCGCCAGCC GGCCGTGCGG GTCAAGGGCG ACGAGCTGTG CCTCGGCGGC GTCGGCTTCA CCGACCATCG TGTCACCCTG CTCAAGGGCC TGCCGGGCAA GACCGGCGAG ACCCTGGGCG CCAACGCCGA CGTCGACTTC ACCTTCTGCG AAAAGCCGCC CTATGTCGGT TTCGCCGGTG ACGGCGTGAT CCTGCCGCGC GAGGAGTCCG ACGGTGTGGC GCTTGAGACC ATGAACGTCT CCAAGCTGGC GATCGAGGTC TGGCGCGTCT CGGACCGCAA TCTGGTGCGC AAGTCGATCA GCGCGCCCGA TCCGAGCGGC GAGGGCGACT ACGCCAGCGA CTATGGCGAC GACAGTCCCG ACGACGAGGG CCGCCAGGTC TGGAAGGGCG TGATCGACGT CCAGGGCGCG GCCGGCCAGA AGGCGACCAC CGTCTTCCCC CTCGGCGCGG TGCTGAAGGA GATGAAGCCA GGCGGCTACG TGATCAAGGC GCGAGACGCC TCGGGCGGCC GCAAGCCGGA GGGCGACGAG GAGCCGAGCC CGGCCCAGGC CCGCCGCTGG ATCATGTTCA CCGACATGGC GCTGATCGGC TACGACGGCG CGGAGTCGCT GGACGTGGTG GTCCGCTCGC TGAAGACCGC CAAGACCCTG TCGGGCGTCA AGGTCACGCT GGTGGCCAAG GACGGCGAGG ACCTGGCCGT GGCCAAGAGC GACGCGGACG GCCGCGCGCG CTTCCCCCGC GCGCTGATGG ACGGCGAGGG CGCGTCTCAC GCCAAGATGG TCATGGCCTA TGGCGACCAG GGCGACCTGG CGGTGCTGGA CCTGGACCGC TCGCCGGTCG ACCTGTCCAA GCAGGGCGTC GGCGGCCGCA CCGAGTCCGA CGGCGGCCGG GCGCTCAGCA GCGACATCGA CGGCTATCTC TATGCCGATC GCGGCATCTA TCGGCCCGGC GAGACCGTCC ACCTGACGGC CATGGTCCGC GACCGGCTGG CCAAGGCGGT CAACGACCGC AAGGGCTACA TCCTGGTCAA GCGGCCCTCG GGCGTGGAGT TCAAGCGCTA TCCGTTCAGC CGCGCCGACG CCGGCGCCGT GCTGGCCGAC ATCGCCCTGC CGCGCAGCGC GCCGCGCGGC CGTTGGACGG CGGTGCTGAA GATGGAGGGG GTCGAGGCCG ACTCCGGATC ATTCAGCTTC AGCGTCGAGG ACTTCGCGCC GCAACGGCTG GCGGTCACCG CCACGGGCCA GGAGTCCGTC CCGGTCGGGG CCGGCCAGGA GCGCAAGATC GACGTCTCCG CGCGCTTCCT GTACGGCGCG CCCGGCGCGG GCCTGCAAAC CCAGGGCGAG GCGCGGCTGA AGACCGACAC CGACCCCTTC 
CCGCAGTTCA AGGGCTACGA GTGGGGCGAC GACCTGACGC CCTTCGACGA GAAGTTCATC GAACTGGGCA CGACCGTCAC CGACGGCGAC GGCCACGCCA TGCTGAACCT GGCCACCACC GAGGCCGGCG ACACCGCCCA GCCGCTGGTG GCGGCGGTGA CGGCCTCGGT GTTCGAGCCC GGCGGCCGGC CGGTCCGCGA GGCCTTGGAG CTCAAGGTGC GCGGCAAGCC GGTCTATTAC GGCGTCAAGG TCGAGCAAGG CGACGCCGGG CGCGGGGATC CGCCTGTCAG CCTGGAGATG ATCGCGGTCA ACGCCGCCGG CGCCCGGATC GCGTCGACGG CGACCTACAC CCTGATCAGC GAGAACTGGA ACTATGACTG GTTCCAGCAG GACGGACGCT GGCAGTGGCG GCGCACCAGC CGCGACGCGG TGGTGGCCAA GGCCACGGTC AATATCGGCG CGGGAGCGCC CGCGCGCTTC AACCGCCGGC TTGGCTGGGG CGACTACCGC CTGGTGGTCG AGGGGCCGGA CGGCAGCAAG ACCGTCACCA AGTTCTCATC CGGCTGGGGT TCGCCCGCCA AGGAAGGCGA GGCGCCCGAC TTCGTCCGCG TCAGCGCCGG GACCAAGGCC TATGCGCAAG GCGACACGGT GGAGATCACC CTGAAGTCGC CCTACGCCGG TCAGGCCCAG ATCGCCGTGG CGACCGACCG CTTGATCGAA TTCAAGACTC TCAGCGTCGG CGAGAACGGT ACGACCGTGA AGCTGAAGAC CTCGGCCGCC TGGGGCGGCG GGGCCTATGT GATGGTCACG GTGATCCAGC CGCGCGACCC GGTCAGCTCG CCCAAGCCCA AGCGGGCCCT GGGGCTGATC TATGTCCCGC TCGACCCCAA GGGCCGCAAG CTGACGGTCG ATATCGGCAC GCCGGTGAAG CTGGACTCCA AGGCCCCGGT CGACGTGCCG ATCAAGGTCA ATGGCCTGGG CTTTGGCCAA AGGGCCAAGG TGACGATCGC GGCGGTGGAC GAGGGCATCC TGCGCCTGAC GCGGCAGGAC AGCCCCGACC CGGCCAAGTG GTACTTCGGC AAGCGGGCCC TGACCCTGAA CTATCGCGAC GACTACGGCC GCCTGCTCGA CCCGAACATG GGTGCGCCGG CCAATGTCAA TTTCGGCGCC GACGAACTGG GCGGCGAGGG ATTGACGACC ACGCCGATCA AGACGGTGGC CCTGTGGTCG GGCATCGTCG AGACCGGGCT GGACGGCAAG GCCGTGGTCA AGCTGCCAGC CGCCGACTTC AATGGCGAAC TGCGGATCAT GGCCGTGGCC TGGACCGACA CCGCCGTCGG CTCGGGCTCC AAGCCGCTGA CCGTGCGCCA GCCGGTCGTG GCCGACCTCA ACCTGCCGCG CTTCCTGGCC CCCGGCGACA AGCCGATGGC CACGCTGGAG CTGCACAATG TCGAGGGCAA GGCCGGCGAC TATTCGGTCG AGGCCTGGTC GACCAACGGC ATCGCGGTGG CTTTCAAGAA GGTCATCACC CTGATGCTGG GCCAGCGGAT CGCCGAGAAG ATCCCTTTCC TGGCCCCCAA TGTCACCGGG ATCGGCAAGA TCGGCTTCAA GGTGGCCGGT CCGGGCTTCA ACACGTCCAA GGATTACCCG ATCCAGACCC GCCTGGGCTG GGGCGACGTG GTGCGCACGA CGACCGAGCT GCAGCAGCCA GGCATGAGCT ACACGCCCAA CGCCCAGTTG CTGTCGGGCC TGGCGGCCGG CGACATCACC CTGCAGGTCA GCTACTCGCC GTTCAAGGGC TTCGACCCCT 
CGGCGGTCGC GGTGGCGCTG CAGCGCTATC CCTATGGCTG CACCGAGCAG TTGGTCTCGA CCGCCTATCC GCTGCTCTAC GCCCAGAGCG TCTCCAGCGA CCCCAAGCTG AGACGCAATC CGGCGATCCT GGCCGGGGCG GTGGGCAAGC TGCTGGACCG CCAGACCCTG GACGGGGCCT TCGGCCTGTG GCGGGTCGGC GACGGCGAGG CCGACGCCTG GCTGGGCGCC TACGCCACTG ACTTCCTGGT GGAAGCCAAG GCCCAGGGCG TGGCCGTGCC GGATGAGGCG ATGGACAAGG CGCTGAACGC CATGCGCCAG ATCAGCCGGC CCGACGGCTG GAGCTCGGTG TCCTACCGCC TGGAATATCC CGAATGGTGG GGTCGCACGC CCGACGACTC CAAGAAGGCC ACCGAGCGCA TGCGCCGCCG GGCCTCGGCC TACGCCCTCT ATGTGATGGC CAAGGCCGGG CGCGGGGATC TGGCCCGGCT GCGCTGGTGG CACGACGTGC AGATGAAGGA CGAGGACCAG CCCCTGGCCA AGGCCCAGGT GGCGGCGGGC CTGGCCCTGA TGGGCGACCA GGCGCGGTCC CGCTCGGCCA TGCGCCAGGC GGTCCGTTCG CTGGGGTGGC GCGACGACAG CGACTGGTAC CAGAGCCCGC TGCGCGACGT GGCCACGATC ACGGCGCTCG CCGTCCAGGC CGGCCAGAGC GACATCGCTC GCCAGTTGCA GGGGCGTCTC GAGAACGTGG TCAAGGATCC CGACGCCCTG AACACCCAGG AGCAGGCGGC GGTGCTGTTC GCGGCCTCGC AACTGCTCAA GGCCGCTGGT CCGATCACCA TCGAGGCCCA AGGCGTGACG GCCCTGCCGC CGGCCGGCGG CGCGCCGCGC TGGGCTGTGG GCAAGCTGGC CGACGCCCGC TTCGTCAACA AGGGCAAGGG CGCCCTGTGG CGGACGGTCA GCGTGCGCGG CACGCCGATC GCCGCGCCAG GCGCCGAAAG CAATGGCCTG TCGGTGTCCA AGCGGCTGTT CTCGATGAAC GGCGGGGCGA TCGATCCCAG CCAGATCCAC CAGGGCGACC GGGTGATCGT GCTGGTGTCC GGGCGCTCGA TGCAGGCCCG CTCGACCGCC CTGGTGGTCG ATGACGCCCT GCCGGCCGGC TTCGAGATCG AGACCACCCT GGGCGCCGAC GACGCCCAGA ATGGGCCGTT CAAGTTCCTG GGCGAGCTGA CCAATCCCGA TGTCCAGGAA AGCCGCGACG ACCGCTATAT CGCTGCGCTG GACCTGGCCG GCGAGAAACC GTTCGCTATG GCCTATGTGG CGCGGGCCGT GACGCCGGGC GAGTTCTTCC TGCCTGGGGC CGTGGCCAAG GACATGTACC GGCCCAGCCT GAACGCGCGG TCGGACGCGG GGCGGATCAC CGTGGCGCCG GGAGGGTAG
|
Protein sequence | MSTDETPPEG GRGAPSTPWA DRWEAFKGRI PPSLKSPLFA VAVGALVVGF GGGFAVGKVA DFGWFGGKSA ATAEAPKGQS WSLFGKPRSA NAPRRGVPKP EGFAVWRSRI DSSGAEPMAC VQMSKPLDPS KAYADFVLIS PDLGRQPAVR VKGDELCLGG VGFTDHRVTL LKGLPGKTGE TLGANADVDF TFCEKPPYVG FAGDGVILPR EESDGVALET MNVSKLAIEV WRVSDRNLVR KSISAPDPSG EGDYASDYGD DSPDDEGRQV WKGVIDVQGA AGQKATTVFP LGAVLKEMKP GGYVIKARDA SGGRKPEGDE EPSPAQARRW IMFTDMALIG YDGAESLDVV VRSLKTAKTL SGVKVTLVAK DGEDLAVAKS DADGRARFPR ALMDGEGASH AKMVMAYGDQ GDLAVLDLDR SPVDLSKQGV GGRTESDGGR ALSSDIDGYL YADRGIYRPG ETVHLTAMVR DRLAKAVNDR KGYILVKRPS GVEFKRYPFS RADAGAVLAD IALPRSAPRG RWTAVLKMEG VEADSGSFSF SVEDFAPQRL AVTATGQESV PVGAGQERKI DVSARFLYGA PGAGLQTQGE ARLKTDTDPF PQFKGYEWGD DLTPFDEKFI ELGTTVTDGD GHAMLNLATT EAGDTAQPLV AAVTASVFEP GGRPVREALE LKVRGKPVYY GVKVEQGDAG RGDPPVSLEM IAVNAAGARI ASTATYTLIS ENWNYDWFQQ DGRWQWRRTS RDAVVAKATV NIGAGAPARF NRRLGWGDYR LVVEGPDGSK TVTKFSSGWG SPAKEGEAPD FVRVSAGTKA YAQGDTVEIT LKSPYAGQAQ IAVATDRLIE FKTLSVGENG TTVKLKTSAA WGGGAYVMVT VIQPRDPVSS PKPKRALGLI YVPLDPKGRK LTVDIGTPVK LDSKAPVDVP IKVNGLGFGQ RAKVTIAAVD EGILRLTRQD SPDPAKWYFG KRALTLNYRD DYGRLLDPNM GAPANVNFGA DELGGEGLTT TPIKTVALWS GIVETGLDGK AVVKLPAADF NGELRIMAVA WTDTAVGSGS KPLTVRQPVV ADLNLPRFLA PGDKPMATLE LHNVEGKAGD YSVEAWSTNG IAVAFKKVIT LMLGQRIAEK IPFLAPNVTG IGKIGFKVAG PGFNTSKDYP IQTRLGWGDV VRTTTELQQP GMSYTPNAQL LSGLAAGDIT LQVSYSPFKG FDPSAVAVAL QRYPYGCTEQ LVSTAYPLLY AQSVSSDPKL RRNPAILAGA VGKLLDRQTL DGAFGLWRVG DGEADAWLGA YATDFLVEAK AQGVAVPDEA MDKALNAMRQ ISRPDGWSSV SYRLEYPEWW GRTPDDSKKA TERMRRRASA YALYVMAKAG RGDLARLRWW HDVQMKDEDQ PLAKAQVAAG LALMGDQARS RSAMRQAVRS LGWRDDSDWY QSPLRDVATI TALAVQAGQS DIARQLQGRL ENVVKDPDAL NTQEQAAVLF AASQLLKAAG PITIEAQGVT ALPPAGGAPR WAVGKLADAR FVNKGKGALW RTVSVRGTPI AAPGAESNGL SVSKRLFSMN GGAIDPSQIH QGDRVIVLVS GRSMQARSTA LVVDDALPAG FEIETTLGAD DAQNGPFKFL GELTNPDVQE SRDDRYIAAL DLAGEKPFAM AYVARAVTPG EFFLPGAVAK DMYRPSLNAR SDAGRITVAP GG
|
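The sequences above are printed in blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence, the code below strips the spacing and translates codon by codon. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in the start codons it permits, and this sketch does not handle alternative starts (the gene here begins with ATG, so that does not matter for this record). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and only standard-library calls are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks and final nine bases of the gene sequence above.
    print(translate("ATGTCGACGG ACGAGACTCC GCCCGAAGGC"))  # MSTDETPPEG
    print(translate("GGAGGGTAG"))                          # GG
```

The two outputs match the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.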