Gene Caul_0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0067 
Symbol 
ID5897779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp75991 
End bp81039 
Gene Length5049 bp 
Protein Length1682 aa 
Translation table11 
GC content70% 
IMG OID641560550 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_001681703 
Protein GI167644040 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGG ACGAGACTCC GCCCGAAGGC GGCCGCGGCG CTCCCTCCAC GCCCTGGGCC 
GATCGCTGGG AAGCCTTCAA GGGCAGGATT CCGCCGAGCC TGAAGTCGCC GCTGTTCGCG
GTGGCGGTCG GCGCCTTGGT GGTGGGATTC GGCGGCGGCT TCGCGGTCGG CAAGGTCGCC
GATTTCGGCT GGTTCGGCGG CAAGTCGGCC GCCACGGCCG AGGCGCCCAA GGGCCAGTCC
TGGTCGCTGT TCGGCAAGCC CCGCTCGGCC AACGCGCCCC GGCGCGGCGT TCCCAAGCCC
GAGGGCTTCG CCGTCTGGCG CAGCCGGATC GACAGCTCCG GCGCGGAGCC CATGGCCTGC
GTCCAGATGA GCAAGCCGCT CGACCCGTCC AAGGCCTATG CCGACTTCGT GCTGATCTCG
CCCGACCTGG GCCGCCAGCC GGCCGTGCGG GTCAAGGGCG ACGAGCTGTG CCTCGGCGGC
GTCGGCTTCA CCGACCATCG TGTCACCCTG CTCAAGGGCC TGCCGGGCAA GACCGGCGAG
ACCCTGGGCG CCAACGCCGA CGTCGACTTC ACCTTCTGCG AAAAGCCGCC CTATGTCGGT
TTCGCCGGTG ACGGCGTGAT CCTGCCGCGC GAGGAGTCCG ACGGTGTGGC GCTTGAGACC
ATGAACGTCT CCAAGCTGGC GATCGAGGTC TGGCGCGTCT CGGACCGCAA TCTGGTGCGC
AAGTCGATCA GCGCGCCCGA TCCGAGCGGC GAGGGCGACT ACGCCAGCGA CTATGGCGAC
GACAGTCCCG ACGACGAGGG CCGCCAGGTC TGGAAGGGCG TGATCGACGT CCAGGGCGCG
GCCGGCCAGA AGGCGACCAC CGTCTTCCCC CTCGGCGCGG TGCTGAAGGA GATGAAGCCA
GGCGGCTACG TGATCAAGGC GCGAGACGCC TCGGGCGGCC GCAAGCCGGA GGGCGACGAG
GAGCCGAGCC CGGCCCAGGC CCGCCGCTGG ATCATGTTCA CCGACATGGC GCTGATCGGC
TACGACGGCG CGGAGTCGCT GGACGTGGTG GTCCGCTCGC TGAAGACCGC CAAGACCCTG
TCGGGCGTCA AGGTCACGCT GGTGGCCAAG GACGGCGAGG ACCTGGCCGT GGCCAAGAGC
GACGCGGACG GCCGCGCGCG CTTCCCCCGC GCGCTGATGG ACGGCGAGGG CGCGTCTCAC
GCCAAGATGG TCATGGCCTA TGGCGACCAG GGCGACCTGG CGGTGCTGGA CCTGGACCGC
TCGCCGGTCG ACCTGTCCAA GCAGGGCGTC GGCGGCCGCA CCGAGTCCGA CGGCGGCCGG
GCGCTCAGCA GCGACATCGA CGGCTATCTC TATGCCGATC GCGGCATCTA TCGGCCCGGC
GAGACCGTCC ACCTGACGGC CATGGTCCGC GACCGGCTGG CCAAGGCGGT CAACGACCGC
AAGGGCTACA TCCTGGTCAA GCGGCCCTCG GGCGTGGAGT TCAAGCGCTA TCCGTTCAGC
CGCGCCGACG CCGGCGCCGT GCTGGCCGAC ATCGCCCTGC CGCGCAGCGC GCCGCGCGGC
CGTTGGACGG CGGTGCTGAA GATGGAGGGG GTCGAGGCCG ACTCCGGATC ATTCAGCTTC
AGCGTCGAGG ACTTCGCGCC GCAACGGCTG GCGGTCACCG CCACGGGCCA GGAGTCCGTC
CCGGTCGGGG CCGGCCAGGA GCGCAAGATC GACGTCTCCG CGCGCTTCCT GTACGGCGCG
CCCGGCGCGG GCCTGCAAAC CCAGGGCGAG GCGCGGCTGA AGACCGACAC CGACCCCTTC
CCGCAGTTCA AGGGCTACGA GTGGGGCGAC GACCTGACGC CCTTCGACGA GAAGTTCATC
GAACTGGGCA CGACCGTCAC CGACGGCGAC GGCCACGCCA TGCTGAACCT GGCCACCACC
GAGGCCGGCG ACACCGCCCA GCCGCTGGTG GCGGCGGTGA CGGCCTCGGT GTTCGAGCCC
GGCGGCCGGC CGGTCCGCGA GGCCTTGGAG CTCAAGGTGC GCGGCAAGCC GGTCTATTAC
GGCGTCAAGG TCGAGCAAGG CGACGCCGGG CGCGGGGATC CGCCTGTCAG CCTGGAGATG
ATCGCGGTCA ACGCCGCCGG CGCCCGGATC GCGTCGACGG CGACCTACAC CCTGATCAGC
GAGAACTGGA ACTATGACTG GTTCCAGCAG GACGGACGCT GGCAGTGGCG GCGCACCAGC
CGCGACGCGG TGGTGGCCAA GGCCACGGTC AATATCGGCG CGGGAGCGCC CGCGCGCTTC
AACCGCCGGC TTGGCTGGGG CGACTACCGC CTGGTGGTCG AGGGGCCGGA CGGCAGCAAG
ACCGTCACCA AGTTCTCATC CGGCTGGGGT TCGCCCGCCA AGGAAGGCGA GGCGCCCGAC
TTCGTCCGCG TCAGCGCCGG GACCAAGGCC TATGCGCAAG GCGACACGGT GGAGATCACC
CTGAAGTCGC CCTACGCCGG TCAGGCCCAG ATCGCCGTGG CGACCGACCG CTTGATCGAA
TTCAAGACTC TCAGCGTCGG CGAGAACGGT ACGACCGTGA AGCTGAAGAC CTCGGCCGCC
TGGGGCGGCG GGGCCTATGT GATGGTCACG GTGATCCAGC CGCGCGACCC GGTCAGCTCG
CCCAAGCCCA AGCGGGCCCT GGGGCTGATC TATGTCCCGC TCGACCCCAA GGGCCGCAAG
CTGACGGTCG ATATCGGCAC GCCGGTGAAG CTGGACTCCA AGGCCCCGGT CGACGTGCCG
ATCAAGGTCA ATGGCCTGGG CTTTGGCCAA AGGGCCAAGG TGACGATCGC GGCGGTGGAC
GAGGGCATCC TGCGCCTGAC GCGGCAGGAC AGCCCCGACC CGGCCAAGTG GTACTTCGGC
AAGCGGGCCC TGACCCTGAA CTATCGCGAC GACTACGGCC GCCTGCTCGA CCCGAACATG
GGTGCGCCGG CCAATGTCAA TTTCGGCGCC GACGAACTGG GCGGCGAGGG ATTGACGACC
ACGCCGATCA AGACGGTGGC CCTGTGGTCG GGCATCGTCG AGACCGGGCT GGACGGCAAG
GCCGTGGTCA AGCTGCCAGC CGCCGACTTC AATGGCGAAC TGCGGATCAT GGCCGTGGCC
TGGACCGACA CCGCCGTCGG CTCGGGCTCC AAGCCGCTGA CCGTGCGCCA GCCGGTCGTG
GCCGACCTCA ACCTGCCGCG CTTCCTGGCC CCCGGCGACA AGCCGATGGC CACGCTGGAG
CTGCACAATG TCGAGGGCAA GGCCGGCGAC TATTCGGTCG AGGCCTGGTC GACCAACGGC
ATCGCGGTGG CTTTCAAGAA GGTCATCACC CTGATGCTGG GCCAGCGGAT CGCCGAGAAG
ATCCCTTTCC TGGCCCCCAA TGTCACCGGG ATCGGCAAGA TCGGCTTCAA GGTGGCCGGT
CCGGGCTTCA ACACGTCCAA GGATTACCCG ATCCAGACCC GCCTGGGCTG GGGCGACGTG
GTGCGCACGA CGACCGAGCT GCAGCAGCCA GGCATGAGCT ACACGCCCAA CGCCCAGTTG
CTGTCGGGCC TGGCGGCCGG CGACATCACC CTGCAGGTCA GCTACTCGCC GTTCAAGGGC
TTCGACCCCT CGGCGGTCGC GGTGGCGCTG CAGCGCTATC CCTATGGCTG CACCGAGCAG
TTGGTCTCGA CCGCCTATCC GCTGCTCTAC GCCCAGAGCG TCTCCAGCGA CCCCAAGCTG
AGACGCAATC CGGCGATCCT GGCCGGGGCG GTGGGCAAGC TGCTGGACCG CCAGACCCTG
GACGGGGCCT TCGGCCTGTG GCGGGTCGGC GACGGCGAGG CCGACGCCTG GCTGGGCGCC
TACGCCACTG ACTTCCTGGT GGAAGCCAAG GCCCAGGGCG TGGCCGTGCC GGATGAGGCG
ATGGACAAGG CGCTGAACGC CATGCGCCAG ATCAGCCGGC CCGACGGCTG GAGCTCGGTG
TCCTACCGCC TGGAATATCC CGAATGGTGG GGTCGCACGC CCGACGACTC CAAGAAGGCC
ACCGAGCGCA TGCGCCGCCG GGCCTCGGCC TACGCCCTCT ATGTGATGGC CAAGGCCGGG
CGCGGGGATC TGGCCCGGCT GCGCTGGTGG CACGACGTGC AGATGAAGGA CGAGGACCAG
CCCCTGGCCA AGGCCCAGGT GGCGGCGGGC CTGGCCCTGA TGGGCGACCA GGCGCGGTCC
CGCTCGGCCA TGCGCCAGGC GGTCCGTTCG CTGGGGTGGC GCGACGACAG CGACTGGTAC
CAGAGCCCGC TGCGCGACGT GGCCACGATC ACGGCGCTCG CCGTCCAGGC CGGCCAGAGC
GACATCGCTC GCCAGTTGCA GGGGCGTCTC GAGAACGTGG TCAAGGATCC CGACGCCCTG
AACACCCAGG AGCAGGCGGC GGTGCTGTTC GCGGCCTCGC AACTGCTCAA GGCCGCTGGT
CCGATCACCA TCGAGGCCCA AGGCGTGACG GCCCTGCCGC CGGCCGGCGG CGCGCCGCGC
TGGGCTGTGG GCAAGCTGGC CGACGCCCGC TTCGTCAACA AGGGCAAGGG CGCCCTGTGG
CGGACGGTCA GCGTGCGCGG CACGCCGATC GCCGCGCCAG GCGCCGAAAG CAATGGCCTG
TCGGTGTCCA AGCGGCTGTT CTCGATGAAC GGCGGGGCGA TCGATCCCAG CCAGATCCAC
CAGGGCGACC GGGTGATCGT GCTGGTGTCC GGGCGCTCGA TGCAGGCCCG CTCGACCGCC
CTGGTGGTCG ATGACGCCCT GCCGGCCGGC TTCGAGATCG AGACCACCCT GGGCGCCGAC
GACGCCCAGA ATGGGCCGTT CAAGTTCCTG GGCGAGCTGA CCAATCCCGA TGTCCAGGAA
AGCCGCGACG ACCGCTATAT CGCTGCGCTG GACCTGGCCG GCGAGAAACC GTTCGCTATG
GCCTATGTGG CGCGGGCCGT GACGCCGGGC GAGTTCTTCC TGCCTGGGGC CGTGGCCAAG
GACATGTACC GGCCCAGCCT GAACGCGCGG TCGGACGCGG GGCGGATCAC CGTGGCGCCG
GGAGGGTAG
 
Protein sequence
MSTDETPPEG GRGAPSTPWA DRWEAFKGRI PPSLKSPLFA VAVGALVVGF GGGFAVGKVA 
DFGWFGGKSA ATAEAPKGQS WSLFGKPRSA NAPRRGVPKP EGFAVWRSRI DSSGAEPMAC
VQMSKPLDPS KAYADFVLIS PDLGRQPAVR VKGDELCLGG VGFTDHRVTL LKGLPGKTGE
TLGANADVDF TFCEKPPYVG FAGDGVILPR EESDGVALET MNVSKLAIEV WRVSDRNLVR
KSISAPDPSG EGDYASDYGD DSPDDEGRQV WKGVIDVQGA AGQKATTVFP LGAVLKEMKP
GGYVIKARDA SGGRKPEGDE EPSPAQARRW IMFTDMALIG YDGAESLDVV VRSLKTAKTL
SGVKVTLVAK DGEDLAVAKS DADGRARFPR ALMDGEGASH AKMVMAYGDQ GDLAVLDLDR
SPVDLSKQGV GGRTESDGGR ALSSDIDGYL YADRGIYRPG ETVHLTAMVR DRLAKAVNDR
KGYILVKRPS GVEFKRYPFS RADAGAVLAD IALPRSAPRG RWTAVLKMEG VEADSGSFSF
SVEDFAPQRL AVTATGQESV PVGAGQERKI DVSARFLYGA PGAGLQTQGE ARLKTDTDPF
PQFKGYEWGD DLTPFDEKFI ELGTTVTDGD GHAMLNLATT EAGDTAQPLV AAVTASVFEP
GGRPVREALE LKVRGKPVYY GVKVEQGDAG RGDPPVSLEM IAVNAAGARI ASTATYTLIS
ENWNYDWFQQ DGRWQWRRTS RDAVVAKATV NIGAGAPARF NRRLGWGDYR LVVEGPDGSK
TVTKFSSGWG SPAKEGEAPD FVRVSAGTKA YAQGDTVEIT LKSPYAGQAQ IAVATDRLIE
FKTLSVGENG TTVKLKTSAA WGGGAYVMVT VIQPRDPVSS PKPKRALGLI YVPLDPKGRK
LTVDIGTPVK LDSKAPVDVP IKVNGLGFGQ RAKVTIAAVD EGILRLTRQD SPDPAKWYFG
KRALTLNYRD DYGRLLDPNM GAPANVNFGA DELGGEGLTT TPIKTVALWS GIVETGLDGK
AVVKLPAADF NGELRIMAVA WTDTAVGSGS KPLTVRQPVV ADLNLPRFLA PGDKPMATLE
LHNVEGKAGD YSVEAWSTNG IAVAFKKVIT LMLGQRIAEK IPFLAPNVTG IGKIGFKVAG
PGFNTSKDYP IQTRLGWGDV VRTTTELQQP GMSYTPNAQL LSGLAAGDIT LQVSYSPFKG
FDPSAVAVAL QRYPYGCTEQ LVSTAYPLLY AQSVSSDPKL RRNPAILAGA VGKLLDRQTL
DGAFGLWRVG DGEADAWLGA YATDFLVEAK AQGVAVPDEA MDKALNAMRQ ISRPDGWSSV
SYRLEYPEWW GRTPDDSKKA TERMRRRASA YALYVMAKAG RGDLARLRWW HDVQMKDEDQ
PLAKAQVAAG LALMGDQARS RSAMRQAVRS LGWRDDSDWY QSPLRDVATI TALAVQAGQS
DIARQLQGRL ENVVKDPDAL NTQEQAAVLF AASQLLKAAG PITIEAQGVT ALPPAGGAPR
WAVGKLADAR FVNKGKGALW RTVSVRGTPI AAPGAESNGL SVSKRLFSMN GGAIDPSQIH
QGDRVIVLVS GRSMQARSTA LVVDDALPAG FEIETTLGAD DAQNGPFKFL GELTNPDVQE
SRDDRYIAAL DLAGEKPFAM AYVARAVTPG EFFLPGAVAK DMYRPSLNAR SDAGRITVAP
GG