Gene Caul_3424 details


Gene Information

Locus tag: Caul_3424
Symbol:
ID: 5900879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3701396
End bp: 3703174
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 69%
IMG OID: 641563930
Product: von Willebrand factor type A
Protein accession: YP_001685049
Protein GI: 167647386
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
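
The start, end, gene length, and protein length listed above are mutually consistent, which a short Python sketch can check. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3701396
end_bp = 3703174

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1779
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 592
```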


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.236212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCAC GCCCCCTTCG TATCCTCGGT TCCACAGCCC TCTCCCTGCT GATCGCCGGA 
TCCGCGAGTG GACCCAGCCT GGCCCTTTCG GAGCCAGGCG ACGGCGTGGT GACACCCGAG
GCCTGCGCGG CCCTCGGCTA CGCCGCGCCC GCTCCAGACG GTTACGATAG CGTCGTCGTC
ACCGCGACCA AACGGACCTC CCGGGAGCAG AGACTCACGT CGAGAGCCCG GCCCGTGATC
GCGACGCCCG GGCTGACGCC ACCACCGCCA CCGCCTCCGC CATCACCACC GCCGCCTCCG
GCCGCCTACA GCTTCGCGGC CCCCTCCCCC GTCGTTGCGC CCAACTTCGC GCCGCCGATC
CGCGACACCG AGAAATACCC CGGCGCGGCG GCCAACCCGG TCAAGCGCGT GGCCGAGGAA
CCGGTCTCGA CCTTCTCGAT CGATGTCGAC ACCGCCGCCT ACGCCAATGT CCGCCGCTTC
CTCAACGAGG GCGCCGCCCC GCCTCACGAT GCCTTGCGGG TCGAGGAGCT GATCAACTAT
TTCGACTACG GCTACGCCAG GCCCACCGCC CAGGAGCCTC CCTTCAAGCC GACCGTGACC
GTGGTTCCCT CGCCCTGGTC GCAGGATCGC CAGCTGATGC ACATCGGCGT GCAGGGCTAT
GCGACGCCGC GCGCCGGCCA GCCGCCACTG AACCTGGTGT TCCTGATCGA CACCTCCGGC
TCGATGTCCG GCCCCGATCG CCTGCCCCTG GCCAAGAAGG CGCTGAACGT GCTGATCGAC
CAGCTTCGGC CGCAGGATCG GGTGTCGATG GTCGCCTATG CCGGTTCGGC CGGGGCGGTG
CTGTCGCCCA CCGACGGCAA GTCGAAGCTC AAGATGCGCT GCGCCCTGAC CGCCCTGCGG
TCCGGCGGCT CGACCGCCGG CGGCCAGGGG CTGGAACTCG CCTACGCCCT GGCCAGGCAG
AACCTCGACC CCAAGGCCGT CAACCGGGTG ATCCTGATGA CCGACGGCGA CTTCAATGTC
GGCATCGCCG ATCCGACGCG CCTGAAGGAT TTCGTCGCCG ACCAGCGCAA GAGCGGCGTC
TACCTCTCGG TCTACGGCTT CGGGCGCGGC AACTACAACG ACACGATGAT GCAGGCCCTG
GCCCAGAACG GTAACGGCAC GGCCGCCTAT GTCGACGGCC TGCAGGAAGC CCGCAAGCTG
CTGCGCGACG ACTTCGACAG CGCCCTGTTC CCGATCGCCG ACGACGTTAA GATCCAGGTC
GAGTTCAATC CGGCCAAGGT CAGCGAATAT CGGTTGATCG GCTACGAGAC CCGGCTGCTC
AATCGCGAGG ACTTCAACAA CGACCAGGTC GACGCCGGCG AGATAGGCTC TGGCGCGGCG
GTCACGGCGA TCTACGAGAT CACCCCGGTC GGGGCGAAGC CGTCGTCCGA TCCGCTGCGC
TATGGCGCCA AGCCGTCGCC GGCGACGGGC GGGAGCGAGC TGGCCTTCCT GAAGATCCGC
TACAAGCCGC CCGGCGGCTC GACCTCGAAG CTGATCGAGC GCCCGATCGG GGCTGGCGAT
ATGCACGCCA GTCTGGCCGC GGCGCCGGAG GCCACCCGCT TCGCCGTCGC CGTGGCCGCC
TACGGCCAGA AATTGCGCGG CGATCCCTGG GTCGACGCCA GCTTCGACTG GGACGCTGTC
ACCGCCCTCG CCCAGGGCGC GCGGGGCGAA GACCCCTACG GCCTGCGCGC CGAGTTCGTG
CAGTTGACCC GCGCGGCCAA GGACGTGAAG GGAAGCTAG
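
The GC content field in the Gene Information section can be recomputed from the gene sequence once the spaces and line breaks used for display are stripped. The following is a minimal Python sketch; `gc_percent` is a hypothetical helper name, and the input shown is only the first two 10-base blocks of the sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the display whitespace (block separators, line breaks).
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two blocks of the gene sequence above.
fragment = "ATGGCGCCAC GCCCCCTTCG"
print(gc_percent(fragment))  # 75.0
```

Passing the full gene sequence to the same function gives a value that can be compared against the listed 69%.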
 
Protein sequence
MAPRPLRILG STALSLLIAG SASGPSLALS EPGDGVVTPE ACAALGYAAP APDGYDSVVV 
TATKRTSREQ RLTSRARPVI ATPGLTPPPP PPPPSPPPPP AAYSFAAPSP VVAPNFAPPI
RDTEKYPGAA ANPVKRVAEE PVSTFSIDVD TAAYANVRRF LNEGAAPPHD ALRVEELINY
FDYGYARPTA QEPPFKPTVT VVPSPWSQDR QLMHIGVQGY ATPRAGQPPL NLVFLIDTSG
SMSGPDRLPL AKKALNVLID QLRPQDRVSM VAYAGSAGAV LSPTDGKSKL KMRCALTALR
SGGSTAGGQG LELAYALARQ NLDPKAVNRV ILMTDGDFNV GIADPTRLKD FVADQRKSGV
YLSVYGFGRG NYNDTMMQAL AQNGNGTAAY VDGLQEARKL LRDDFDSALF PIADDVKIQV
EFNPAKVSEY RLIGYETRLL NREDFNNDQV DAGEIGSGAA VTAIYEITPV GAKPSSDPLR
YGAKPSPATG GSELAFLKIR YKPPGGSTSK LIERPIGAGD MHASLAAAPE ATRFAVAVAA
YGQKLRGDPW VDASFDWDAV TALAQGARGE DPYGLRAEFV QLTRAAKDVK GS
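
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The Python sketch below is an illustration, not the database's own method: it builds the codon table from the standard code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). `translate` is a hypothetical helper, and only the first and last few codons of the gene are used as input.

```python
# Compact codon table: bases in TCAG order, amino acids in matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


first_block = "ATGGCGCCAC GCCCCCTTCG TATCCTCGGT"  # first 30 bases of the gene
last_codons = "GGAAGCTAG"                         # last 9 bases of the gene
print(translate(first_block))  # MAPRPLRILG
print(translate(last_codons))  # GS
```

Both outputs match the corresponding ends of the protein sequence listed above.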