Gene Caul_3825 details

Gene Information

Locus tag: Caul_3825
Symbol:
ID: 5901287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4144420
End bp: 4146420
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 65%
IMG OID: 641564347
Product: conjugal transfer coupling protein TraG
Protein accession: YP_001685449
Protein GI: 167647786
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
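
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4144420, 4146420)
print(length)                  # 2001
print(protein_length(length))  # 666
```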


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCA CCAAAATTCT CTGGGGCCAG ATCCTCACCG TCTTCCTGAT CGTCCTCCTC 
ACCACCTGGA CCGCGACCGA ATGGACCGCG TGGCGGCTCG GCTTTCAGGC GCAGCTCGGC
CACCCGTGGT TCGAGCTGTT CGGCTGGCCG GTCTATTATC CACCAGCCTT CTTCTGGTGG
TGGTATTTCT ACGATGCCTA TGCGCCACCG ATCTTCGTCG AGGGGGCCTA TATCGCGGCG
TCCGGCGGCT TCATCTCGAT CGCGGTGGCG ATCGGCATGT CGGTGTGGCG GGCGCGCGAG
GCGAAGAATG CCGAGACCTT CGGCTCGGCG CGCTGGGCTC GCCTTGAGGA GGTACGGTCC
GCGGGCCTGC TCGGCGCCGA TGGCGTGGTG CTCGGCAAGA TGGACCGCGA CTATCTGCGC
CACGATGGAC CCGAGCACAT TCTTTGCTTT GCGCCGACGC GCTCGGGCAA GGGCGTCGGC
CTGGTCGTGC CATCACTGCT GACCTGGCCG GGCTCCGCCA TCATTCACGA CATCAAGGGC
GAGAACTGGC AGCTCACCGC CGGCTTCCGC GCCCGCCACG GCCGCGTACT GCTGTTCGAC
CCGACCAACG CCAAGTCGGC CGCCTATAAC CCGCTGCTGG AGGTTCGACG TGGCGAATGG
GAAGTCCGCG ACGTGCAGAA TATCGCCGAC ATTCTGGTCG ACCCCGAAGG CAGTCTTGAG
AAGCGGAACC ACTGGGAAAA AACCAGCCAC GCGCTGCTGG TCGGCGCGAT CCTCCACGTC
CTCTATGCCG AGGAAGACAA GACACTCGCC GGCGTCGCCG GCTTCCTCTC CGATCCAAAG
CGGCCGATTG AATCGACGCT CGCGGCGATG ATGAAGACCG CCCATCTCGG CGAGGCCGGC
CCGCATCCTG TCATCGCCAG TGCGGCCCGC GAGTTGCTGA ACAAGTCCGA CAACGAACGA
TCCGGCGTAC TCTCGACCGC CATGTCGTTC CTGGGATTGT ATCGCGATCC CGTCGTCGCG
GAGGTGACGC GGCGCTGCGA CTGGCGCATC GCCGACATTG TCGGCGGCAA GCACCCGACC
AGTCTCTACC TCGTGGTGCC GCCCTCGGAC ATCAACCGCA CCAAGCCGCT CATCCGCCTG
ATCCTCAATC AGGTTGGTCG CCGGCTGACG GAGGACCTGC AGGCGAAGGT CGGCCGTCAT
CGCCTGCTGC TGATGCTCGA CGAGTTTCCG GCGCTTGGCC GGCTCGACTT CTTCGAGTCC
GCGCTCGCCT TCATGGCAGG CTATGGCCTG AAAGCCTTTT TGATCGCCCA ATCGCTGAAC
CAGATCGAGA GGGCCTACGG CCCCAACAAT TCCATCCTCG ACAACTGCCA TGTGCGCGTC
AGCTTCGCCA CCAACGATGA GCGCACCGCC AAACGCGTTT CCGATGCGCT CGGCACCGCG
ACGGAGATGC GGGCGATGCG TAACTATGCC GGCCATCGTC TCTCGCCCTG GCTCGGCCAC
CTGATGGTGT CCCGCTCGGA AACCGCGCGG CCTCTGCTCA CTCCCGGCGA GGTCATGCAG
CTCCCGCCGG CCGACGAAAT CGTCATGGTC GCCGGAACGC CGCCGATCCG CGCTAGGAAG
GCGCGCTACT TCGAGGATCG GCGACTTCAG GAGCGGGTCC TGGCGCCTCC CAAACTCATC
AAGCCAGAGA CGGGCATGGC GGACGACTGG AGCAAGCTGC CGCTCCCGGC TCGCCCGGTT
GCCGCCCTGA CCGGCGCAGC GGCCAACATT CATAGCGATG GTGATGGCGA TGAAGACGCC
ACGGGTTCCG AACGTCGCCA TCAGCCCGAA CTGGAGCGCG CGAGCCCCGT CGAGAAGAAG
GCGCCGATCG AGAACGAGTT CGAGGTCGGC GTGGACGACG ATCCCGAGGA AGACGCCGCG
CGGATCAGCC GGGCGAACCA GGTGATGCAA GGCGTCGCCC GCCAGGTCTC GCTCGATCTT
AACGACGGCA TGGACCTGTG A
 
Protein sequence
MSATKILWGQ ILTVFLIVLL TTWTATEWTA WRLGFQAQLG HPWFELFGWP VYYPPAFFWW 
WYFYDAYAPP IFVEGAYIAA SGGFISIAVA IGMSVWRARE AKNAETFGSA RWARLEEVRS
AGLLGADGVV LGKMDRDYLR HDGPEHILCF APTRSGKGVG LVVPSLLTWP GSAIIHDIKG
ENWQLTAGFR ARHGRVLLFD PTNAKSAAYN PLLEVRRGEW EVRDVQNIAD ILVDPEGSLE
KRNHWEKTSH ALLVGAILHV LYAEEDKTLA GVAGFLSDPK RPIESTLAAM MKTAHLGEAG
PHPVIASAAR ELLNKSDNER SGVLSTAMSF LGLYRDPVVA EVTRRCDWRI ADIVGGKHPT
SLYLVVPPSD INRTKPLIRL ILNQVGRRLT EDLQAKVGRH RLLLMLDEFP ALGRLDFFES
ALAFMAGYGL KAFLIAQSLN QIERAYGPNN SILDNCHVRV SFATNDERTA KRVSDALGTA
TEMRAMRNYA GHRLSPWLGH LMVSRSETAR PLLTPGEVMQ LPPADEIVMV AGTPPIRARK
ARYFEDRRLQ ERVLAPPKLI KPETGMADDW SKLPLPARPV AALTGAAANI HSDGDGDEDA
TGSERRHQPE LERASPVEKK APIENEFEVG VDDDPEEDAA RISRANQVMQ GVARQVSLDL
NDGMDL
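
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch, the Python below strips that spacing, translates a nucleotide string, and computes GC content. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only short fragments copied from the listing are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_block = "ATGTCCGCCA CCAAAATTCT CTGGGGCCAG"  # start of the gene sequence
last_block = "AACGACGGCA TGGACCTGTG A"           # end of the gene sequence
print(translate(first_block))  # MSATKILWGQ
print(translate(last_block))   # NDGMDL
```

Both fragments translate to the corresponding ends of the listed protein sequence. Applying `gc_percent` to the full gene sequence should give a value close to the 65% listed in the gene information, although that is not verified here.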