Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3824 |
Symbol | |
ID | 5901286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4142649 |
End bp | 4144388 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564346 |
Product | hypothetical protein |
Protein accession | YP_001685448 |
Protein GI | 167647785 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3843] Type IV secretory pathway, VirD2 components (relaxase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG AGGACGATTT CCGCATCCGG CCCGGCCGCA TCCGCTCGAC CAGCGCGCAA CGGGCGCGGC CCTTTATCGC CCAGGCGCTT GCCGCCGCCC AACGCGCGGG CGGCAGCATT TCCCGACAAG GAAAGATCGG ACCTGGCGCG CGCTCGCGTT TTGGCGCCGG CCAGCGCGCC TCGATCCAGG CGAACCGCCT GATCACCGCT CGCTCGCGCG GCGCGGTCAT CAAGGCGCGC GTCGTTCGCC ATAGTGGCCG CTCCGCGCCG CTCGGCACCC ATCTCAATTA TCTCCAGCGC GACGGCGTCA CCCGCGACGG AGAGAAAGCC CGGCTGTTCG GCTCCGGTGC CGACGACATC GACGCCAAGG CCTTCGCCGA ACGCGCCGAG GACGACCGCC ACCACTTCCG CTTCATCGTC AGTCCGGACG ACGCGACGGA GATGTCCGAT CTCAAGGGCT TTACCCGCGA GTTGGTCGGC CAGATGGAGA AGGATCTCGG CACGCGCCTC GAATGGGTCG CGGTCGATCA CTGGAACACC GAGCACCCGC ATGTCCATCT GATCGTCCGC GGTGTCCGCG ACGACGGCGA GAACCTGGTC ATCTCGCGCG ACTACATCAA GGAGGGAATG CGCGACCGGG CGCGCGAGCT GATCACCCAG GAGTTGGGGC CGCGCACCGA TCAGGAGATC CGCCGCACGA TCGAACGCCA GATCGACGCC GACCGCTGGA CCAATCTCGA TCGCCAGCTT GCCCGTGACA GCTATCGCAC CGGCGTCATC GACCTTGCGC CGCGCGCCGA TCGCCAGCCC GACGAATTCC ATGCGCTGAA AGTCGGCCGA CTTCGCAAGC TCGAAGGGCT CGGCCTCGCC GACGAGATTG GCCCCGGCCA GTGGACTATC TCCGAAAAAG CCGAGGCGAC GATGCGCGAA CTCGGCGAAC GCGGCGACAT CATCAAGCGC ATTCATCATG GCCTGACCGA GCGCGGCATC GAGCGTGGCG CTTCGAGCTA TGTACTCGCG GCCGAGAGCC TCGACGAACC TGTCGTCGGC CGCCTGGTCC AGCGCGGTCT CGACGACGAG CTGAAGGGCA CGGCCTATGC CGTTGTCGAC GGCATCGATG GGCGCACGCA CCACGTCAAG CTGCCGGACT TGGACGCCGC CGGCGACAGC GCGCCGGGCT CTATTGTCGA GCTGCGAAAG TTCGATGACG CCCAGGGGCG CAGGCGCCTC GCGCTGGCGG TCCGATCCGA TCTCGACATC GAGCGCCAGG TCTCCGCGAC CGGGGCAACG TGGCTCGATC GGCAGGCCAT CGCCCGCGAG CCGGTGGCGC TTGGCGGCGG TGGCTTCGGC GCCGAGGTGC GCGACGCCAT GGACCGGCGC GCCGAACATC TTGTCGGCCA GGGCCTCGCC GAGCGGCAAT CGCGCGGCGT CAGTTTCTCG CGCGGCCTGA TCGACACGCT GCGCCGGCGC GAGCTGGACG CGGCGGCGGC TCGCCTGTCG ACCGAGACAG GTCGGCCGCT GACGAAACAG AACGAGGGCG AATACGTCAC CGGGGCATAC CAGCGCCGCC TGACGCTCGC TTCCGGCCGC TTCGCGATGA TCGATGACGG CCTCGGCTTC AGCCTCGTGC CGTGGTCGCC ATCCCTCGAA AAGCAGCTCG GGAAGCATGT CTCCGGTATC TCGCGCGCCG ACGGCGGCGT CGACTGGAGC TTTGGCCGAA ACAGGGGGCT CGGCCGATGA
|
Protein sequence | MSGEDDFRIR PGRIRSTSAQ RARPFIAQAL AAAQRAGGSI SRQGKIGPGA RSRFGAGQRA SIQANRLITA RSRGAVIKAR VVRHSGRSAP LGTHLNYLQR DGVTRDGEKA RLFGSGADDI DAKAFAERAE DDRHHFRFIV SPDDATEMSD LKGFTRELVG QMEKDLGTRL EWVAVDHWNT EHPHVHLIVR GVRDDGENLV ISRDYIKEGM RDRARELITQ ELGPRTDQEI RRTIERQIDA DRWTNLDRQL ARDSYRTGVI DLAPRADRQP DEFHALKVGR LRKLEGLGLA DEIGPGQWTI SEKAEATMRE LGERGDIIKR IHHGLTERGI ERGASSYVLA AESLDEPVVG RLVQRGLDDE LKGTAYAVVD GIDGRTHHVK LPDLDAAGDS APGSIVELRK FDDAQGRRRL ALAVRSDLDI ERQVSATGAT WLDRQAIARE PVALGGGGFG AEVRDAMDRR AEHLVGQGLA ERQSRGVSFS RGLIDTLRRR ELDAAAARLS TETGRPLTKQ NEGEYVTGAY QRRLTLASGR FAMIDDGLGF SLVPWSPSLE KQLGKHVSGI SRADGGVDWS FGRNRGLGR
|
| |