Gene Information |
Locus tag | Caul_4157 |
Symbol | |
ID | 5901619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4507044 |
End bp | 4508342 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641564678 |
Product | integrase family protein |
Protein accession | YP_001685779 |
Protein GI | 167648116 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
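The length fields in the table above can be cross-checked against the coordinates. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG).

```python
# Values copied from the Gene Information table.
START_BP = 4507044
END_BP = 4508342


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length_bp)
    print("Protein length (aa):", protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.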
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.225884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.636127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGCTGT CCATTCGAAC CGTCGAAGGC CTGGAGCCGA GCGACACTGA CTACTTCGTC TGGGATGACG AACTACCGGG CTTTGGGCTG CGAGTGTACG CCTCGGGCCG CAAGACCTAC CTCGTCCAGT ACCGGTCGAG CCGGCGCACC CGGCGCCTGA CGCTCGGCAT CCACGGCGCC CTTACGCCCG AGATGGCTAG GCGCGAGGCC AAGATTCATC TCGGTGGCAT CGCCAAGGGC GGTGACCCGG CCGAGGAGAA GAAGACCAAC CGCAACGGCA TGACCGTTCG CGAGCTCTGC GAACGATACA TGGAGGACGC TGAGATCGGC CTGGTCCCCG GCAAGCGGCG GATGCCGAAG AAGCCGTCCA CCCTCCAGGC TGATCGCAGC CGCATCAGCT CCCACATTAC GCCGTTGCTC GGCACGCGCC TGGTCAAGGA GGTCACGCGC GCCGACATCC ATCAGTTCAT GCGCGACGTC GCCCTCGGCA AGGCGCGCCG CGATCGCAAG ACCAAGCTGA GGGGCCGTTC GATCGTGCGC GGCGGTCTGG GCACGGCGGG GCGCACCCTG GGTCTGCTCG GCGGCATCTT TACCTACGCT CAGCGGATCG GCGTGATCGA GAACAACCCC GTTCGAGGCA TCCCCAAGCC GGCGGTGAAC AGTCGCTATC GACGTCTCAG CGCAGCTGAA TATCGGACGC TGGGCGAAAC GCTGCGCGAG GCCGAGCAAG AAGGGCACAA CGAGAAGGCC CTGACGATCA TCAAGTTGCT GGCGCTGACA GGGTGTCGGC GGGGCGAGAT TGAGAAGCTG CGTTGGGAGG AGGTCGACGA CGCCGGCCGC TGTTTCCGGC TGGCGGACAC CAAGGAGGGG CGCTCGATCC GTCCGATCGG CCCGGAGGTC TTTGCCTTGC TCAATCATCG GCGGCCTGCC GACGCCAGGG GTTACGTCTT TGAAGGCGAC GTCGCCGGCA AGGCCTTCGA CGGCGTGCCC AAGGTGTGGG GCAAGACGAT CCGCAAGCAG CTGATGGATG TGACGCCACA CGTTTTGCGG CACAGCTTCG CCAGCATGGC GAACGATCTC GGCTTTACCG AAGCGACGAT CGCCGCGCTC CTTGGCCATG CGGCCGGCAC CACGACCAGC CGCTACGTCC ACAACCTCGA CACCGCCTTG ATCTCGGCCG CGGAGAAGGT TTCAGGGCAT ATCGCCGGCC TGCTACGCCA TGCCGACGAG CGCCAAGGTC TCAAGTCGCT TTTCGAAAGA TTCGGTGAAG CCACGGGCTC GTGGGGCAAG GCCGCGTAG
|
Protein sequence | MRLSIRTVEG LEPSDTDYFV WDDELPGFGL RVYASGRKTY LVQYRSSRRT RRLTLGIHGA LTPEMARREA KIHLGGIAKG GDPAEEKKTN RNGMTVRELC ERYMEDAEIG LVPGKRRMPK KPSTLQADRS RISSHITPLL GTRLVKEVTR ADIHQFMRDV ALGKARRDRK TKLRGRSIVR GGLGTAGRTL GLLGGIFTYA QRIGVIENNP VRGIPKPAVN SRYRRLSAAE YRTLGETLRE AEQEGHNEKA LTIIKLLALT GCRRGEIEKL RWEEVDDAGR CFRLADTKEG RSIRPIGPEV FALLNHRRPA DARGYVFEGD VAGKAFDGVP KVWGKTIRKQ LMDVTPHVLR HSFASMANDL GFTEATIAAL LGHAAGTTTS RYVHNLDTAL ISAAEKVSGH IAGLLRHADE RQGLKSLFER FGEATGSWGK AA
|
| |
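The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned up and its GC fraction computed is shown below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = join_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as displayed in the record.
    prefix = "GTGAGGCTGT CCATTCGAAC"
    print(join_blocks(prefix))
    print(round(gc_fraction(prefix), 2))
```

Applying `gc_fraction` to the complete Gene sequence field would give a value to compare with the reported GC content.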