Gene Caul_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3468 
Symbol 
ID5900923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3747659 
End bp3749374 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content58% 
IMG OID641563974 
Productrecombinase 
Protein accessionYP_001685093 
Protein GI167647430 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.723341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGTCA AGGCTTACAG CTATGCGCGC TTCTCGTCTA AGGCCCAGGC CGAAGGCGAC 
AGTCTCAGAC GACAACTCAA GGCCGCGTTC GATTGGGCCG ACAAACACGG CCTCCAGCTA
GACACTCGAC ATCGAGACCT AGGTGTCTCC GCATATACCG GCGCGCACCG AGTCAAAGGC
GCACTAGCGA GCTTCATTCG GTCTGTCGAA GACGGTGAGA TAACGCGCGG TAGCTACCTG
CTCGTTGACA GCATGGACCG GCTTAGCCGG GAGAGCGAAA CCGAAGTCCT TCACTTGCTG
CTGACGCTGA CGCGGAACGG CATCAAGGTC GTGAATCTCG CTGAGAACCA TGTCCTCGAT
GAGAAGGCTG AGTTCACCGA CTACATTCGA GTCCTAATCC ATGCATCGCG GTCGAACGCC
GAAAGCGTGG AGAAGGGTCG TAAGGTCGGA CTGAAGCGAG CCGAGAACAA ACGTCTCGCC
CGTGAAGAAG GCAAGGTGTG GAGCGCCAGC GGGCCGGGTT GGCTTGAAGC CATCGTGACC
GGCACCAAGC CCAACCGACA TATTGAGTGG AAGGTCATCA AGGAACGCAA ACAGGTCGTT
CAGATGATGT TCGATTTCGC GGAAGCTGGC CTTGGTACGG TCGCGATCAC GCAGCGCTTG
AACGACGAGG GCTATAAGAG CTTTCGCTAT GATCGGCCCT GGCAGCAGAC CGTCGTTCTC
GACATGCTCC GCAATCGCGC GGTGATCGGC GAATACCAAC CGAAGTTCGC AACCTCGGGA
AGCAACAGCT ACAACCGGCC CAACGATGGC GACCCTATCT CGGGCTACTA TCCTGAGATC
ATCAAGCCCG AGCAATTCCA CCGAGTGCAG GCGATCATCG GCGCAAGGAA GCCGAAGCAA
GGCCGTTCAG CGGGATCGAA GGTGTTCAAC AACCTGTTCA TCGGCCTTGG CTCCTGCCAC
GAGTGCGGGG GAACGGTCGG CATCCATGTC GTGAAAGCGA AGGCCGAAGG GCGAGCGGAG
TATCAGGCTC TACGCTGCAT CAACTCGGCG CGGAACGCGA TCGGCAGCAC CAACGTCAAC
AACGTCTGCA CCAACAGGAC GCGCTACAGC TATCGCAAGT TGGAGGCATC CATTCTCGCG
CATGTGGGCT CCTACAAAAT CCCAAACTCC AAGTCAGGAC GGAACCGCGA GAGCGATCTA
GCGGACGCGA TAGCGATGCG AGACGACTTG GCGAAGAAGG TCGAAAACCT GCTCGATCTG
GTCGAGGACG GAAACAGAGG AATGGTCGCT CGATACAACG AGCGAGTCAC CCAACTGGAA
GCCCAGGAAG CCGAGGTCGC GAAGCTCAAG ATCGCTGTCG AGCAAACGAC CTACCAAGTG
CCGTTAGACA CCCGACGCAA AGCCCTAGCA GGGCTCATAG AGCGTCTGAA CACCGTTGAG
GGTGCTGCGC TGTATCAGCT TCGCGCATCG GTCGCGTTGG CGCTCAAAGG CGTGGTCGAT
CAAATCCGGT TCCACAAGGA CGGTAATGTC GATGTCATCC AAGCTGGGGG TGGTCGCGCA
TATCGGTTCA AGGATGGCGA CTTCATCGCG ACAGCAAACC TCCTGCCCGA TCTGGAAATT
GAAGCCGATG ACATCGGTGT TCGGTTGCGA GCTGGTTTGA CCTCTCATGA TCCCGAAGGG
GAAGAGCGTC TGAAGCGGGT AATCGCGGCC GAATAA
 
Protein sequence
MSVKAYSYAR FSSKAQAEGD SLRRQLKAAF DWADKHGLQL DTRHRDLGVS AYTGAHRVKG 
ALASFIRSVE DGEITRGSYL LVDSMDRLSR ESETEVLHLL LTLTRNGIKV VNLAENHVLD
EKAEFTDYIR VLIHASRSNA ESVEKGRKVG LKRAENKRLA REEGKVWSAS GPGWLEAIVT
GTKPNRHIEW KVIKERKQVV QMMFDFAEAG LGTVAITQRL NDEGYKSFRY DRPWQQTVVL
DMLRNRAVIG EYQPKFATSG SNSYNRPNDG DPISGYYPEI IKPEQFHRVQ AIIGARKPKQ
GRSAGSKVFN NLFIGLGSCH ECGGTVGIHV VKAKAEGRAE YQALRCINSA RNAIGSTNVN
NVCTNRTRYS YRKLEASILA HVGSYKIPNS KSGRNRESDL ADAIAMRDDL AKKVENLLDL
VEDGNRGMVA RYNERVTQLE AQEAEVAKLK IAVEQTTYQV PLDTRRKALA GLIERLNTVE
GAALYQLRAS VALALKGVVD QIRFHKDGNV DVIQAGGGRA YRFKDGDFIA TANLLPDLEI
EADDIGVRLR AGLTSHDPEG EERLKRVIAA E