Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5456 |
Symbol | |
ID | 5897090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 169980 |
End bp | 171623 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641550743 |
Product | integrase catalytic region |
Protein accession | YP_001672229 |
Protein GI | 167621721 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0520655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATG ACCCGGATAT GCGTCCAACT GGAGGCGCTG AAGAACGCTG GCGCAGGGCG CTTGAGCGCC AACCAGAGCT ACGAAAGTTG GTTGAGGCTC CCAGCCGAAG CCGAGCGGAC GTGGAAACCG CCGCCACCCG ACTTGGCTTG CATATCTCGA CGCTTTACCG GCTGCTGCGC CGCTTTGAGG TCGATCGGAC TGCGGAGGCG ATCACGGGTA GAGCGCGGGG ATGGTGCCCG GGACGCAGCC GCGTCCCGGC CTTGATCGAG ACCGTGATCG ACGCGGCGAT CCAAGACTTC TTCCTTACCA AGCAGGCTCC GTCCGCCGCA GCGTTGCATC GAGAGATCGA GTTGCGCTGC CGGGCTACTG GATTGGCAAC ACCTGCAATC TCGACCGTCC ATCGGCGGCT GCGCAAGCTC GGCCGTAAGA CGGTCGCGGG GCGCCGCGAG GGGAGAGGTG CAGCCGAGGC TCAGACTATG CGGCCCGGCG CCTTGCAGAT CGACCGACCC AATATGCTCT GGCAGATCGA CCACTCCCCA GCCGATGTCG TGATTGTGGA CGTCGAGACA CGGGCGCCGA TCGGACGGCC GTGGGTGACG CTAGTGATCG ACGTGGCCTC GCGTGTCATC GCCGGCCTGC ATGTCTCACT CGAAGATCCG TCGGTGATCT CAGTCGGACT GGCCTTGCGC CACGCCATCC TGGATAAGGC CGACGCCCTG AGGGAACGGG AGGTGGCTGC GGACTGGCCA GCCTTCGGTT TGCCGGATGG CGTCCACAGC GACAACGGCT CCGATTTTCG AAGCGCGACG TTCCAGCGGG CCTGCGCCAA CCTCGGTATC GAGATCGACT ATCGGCCCCT GGGCGCGCCG CGCTATGGCG GTCATATTGA GCGCCTGATT GGGACCGCGC AGCAGGAGAT GCACCTGCTG CCGGGTACAA CCTTCTCCAA CGTCTCACAG CGCGGTGACT ATGACAGCGA CGGGTCCGCC GCGCTCACAC TGGACGAGTT TGAGACCTGG CTTTGGCGGT TCATCGCTAG CGACTACAAT ATGCGGATCC ACTCCGTCAC GGGTCGGCCG CCGCTAGTGG CCTGGCGCTG CGGCGTCGAT GGCCAGGGCT TTGCTCCGCG GAGGCCGAGC GATCCGGAGC GTCTAGCCTT CGAATTCCTG CCTTCGGTAC CGCGCGCCAT CACCCGCCAG GGCGTCGTCT TCAACCGCAT ACATTACTAT GAGCCGTTCC TGGAGCCGCT GTTCGACACT GGGGATCGTA GGCTGCTGGT CCGCTACGAC CCGCGCGATC TGTCCCGTCT CTACCTGGCC ACGGCGCAGG GTGTTCAATC CATTCGCTAC CGCAACCTCG CGCGCCCACC CATGAGCTTA TGGGAACTCC GGGCGGCCAG ACGACGCCTC GCTGCCGAGG GCGCGGCCCA TGTTAACGAA GACGCGCTGT TCGAGGCGCG ACGACGCAAT GTCGAGTTGG TCGGACGGGC GAAGAGCGAG ACCCGCCGTC AGCGCCGTGA CGCCGAACGC CGCGACCGCG GCTATGCGGC GGCGTTGGCT CCTGACCCGC CACAGCCCGA TCCGGAGCCG GTCGCGTCGC TGGGTCCCAT CAGCGGTCGA GTCGGGGAGA TTGAGCAGTG GTGA
|
Protein sequence | MVDDPDMRPT GGAEERWRRA LERQPELRKL VEAPSRSRAD VETAATRLGL HISTLYRLLR RFEVDRTAEA ITGRARGWCP GRSRVPALIE TVIDAAIQDF FLTKQAPSAA ALHREIELRC RATGLATPAI STVHRRLRKL GRKTVAGRRE GRGAAEAQTM RPGALQIDRP NMLWQIDHSP ADVVIVDVET RAPIGRPWVT LVIDVASRVI AGLHVSLEDP SVISVGLALR HAILDKADAL REREVAADWP AFGLPDGVHS DNGSDFRSAT FQRACANLGI EIDYRPLGAP RYGGHIERLI GTAQQEMHLL PGTTFSNVSQ RGDYDSDGSA ALTLDEFETW LWRFIASDYN MRIHSVTGRP PLVAWRCGVD GQGFAPRRPS DPERLAFEFL PSVPRAITRQ GVVFNRIHYY EPFLEPLFDT GDRRLLVRYD PRDLSRLYLA TAQGVQSIRY RNLARPPMSL WELRAARRRL AAEGAAHVNE DALFEARRRN VELVGRAKSE TRRQRRDAER RDRGYAAALA PDPPQPDPEP VASLGPISGR VGEIEQW
|
| |