Gene Information |
Locus tag | Caul_5449 |
Symbol | |
ID | 5897131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 162668 |
End bp | 165115 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641550736 |
Product | type III restriction protein res subunit |
Protein accession | YP_001672222 |
Protein GI | 167621714 |
COG category | [S] Function unknown |
COG ID | [COG4951] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
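The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the record does not state either convention, but the numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 162668
end_bp = 165115
gene_length_bp = 2448
protein_length_aa = 815

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```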
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.788934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATC GGGAAGATCG ACAACGGCGG CTCCAGGAAC GTCTTCGCCA GTTGGAGCAA GAGCGGGCGG CGATCGAGGA CGAACTCGCG GGAATGGTCA TGGCCGCCGC TCGCGAGACT TCGCGCCCCC CGGCAATGGC GTTGCAACAG CCGCGGCAGG ATCAAGCCTT TGACAATCGC GCCAAGGTTG AACTTTTTCG AAGCCTGTTT CGGGGGCGAA GCGACGTATT CCCGCTGCGT TGGGAAAACC TGAAGACAGG TAAGAGCGGC TACGCGCCGG CCTGCGCCAA CGAGTGGAAG CGGGGTCTGT GCGAGAAGCC GCGGATCAAG TGCTCTGTGT GCGCAAATCA GGCTTTCATT GAAGTCAGCG ACCAGGTGAT CACCCACCAC CTGAGGGGGC AAGGCCCGGG CGGCGCCGCG TTCGTCGCGG GCGTCTACCC GGTTCTACCG GACGACACCT GTTGGTTCTT GGCGGCCGAC TTTGATGAGG CGGAATGGCG ACGGGATGTG AAAGCCTTCG CCGAAACCTG CCGCGCCTGG GATGTGCCTG TCGCCATTGA ACGATCACGC TCCGGCAACG GCGCCCATGC GTGGATCTTC TTCAGCGAGC CGATTTCGGC CTCGCTGGCC CGACGCTTGG GATCGGCTCT GATCACCGAG ACCTTGGACC GGACGCCCGA CATCGGGTTT GCGTCCTATG ATCGCTTGTT TCCCAGCCAG GATACCGTCC CAAGCGGCGG CTTTGGCAAC CTCATTGCCC TGCCGTTGCA GGGCTTGGCG CGCCAGGCCG GCAATAGCGT GTTCCTGGAC GATGACCTCG ATCCCTACGA CGACCAATGG AGGTGCCTGG CTGGCGTCCG GCGTCTCAAA CGCGACACCC TGGAGGCCCT GGTTGATGCC GCCAGCGCAG CCGGTCGGAT TCTTGGGGTC AGGATCCCTG TCGATGACGA TGATGAGGAG CCGTGGCTGG CCCCGCCCTC GCGCCGGCGA ACGCCGCCGG CGATTGCCGG GCCCCGGCCG AGCAATCTCA CGATGGTCGT CGCAGACCAG CTCTATATTC CGCGCAGTGG TCTGCCGTCT GGCCTGGTCG CGCGCCTGAT ACGGCTGGCG GCGTTTCAAA ATCCTGAGTT CTACGCCGCC CAGGCGATGC GGTTTGCGAC CCACGACAAG CCACGGATCG TATCGTGCGC GGAGCTGACC GTGAACCACA TCGGTCTGCC GCGGGGCTGC TTTGATGTGG CGATGGACCT GTTCGCGTCG CTGGGCGTCG CGGTGGAAAT CGAGGATCAG CGCCGTCGTG GCGCGGCGAT CAACATTTCA TTCAGTGGCG TATTGCGACC GGACCAGGAG CTGGCGGTCG ATGCGCTGCT GCCGCACGAC ATCGGCGTGC TCGCCGCGAC GACGGCTTTC GGGAAGACCG TGGTGGCGGC GCGGATGATC GCGGAGCGCG GGGTCAACGT GCTCGTTCTG GTCCATCGTC GCCAGCTGAT GGACCAGTGG GTGGAGCGCC TCGGCGCGTT TCTCAACACC GCGCCAGGGA TGATCGGCAA AATCGGTGGC GGCAAACGCA AGCCCTCGGG CCTCATCGAC ATCGCCCTTA TCCAGAGCCT GGTCAGAAAA GGGGAGGTGG ACGATATCGT GGGCGACTAT GGCCACCTCA TCGTCGATGA ATGTCACCAT CTTTCGGCCG TTAGCTTCGA GCAGGTCGCC AGGCGGACAA AGGCCCGCTA CGTCCTTGGG CTATCGGCGA CGGTGACCCG GAAAGACGGC CACCATCCGA TCATCTTCAT GCAGTGCGGG 
CCGGTTCGAA AACGTGTCGA TGCGCGCGCC GAGGCGGCGA GACGTCCGTT CGATCATCAC GTCCGGATTC GGCAGACGGC GTTTCGGCTG CCAGACAGCG AGGCGAACGC GGCGGCGGTT CCGATCCAGG ACGTCTATCG GGCGCTCGCC GGCGACGAGG GCCGCAACGA ACTGATCTTC AATGACGTGT TGGCTGCGTT GGAAGCTGGG CGCTCACCGG TCGTCATCAC TGAGCGGACG GATCATCTGG AAGCGCTGGC CGATCGGCTT TCGCGCTTCG CCAAGAACGT TATCGTTTTA CGTGGCAGCC AAAGCGAGCG GAAACGGCGA GAGGCGATGG AGCGCCTCGC GGCGATTCCG GAGCAGGACG AGCGGGTGAT CGTGGCGACC GGTCGCTACC TCGGGGAGGG CTTTGATGAC CAGCGCCTGG ACACGCTCTT CCTCACCATG CCGATCGCAT GGAGGGGAAC CTTGGCGCAG TATGCCGGTC GTCTTCACCG GCTTCATGAC CCCAAGCGGG AAGTGGTCAT CTATGACTAC GTCGACCGCG ACGTCCCGGT GCTCGCCCGT ATGGCGGCCA GACGCGCCAC AGGGTATTCG GGGATCGGCT ATACGACCGT CCAGAGGCCT GGATTGTTCG ACCGATAG
|
Protein sequence | MSDREDRQRR LQERLRQLEQ ERAAIEDELA GMVMAAARET SRPPAMALQQ PRQDQAFDNR AKVELFRSLF RGRSDVFPLR WENLKTGKSG YAPACANEWK RGLCEKPRIK CSVCANQAFI EVSDQVITHH LRGQGPGGAA FVAGVYPVLP DDTCWFLAAD FDEAEWRRDV KAFAETCRAW DVPVAIERSR SGNGAHAWIF FSEPISASLA RRLGSALITE TLDRTPDIGF ASYDRLFPSQ DTVPSGGFGN LIALPLQGLA RQAGNSVFLD DDLDPYDDQW RCLAGVRRLK RDTLEALVDA ASAAGRILGV RIPVDDDDEE PWLAPPSRRR TPPAIAGPRP SNLTMVVADQ LYIPRSGLPS GLVARLIRLA AFQNPEFYAA QAMRFATHDK PRIVSCAELT VNHIGLPRGC FDVAMDLFAS LGVAVEIEDQ RRRGAAINIS FSGVLRPDQE LAVDALLPHD IGVLAATTAF GKTVVAARMI AERGVNVLVL VHRRQLMDQW VERLGAFLNT APGMIGKIGG GKRKPSGLID IALIQSLVRK GEVDDIVGDY GHLIVDECHH LSAVSFEQVA RRTKARYVLG LSATVTRKDG HHPIIFMQCG PVRKRVDARA EAARRPFDHH VRIRQTAFRL PDSEANAAAV PIQDVYRALA GDEGRNELIF NDVLAALEAG RSPVVITERT DHLEALADRL SRFAKNVIVL RGSQSERKRR EAMERLAAIP EQDERVIVAT GRYLGEGFDD QRLDTLFLTM PIAWRGTLAQ YAGRLHRLHD PKREVVIYDY VDRDVPVLAR MAARRATGYS GIGYTTVQRP GLFDR
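The gene and protein sequences can be checked against each other by translating codons. The sketch below is an illustration, not the database's own method: it translates only the first 30 and last 18 bases of the gene sequence (copied from the record with the spaces removed) and compares them with the ends of the protein sequence. It assumes the gene sequence is shown 5' to 3' on the coding strand, which fits the leading ATG even though the record lists the strand as "-". For elongation codons, translation table 11 uses the same assignments as the standard code, so the standard table is used here. The function names `translate` and `gc_fraction` are hypothetical helpers.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence as printed in the record.
gene_start = "ATGTCTGATC GGGAAGATCG ACAACGGCGG"
gene_end = "GGATTGTTCG ACCGATAG"

print(translate(gene_start))  # compare with the first 10 residues of the protein
print(translate(gene_end))    # compare with the last 5 residues, plus the stop
```

Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field; a short fragment such as `gene_start` is not expected to reproduce the 64% figure.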