Gene Information |
Locus tag | Caul_3499 |
Symbol | |
ID | 5900954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3776192 |
End bp | 3778693 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641564005 |
Product | hypothetical protein |
Protein accession | YP_001685124 |
Protein GI | 167647461 |
COG category | |
COG ID | |
TIGRFAM ID | |
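The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that includes its stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3776192, 3778693)
print(length)                     # 2502
print(protein_length_aa(length))  # 833
```

Both results agree with the Gene Length (2502 bp) and Protein Length (833 aa) fields listed in the record.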
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTTG AATTTGTAGG CAGGAACCTC CGTGTTCGTT GGAGTGAGAA CCCACTAAAC GCCGGGATTG GCGTTCGCGG ATACCAGCTT GAATTTGACG GTACTGACGG GACGATCACG CACTTCGTTG ACGCCTCGAA ATTCCAGTCT GAAATCGGCG GCGTTCGCAA CTACGCGGAC ACGCTGCTCT TCGAAGACCT CCTATCGGTC GGCGTTACGC GCTCTGTCGT GGTCCGCCTT AAGAGCCAAT CGTCAAGCGG CAAGTTCTCC GCGCCGGTAA CGATCACCGC TCAGAACCCC GCACCAGAGC TAAGCGCTCC AACGATCACC CCATCATTCA GCGGCTTTGA CGTTCGCATC ACTGCGCCAA ACGCGCGCGA TCTCGCGGGC TACATCATCG CGGTTGGAAC GTCGTCTTCG TTTGCACATA CCAACCCCGT CAATTGGGTC CACAACGGAC CAGACACGTC GGTTGTCGTC GCTCTTCCTG ACACCACCAT TAGATATGTC CGCGTCGCCG CGTATGACGT ATTTGGCACC GATGCTCTGA TCTGGACGAC CGCTCAATCG GTCATGAAGA TGACAAGCGA TCTCTCAGAG ATCGTATCCG GTGTCGATGA GCTTCACGAC CAAGTTGCGG TCCTGAATGC CGACGCGATC ATCAACGCGC AACTACTCGT CGATGCAGCC GAACAGAACA TCAAGACGCA ATTGAACCTC GACGATCAGG TGGATTACTG GATCGACTTG GGACACCTAG AAGGCATTCC CATCGGCAGC ATTGTCGAGG AAACGCGCAC CAAGACAGAC GAGCTAGTCG AGGTTCAAAA CCGCTATCTG GTGAAGACTG CCGATGGCCT GGGAGTGAGC ATCGACCTCA CGAAGGTCAT GGTCGGTCCA ACAGAGTCGC TATCGCAAAG GCTGAACACC ATCGCGGCCG ATACAGGCGC GGATGTCAGC GCGGCAATTG AAGACCTCAA CCAAGCCCTG ACAACCAAAA TCAACGCTGA AGCGACAGCA AGGCAGACTC TTCAAACGGA CTTTGAAGGG AATCTCGCGA GCGCTGTTTC GACGACCAAG GCCTATAGCG ACAACAAGCT CACGACGACG CTGAACAGCT ACGCGACCCT AAGTCTCGTG AACGGGAACA AGAGCGCGGC AGATAGCTCG ATCTCGACAC TGACGACGAA CCTATCCGCC GAAGTGACCG CGCGCATTCA GTTGGCAACG ACCGTTGGAA ACAACAAGTC ATCTGCCGAT AGCTCGATCT CAACCCTGAC CACGAACCTA GCGGCTGAAG TCACAGCCCG AACCAATCTC GCAACGACGG TCGGCACGAA CAAGTCGAAC GCAGACGCGC AACTGCTCGT CCTAACGACT GCAAAGGACG CACAAGCATC GCAGCTTTCC ACCCTTCAAA CGACGGTCGC GGGTCACACG GCAACCATCG CAACAAATGA CACCGCATTC ACGACGGAGA AGGCCGCACA GGCGACCCGT AACAGCGTGA TCGATGCGAA GTTCAACGGC ACAACGAGCA GCACGATCTA CACCGCTGCT CAAGCGGCAG CGACACAGGC ATCGGCTGTT GCGACCACCC TGAACCAAAT GGGCGTGACG ATCGGTCAAG GCAGCGCCTG GGCCATCGAC AGCAACAAGG TCTCGGTGTC CGCGACGGAA AGCCTAGCGA CGCGCCTAAC GAGCATCAAT TCTGAGATGG GTACGAAAGC GACACCGAGC TATGTCAGTG CTCAGATCAG CACGGCTATC AGCACGGCGA CGGGTCCTGG CAGTTCGATT 
GCGACCTCGC TGTCGAACTT GTCCTCCACC GTTGGCGGCC AAACAGCGTC GATTACGACC CTGCAACAGG TCCAGAACGG CAACAGTGCT CTCTACGGAT GGAGCCTTAA TAGCGGCGGT ATTGCGGTCG GTATGAAGGC CCTGAACAAC GGGTCGGCCG GTACGAATGC GATCATCTTT TCAACCGACA ACTTCTACGT CAACACGCCC GGCGGCAACT TGCCCCTGCT GGCGATCAGC AACGGCAGGA TGGTGTTCAC GGGTAACGTG GACATCAACG GCAACCTGAT TGTCAGCGGT TCGATCACAA CGAACGGCAT CGCAATCGGT GCGGTTTCAA GCACGGTCGC GACTTCAGGT AACTACAACG GCGGCTTTGG TAACAGCGGG AACACCGCTC AAGTCGCAAC GCTCACGTTG GTTTCAACCG GCAAGCCAAT CCTGATTTCG GGCATGTATA GCGGCATGTT GGTTTCGGGT CCGTCATGGA TCAACGCTAC CGGCATCATC ACTCGCAACG GCACGACGAT TCTCGAAAGC GCCGCTTACG CGCCTCGTAG TGGTCGATAC ACGCTACCAT TCCAGATCGT CGATAATCCC GGCCCTGGGA CATGGACTTA CAATATCCAC GACACCGTAG GTACGGGCGG TTACAACGCT TTCTACTTCT ACGCTCTGTC GGCAACGGAG CTAAAAGTAT GA
|
Protein sequence | MIVEFVGRNL RVRWSENPLN AGIGVRGYQL EFDGTDGTIT HFVDASKFQS EIGGVRNYAD TLLFEDLLSV GVTRSVVVRL KSQSSSGKFS APVTITAQNP APELSAPTIT PSFSGFDVRI TAPNARDLAG YIIAVGTSSS FAHTNPVNWV HNGPDTSVVV ALPDTTIRYV RVAAYDVFGT DALIWTTAQS VMKMTSDLSE IVSGVDELHD QVAVLNADAI INAQLLVDAA EQNIKTQLNL DDQVDYWIDL GHLEGIPIGS IVEETRTKTD ELVEVQNRYL VKTADGLGVS IDLTKVMVGP TESLSQRLNT IAADTGADVS AAIEDLNQAL TTKINAEATA RQTLQTDFEG NLASAVSTTK AYSDNKLTTT LNSYATLSLV NGNKSAADSS ISTLTTNLSA EVTARIQLAT TVGNNKSSAD SSISTLTTNL AAEVTARTNL ATTVGTNKSN ADAQLLVLTT AKDAQASQLS TLQTTVAGHT ATIATNDTAF TTEKAAQATR NSVIDAKFNG TTSSTIYTAA QAAATQASAV ATTLNQMGVT IGQGSAWAID SNKVSVSATE SLATRLTSIN SEMGTKATPS YVSAQISTAI STATGPGSSI ATSLSNLSST VGGQTASITT LQQVQNGNSA LYGWSLNSGG IAVGMKALNN GSAGTNAIIF STDNFYVNTP GGNLPLLAIS NGRMVFTGNV DINGNLIVSG SITTNGIAIG AVSSTVATSG NYNGGFGNSG NTAQVATLTL VSTGKPILIS GMYSGMLVSG PSWINATGII TRNGTTILES AAYAPRSGRY TLPFQIVDNP GPGTWTYNIH DTVGTGGYNA FYFYALSATE LKV