Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5352 |
Symbol | |
ID | 5897156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 61573 |
End bp | 65838 |
Gene Length | 4266 bp |
Protein Length | 1421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641550644 |
Product | methylase/helicase |
Protein accession | YP_001672130 |
Protein GI | 167621622 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.686845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGA CATTCCCCCT CTTCAGCGCC CTTCAGACCC CAGGGGCCGC CCAGGACAAG GCCGCCAATA CGCTGGCCGC CGCCCGGGCG CTCACCCCGC ACCTCAATCG GTCGCGGGCG CTGGATCGAA AGCTGGTGGC CAGCACCATG ACCATCTCCT TCGGGGGCTC GGACGCCGAG GGCGCCTGGT CGTGGCGCGA TGCCTACGAC GCCATCGAGG CGGCCCTTGT CCTGCAGCTG CGTCGGCTCA GCCCGCAGAT TGGCCGGCTC GAGGACGCTC CGGCGCAGAT CGCGGCGCTG CTGGCCAGCG TCACCGACCT GACCTTGACC CACACCCGCC GCAGCGAGGA GCAGGTCGCG CTCGACCAGT TCTCCACGCC GCCGGCCTTG GCCGCCTTGG CGGTGCTGGC CGCCCAGGTC CGGCCTGGCG ACAAGGTGCT CGAACCGTCG GCCGGCACCG GCCTGATGGC GATCATCGCC GAGGCGTGCG GGGCGACGCT GGAGCTCAAT GAGATCGCGC CGCACCGTTC GGCTCTGCTC GATGGCCTGT TCCCGGCCGT CTCGCGCACC CGCCATGACG CGGCGCACCT GAAGGACATC CTGCCCTCGT CGGGCAGCTT CCAGGCGGTG ATCGCCAATC CGCCGTTCCA ACGGCTGGAG GGTCATCTGC ACGCCGCCAT CGACTGCCTT GCCGAAGGCG GACGGCTCAG CGCCATCGTA CCCACGCGCC TCTTCGAGGA CGCCGGGGCG ATGCAGGCGC TCGCCCGCCG CGGCCGGGTC GTGGCGTTGA TCGCCTTCCC GCCCCGGGCC TATGCCAAGC ACGGGACGTC CGTCGAGACC GGCCTGCTGG TCTTGGACTG CGTCGAAGAG TGCGGCGGCG TCCCCGTCAA GGTCGTCAGC GGCGAGACCC TGGCCGATAT CGCCAAGCTG ATCGCCGCGC TGGATCCGCG CCCGACCGCC CAGCCGCGCC AGTTCCGATC GGTCTCCAAC GTCGCCTTCC TGGCGCCGCG CGCCCGCGCC CTGGCCACCC CCTCGACCCG CCTGGGGTTC CTGGCCGGGG CGTCGCCGGT GGCGTTCGAG ACAATCGCCT GGTCGGGCGA GGGCCACGAC GTCGGCCTCT ACCAAGCCTA CCGGGTCAGC CGCCTGGACT TGCAGGGCAG CCGTCCACAC CCCTCCGCGC TGGTGGAGTC CGGCCCCATG GCGTCGGTGG CGCCGCCGGC CCCGACCTAT CGCCCGCTCC TGCCGACCAA CATCGTGCAG GAGGGGCTGC TGTCCGACGC CCAGATCGAG ACAGTGATCT ACGCCGGCGA GGCGCAATCG GGGCTTCTGC CGGGCTGGTG GGTGCTCGGC GAAGCGCCGC ACAAGCTGGT GCTGGTCAAG GAGGGCGCCC CCGGCGCCTT CCAGCTGCGG CGCGGCTTCT TCCTCGGCGA CGGCACCGGC TGCGGCAAGG GCCGCCAGGC CGCCGGGGTC ATCGCCGACA ACATGGCCCA GGGCCGCCTG AAAGCGGTGT GGATTTCCAG GAACGACACG CTTCTGGAGG ACGCACGGCG GGACTGGACG GCGATCGGCG GCTCCGCCAC CGACATCACC CCGCAGAGCG CCTGGAAGCA GGCCGACGCG ATCCGCATGG ACCGGGGGGT CCTGTTCACC ACCTATGCCA CCCTGCGTCA GCCGGCGCGG GGGATGCGTC TTTCGCGCCT CGACCAGATC GTCGGCTGGC TGGGCGCCGA CTTCGACGGG GTGATCGCCT TTGACGAAGC CCACGCCATG GCCAACGCGG CGGGCGGCGG CCAGGGGGCT CGTGGACCCA AGAAGGCCTC GCAACAGGGG ATGGCGGGCC TGGCCCTGCA GAACAGGTTG CCCAATGCGC GGGTGATGTA TGTCAGCGCC ACGGGTGCGA CCACGCCCGA GAACTTGGCC TATGCCGCCC GTCTGGGCCT GTGGGGCGGG CCGGAGGCGC CGTTCAACAC CCGCGACGCC TTCATGGACG CCGTCGAGAA GGGCGGCGTG GCGGTCATGG AGTTGATCGC CCGCGAACTG AAGGCGCTCG GCCTCTACAT CGCCCGGTCG CTGTCGTTCG ACGGGGTGGA ATATCAGGCC CTGCGCCATG TGCTGACCGG CGATGACGTC GAGATCTGGA ACGCCTGGGC CGACGCCTAC CAGTTGATCC ACCAGAACCT GCGCGCCGCC CTGGAGGCGG TCGGCATCAC GCAGGACGGC AAGGCCAAGA GCGGTCAGGC CGCTTGCGCG GTCATGTCGG CCTTCGAGGG CGCCAAGCTG CGCTTCTTCG GCGCGCTGCT GGCGGGCCTG AAGTCGCCGA CCCTGATCGC GGCGATCCGC GACGACCTCG TCCAGGGACG TTCCAGCGTC GTGCAGATCG TCTCGACCAA CGAGGCGGTG ATGGAGCGCC GGCTGGCGCA GATCCCGCCC GAAGAGTGGA ACAACCTGAC CATCGATCTG ACCCCCAAGG ACCAGGTGCT GGACTATCTG ATGGGGGCCT TCCCTATCGC CGCCATGGAG GCGATCGATG ATAAGGAAGG CAATGTGACG ATGCGCCCGT TGATCGTGGA TGGTCAGCCG GTCGTCAGCC AGGAGGCCCT GCGCCTGCGG GAGGCGCTGG TCGTCCACCT GGCCTGCCTG CCCGCCGTTC CGGGTGTGCT GGATGCGGTC CTGGGCGCGC TCGGCCCGGA CAACGTCGCC GAGATCACCG GCCGCTCGCG CCGGGTCGTC CTGCGCGATG GCCGCCGGGT GGTGGAGCGT CGCAGCGCGT CGAGCGCCAA GGCCGAGACC GACGCCTTCA TGAGCGGCAA GAAGCGGGTG CTGGTGTTCT CCGACGCCGG CGGCACGGGG CGCAGCTACC ACGCCGACCT GAACTGCGCG AACCAGGACC GTCGCCGGCA CTATCTCTGC GAGCCCGGCT GGCGGGCCGA CGCGGCGATC CAGGGCCTTG GCCGCTCGCA CCGGACCAAC CAGGCCAGCG CGCCGCTGTT CTGTCCGGTC ACGACCGACA TCCACGGCGA GAAGCGGTTC ACCTCGACGA TCTCGCGGCG GCTGGACAGC CTGGGCGCCC TGACCAAGGG TGAGCGCCGA ACAGCCGGCA ACGGCCTCTT CCGGCCCGAG GACAATCTCG AAAGCCCCTG GGCGCACCGC GCACTGCAGG CCTTCTACGT CGCCTTGCAT TGGGGGAACG TGCCGGCGAT GGACCGGGTG ACCTACGAGC AAAAGACCGG GCTTCAGCTC CTCGACAGCG ATGGCCAGTT GAAGAAGGCC GAGGACCTGC CGCCGATGAA CACCTGGCTC AACCGGCTCC TGGCGCTGCG GATCGAGGAT CAGAACGCCT TGTTCGAGAC CTTCGACGCG GTGCTCACCA GCATCCTCGA ACGGGCCGCC GCCTCGGGCG CTCTGGACAA GGGCATGGAG GACATCGTCG CCGATGACCT GACGGTGACG TCGGAGGAGG TGATCCGGAC TGACGCGGTG TCGGGCGCGC AGACCAAGGT CGTGACCTTT GCCGTCCGCA CACGCCGCGT GCTGGCGTCG GCGGCGGATG CGCTGGCCGG GCTCGATCCG CAAACGCTGG AGTATGTGGT CAACACCAAG TCGCAGCGCG CCGGCCTGGT GGTGAAGGGC CTGACGACGA CCGACGACGA TGACCGTCTG GTTCAGGCCG TCCGGCTGAT CCGCGCCGAG AAGGCCGCCG TGCTGCCCCT GAAGACCTAT GAGGAATCGG CCTGGGAGGT GGTCGCCGAA CCCATCTGGC GCGCGACCTG GGACGCCGAG GTCGCCGGCG CCGATCCCTG GCACACCCGC CAGCTGGCCC TGGTCACCGG TCTGCTGCTG CCGGTGTGGA GCAGTCTGCC TAGCAAGCGG ACCTTCGTCC GCCGGCTGAA GGCTCCGGAC GGGCGGCGCT GGCTTGGGCG GGTTCTGGGC CCCGCGGATG TCACCAAGCT GAAGATCGCG CTTGGGATCA GCGACGTCGC CACGGCCGTC GGCTCTGGCA ACAACGCGGC CAGTATGGTC CTGGGCGAGA ACATCTCCAT CGCCCTGGCG GGCGGGTTCT GGCTTCGCCG GGCCAAGGTG ATGGACCGCT ACCGGCTTGA GGTTGTCGGC GCCGGCTCCC AGCGCGCGAT GTTCCAGGCG CTCGGCTGCT TTGTCGAAAT CATCAACTAC ACCCCTCGCG TCTTCGTGCC TGTCGACCAG CCACAGGTGC TCTGCGCCGT CCTGGCCAAG TGGCCAGCCC AGACCATCCT GCCGGCGGCG GCGTAG
|
Protein sequence | MNATFPLFSA LQTPGAAQDK AANTLAAARA LTPHLNRSRA LDRKLVASTM TISFGGSDAE GAWSWRDAYD AIEAALVLQL RRLSPQIGRL EDAPAQIAAL LASVTDLTLT HTRRSEEQVA LDQFSTPPAL AALAVLAAQV RPGDKVLEPS AGTGLMAIIA EACGATLELN EIAPHRSALL DGLFPAVSRT RHDAAHLKDI LPSSGSFQAV IANPPFQRLE GHLHAAIDCL AEGGRLSAIV PTRLFEDAGA MQALARRGRV VALIAFPPRA YAKHGTSVET GLLVLDCVEE CGGVPVKVVS GETLADIAKL IAALDPRPTA QPRQFRSVSN VAFLAPRARA LATPSTRLGF LAGASPVAFE TIAWSGEGHD VGLYQAYRVS RLDLQGSRPH PSALVESGPM ASVAPPAPTY RPLLPTNIVQ EGLLSDAQIE TVIYAGEAQS GLLPGWWVLG EAPHKLVLVK EGAPGAFQLR RGFFLGDGTG CGKGRQAAGV IADNMAQGRL KAVWISRNDT LLEDARRDWT AIGGSATDIT PQSAWKQADA IRMDRGVLFT TYATLRQPAR GMRLSRLDQI VGWLGADFDG VIAFDEAHAM ANAAGGGQGA RGPKKASQQG MAGLALQNRL PNARVMYVSA TGATTPENLA YAARLGLWGG PEAPFNTRDA FMDAVEKGGV AVMELIAREL KALGLYIARS LSFDGVEYQA LRHVLTGDDV EIWNAWADAY QLIHQNLRAA LEAVGITQDG KAKSGQAACA VMSAFEGAKL RFFGALLAGL KSPTLIAAIR DDLVQGRSSV VQIVSTNEAV MERRLAQIPP EEWNNLTIDL TPKDQVLDYL MGAFPIAAME AIDDKEGNVT MRPLIVDGQP VVSQEALRLR EALVVHLACL PAVPGVLDAV LGALGPDNVA EITGRSRRVV LRDGRRVVER RSASSAKAET DAFMSGKKRV LVFSDAGGTG RSYHADLNCA NQDRRRHYLC EPGWRADAAI QGLGRSHRTN QASAPLFCPV TTDIHGEKRF TSTISRRLDS LGALTKGERR TAGNGLFRPE DNLESPWAHR ALQAFYVALH WGNVPAMDRV TYEQKTGLQL LDSDGQLKKA EDLPPMNTWL NRLLALRIED QNALFETFDA VLTSILERAA ASGALDKGME DIVADDLTVT SEEVIRTDAV SGAQTKVVTF AVRTRRVLAS AADALAGLDP QTLEYVVNTK SQRAGLVVKG LTTTDDDDRL VQAVRLIRAE KAAVLPLKTY EESAWEVVAE PIWRATWDAE VAGADPWHTR QLALVTGLLL PVWSSLPSKR TFVRRLKAPD GRRWLGRVLG PADVTKLKIA LGISDVATAV GSGNNAASMV LGENISIALA GGFWLRRAKV MDRYRLEVVG AGSQRAMFQA LGCFVEIINY TPRVFVPVDQ PQVLCAVLAK WPAQTILPAA A
|
| |