Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4158 |
Symbol | |
ID | 5901620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4508728 |
End bp | 4512324 |
Gene Length | 3597 bp |
Protein Length | 1198 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641564679 |
Product | YD repeat-containing protein |
Protein accession | YP_001685780 |
Protein GI | 167648117 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.529985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTA CTTGGCGAAC GGCCACCTAC ATGGGCAGCG GCGGCGCCAC GGCAACCTAC CCCATCGCCA TCAGCGAGGT AACGCTGCCG GGCGGGCGGT CGATCAGCTA CACCTACGAA AGCGCCACGA CCAATCCTGT CGCCAACCAA GTCCTTCGCC TGAAAAAGGT CGAGTACAAG GACGCCGCCG GGACCGTTCT GGATAGCGAG AGCTACGGTT ACGACAACGC GCTGTTCAAG ACCTTCGTGA CGCAGGTCTA CGACTCCGCC AATGTCCTGC GTTGGAAGGT TACTTACGAC AATCAGGGTC GGGCGACACG CAGCGAGGGG CCCAACGGCG AGCAACTGGC GAGCGTGGTC TACAGTCCTA AGGCTCTGGC GTACTCGCGC ACGGTGACCG ACGCGCTCGG CCGCTCGACG GTCTACAATT ACACCAGAAG TTCACAATAC GTCTTCGACT CCAAGCTGGT CAGCATCCAG GGGCAGGCCA CGGCCCATTG CCCCAGCAGC GCCGCCAGCA TGACCTACGG GTCCGACAAA TACCTGAAAA CGGCGACCGA CGAGGAAGGT CGGGTAACCA GCTATGTCCG CAACGCCAAG GCCCAGCCCA CCTCGATCAC GGAAGGCTTT GGCACGCCGT CGGCGGTGAC GCGTACGATG ACCTGGCATG CGACGTTCAA CGTGCCCACG CAGGTGGTCG AGCCCAAGCT GACGACGGAC TACGTCTACG ACACGCAGGG CCGGCTGACG TCCATGACGC AGAAGGACAC CACGACCTAC ACCGCGCCCT ATGCGACCAA TGGACGGACC CGGACCTGGA CCTATGGCTG GAGCCCGTCG GGACAGCTGC TGAGCGTCGA CGGCCCGCTG GCCGGAACCG GCGACACGCG CAGCTGGACC TATAATGCGG ACGGCTACCT GGCCACGGCC ACCAACGAGC TGGGGCAGGT CACGACGGTC ACCGCCTGGG ATTGGCGCGG CTCGCCGCTG ACGATCGTGG ACGCCAACGG GGTCTCCACG GCCCTGACCT ACGATATCCG CGGACGTATG CTGACGGCGA CGCTCGATCC AGCTGGGGCC TCGTCACAGT ATCAGTTCGC TTATAACGCC GTGGGGGACC TGACCAAGAT CACCCTGCCG CTGGGCGGCT ATCTGCAATA CACCTATAAC GACGCGCGCC AGCTGACCCA GGTGGCCAAC GACCGCGGCG AGACCGTCAC CCTGACGCCC AACGCCGTGG CGGATCCGAC CTCGCGCGTG ATCGAGGCCG GCTCGACGAT CACCGCCCAG CAGACACTGG TCTATGACGA GCTGGGCCGG CTGATCCAGG CGATCGGCGC GGGCAGCCAG ACCACCAACC TGGGGTACGA CAAGGTCAGC AACCCCACCA GCTTGACCGA CGCGCGGGGC AAGCTGTTTA CCACCGCCTT CGACCCGCTG GATCGGGTGA TCACCCAGAC CGATCCCGAG GCCCACAGCG TCCGCTACGC CTATGACGCG GCCGACAAGG TGACCAGCCA CAAGGACGGC CGCCAGCTGG AGACGACGAT GATCGTCGAC GGCTTTGGCC AGGTGATCCA GGAGACCAGC CCTGACCGGG GCCTGCGCAA ATACTGGTAC GACGCGGCCG GGCGGCTGAC CAAGCTGGTC GACGGCGACA ACGAAGAGAC CGACTACGCC TACGATAACG CCGGACGGCG CACGTCGATG AGCTTCCCCG GCGCGAGCTG GGAGACGGTC ACCTACGGCT ATGACGCCGT CGCCGGCGGG AACAAGGGGG CCGGGCGGCT GACAAGCGTC ACCGAAGAGT CCGGCTCGAC CAGCCTGACC TACGACGCCC AGGGCCGTCT GACCCAGGAC GCCAAGGTCA TCCAGGGGCA AGCTTATAAT GTCGGTTACG CCTACGACGC CAACGGCAAG GTCACCCAGA TCACCCTGCC ATCGGGCCGG ATCGTCACCT ACGCGCGCGC CGTCGATGGT CAGGTGGTTG CAGTGTCGAC CAGGCCCTCG GCGACCGGCG CGGTGCAAAA CATCGCCACC AGCGTGGCCT ACCAGCCGTT CGGCCCGCTC AAGGGCCTGA CCTATGGCAA CGGCCTGGCG CTGGATCAGA CTCTTGACCA GAACTACTGG CTGACCGGGA CCAAGGTCTC CGCCACGGGG GTCACGCGCC TGGACCTGAC CTTCGACCGC AACGAGAACG GCCAGCTGGC GGGGGTGACC GACAACGCCG CCACCGGCCG CAGCGCCTCG TTCGGCTACT ATGACTCCGG CCGGCTGCAG TACGGCGTGG GTCCCTGGGG CGACCATAGC TACGCCTATG ACGCCGCGGG CAACCGCACC GACACCGGCG GCGTGGTCGC CTACGAGCTG GCCTCCAGCG CCGCGACCAA CAACCGGGTT ACTCAGGTTC GTGACGCCAA CAGCACCGTA CTGCGCAATC TGATCTACCG AAGCGGCGGC GATCTCTATC AGGACGCCCG GGTGGGCGGC TCGACCTACC AGTACTACTA CAACGCCAGG AAGCGGCTGG TGGTGGCCAA CAAGGACACG GTCGACGCGG CCTACTACGG CTACGACTTC CGCAACCAGC GAGTCTGGCG CCAGGTGCTG ACGCCGACCT ACAGCTCCAC CCACTACATC TTCGACCAGC AGGGCCATCT CTTGGCCGAG CACAACGGCG ACACTGGCGC GGTGATCAAG CAGTACATCT GGCTGGACGA CGCGCCGCTG GCGGTGATCG ACAAGTCGTC GGGAACCGAG GTGGTCTACT ACATCCACAC CGGCCAGATC GGCGAACCGC TGGTGATGAC CGACGACAGC AAGGCCAAGG TCTGGGACGC CTATGTCGAG CCGTTCGGCC GCGCCCAGGT GTTCGGCACG GCCAGCGCCA ACATCGACCT GCGCCTGCCG GGGCAATGGG CCCAGATGGA GAGCGGCGGC CTCAGCCAGA ACTGGAACCG CGACTACGAC CCCACCCTGG CCCGCTACGT CCAGGCCGAC CGCATCGGTC TCGGCGGCGG GCAAAACCTC TACGCCTATG TCGATGGAAG GCCGACGGAA TACAGCGATC CGGACGGAAG GATTCCGCTC CCCCTAATCA CCGGTGCCAT TGGCGCCGTG ATCGGAGGAG GGTCGAACAT CCTTGGTCAG CTCTACATGA ACGGCGGTGA TTTTAGTTGT ATCAACTGGA AGAACGTCGG TGTTGCAACG CTTGTTGGCG GTGTAACTGG CGCCCTTGCG CCCTTTTATG GGACAACGCT CTATGGTGCG GCGGCCCTGG GAGCATATGG CAATTTTGCT CAGTATACGG GAACCCAAAT GGTCAACGGG GACTCCCTCA GCTTGGGCGG CATGGGCTGG AGTTTGGGAA CAGGTGCTGT CGGTGGCGCG ATTGGAGGTA AATTTGTCGC TCCGGGGATG CGATTCAATC CGAATTCGCC TTTTCTAGAT GGTGGACTCG CTCGTGCGCT AAACGACTCT CACAACCTGT CCAAGTTGCT TGCTCCCAGC GCTCTGGTCC GAAACTTTGG TGGCGCGGCA CCTGGATCTG TCGATTGGCC TCCGATTCCA GGTTCGGCGC GTTGTAGCTG CCGATAG
|
Protein sequence | MSFTWRTATY MGSGGATATY PIAISEVTLP GGRSISYTYE SATTNPVANQ VLRLKKVEYK DAAGTVLDSE SYGYDNALFK TFVTQVYDSA NVLRWKVTYD NQGRATRSEG PNGEQLASVV YSPKALAYSR TVTDALGRST VYNYTRSSQY VFDSKLVSIQ GQATAHCPSS AASMTYGSDK YLKTATDEEG RVTSYVRNAK AQPTSITEGF GTPSAVTRTM TWHATFNVPT QVVEPKLTTD YVYDTQGRLT SMTQKDTTTY TAPYATNGRT RTWTYGWSPS GQLLSVDGPL AGTGDTRSWT YNADGYLATA TNELGQVTTV TAWDWRGSPL TIVDANGVST ALTYDIRGRM LTATLDPAGA SSQYQFAYNA VGDLTKITLP LGGYLQYTYN DARQLTQVAN DRGETVTLTP NAVADPTSRV IEAGSTITAQ QTLVYDELGR LIQAIGAGSQ TTNLGYDKVS NPTSLTDARG KLFTTAFDPL DRVITQTDPE AHSVRYAYDA ADKVTSHKDG RQLETTMIVD GFGQVIQETS PDRGLRKYWY DAAGRLTKLV DGDNEETDYA YDNAGRRTSM SFPGASWETV TYGYDAVAGG NKGAGRLTSV TEESGSTSLT YDAQGRLTQD AKVIQGQAYN VGYAYDANGK VTQITLPSGR IVTYARAVDG QVVAVSTRPS ATGAVQNIAT SVAYQPFGPL KGLTYGNGLA LDQTLDQNYW LTGTKVSATG VTRLDLTFDR NENGQLAGVT DNAATGRSAS FGYYDSGRLQ YGVGPWGDHS YAYDAAGNRT DTGGVVAYEL ASSAATNNRV TQVRDANSTV LRNLIYRSGG DLYQDARVGG STYQYYYNAR KRLVVANKDT VDAAYYGYDF RNQRVWRQVL TPTYSSTHYI FDQQGHLLAE HNGDTGAVIK QYIWLDDAPL AVIDKSSGTE VVYYIHTGQI GEPLVMTDDS KAKVWDAYVE PFGRAQVFGT ASANIDLRLP GQWAQMESGG LSQNWNRDYD PTLARYVQAD RIGLGGGQNL YAYVDGRPTE YSDPDGRIPL PLITGAIGAV IGGGSNILGQ LYMNGGDFSC INWKNVGVAT LVGGVTGALA PFYGTTLYGA AALGAYGNFA QYTGTQMVNG DSLSLGGMGW SLGTGAVGGA IGGKFVAPGM RFNPNSPFLD GGLARALNDS HNLSKLLAPS ALVRNFGGAA PGSVDWPPIP GSARCSCR
|
| |