Gene Caul_4158 details


Gene Information

Locus tag: Caul_4158
Symbol:
ID: 5901620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4508728
End bp: 4512324
Gene Length: 3597 bp
Protein Length: 1198 aa
Translation table: 11
GC content: 64%
IMG OID: 641564679
Product: YD repeat-containing protein
Protein accession: YP_001685780
Protein GI: 167648117
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
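
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed figures, not something the record states) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4508728
END_BP = 4512324
GENE_LENGTH_BP = 3597
PROTEIN_LENGTH_AA = 1198


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)  # 3597 1198
```

Both computed values agree with the Gene Length and Protein Length fields in the record.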


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.529985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTTA CTTGGCGAAC GGCCACCTAC ATGGGCAGCG GCGGCGCCAC GGCAACCTAC 
CCCATCGCCA TCAGCGAGGT AACGCTGCCG GGCGGGCGGT CGATCAGCTA CACCTACGAA
AGCGCCACGA CCAATCCTGT CGCCAACCAA GTCCTTCGCC TGAAAAAGGT CGAGTACAAG
GACGCCGCCG GGACCGTTCT GGATAGCGAG AGCTACGGTT ACGACAACGC GCTGTTCAAG
ACCTTCGTGA CGCAGGTCTA CGACTCCGCC AATGTCCTGC GTTGGAAGGT TACTTACGAC
AATCAGGGTC GGGCGACACG CAGCGAGGGG CCCAACGGCG AGCAACTGGC GAGCGTGGTC
TACAGTCCTA AGGCTCTGGC GTACTCGCGC ACGGTGACCG ACGCGCTCGG CCGCTCGACG
GTCTACAATT ACACCAGAAG TTCACAATAC GTCTTCGACT CCAAGCTGGT CAGCATCCAG
GGGCAGGCCA CGGCCCATTG CCCCAGCAGC GCCGCCAGCA TGACCTACGG GTCCGACAAA
TACCTGAAAA CGGCGACCGA CGAGGAAGGT CGGGTAACCA GCTATGTCCG CAACGCCAAG
GCCCAGCCCA CCTCGATCAC GGAAGGCTTT GGCACGCCGT CGGCGGTGAC GCGTACGATG
ACCTGGCATG CGACGTTCAA CGTGCCCACG CAGGTGGTCG AGCCCAAGCT GACGACGGAC
TACGTCTACG ACACGCAGGG CCGGCTGACG TCCATGACGC AGAAGGACAC CACGACCTAC
ACCGCGCCCT ATGCGACCAA TGGACGGACC CGGACCTGGA CCTATGGCTG GAGCCCGTCG
GGACAGCTGC TGAGCGTCGA CGGCCCGCTG GCCGGAACCG GCGACACGCG CAGCTGGACC
TATAATGCGG ACGGCTACCT GGCCACGGCC ACCAACGAGC TGGGGCAGGT CACGACGGTC
ACCGCCTGGG ATTGGCGCGG CTCGCCGCTG ACGATCGTGG ACGCCAACGG GGTCTCCACG
GCCCTGACCT ACGATATCCG CGGACGTATG CTGACGGCGA CGCTCGATCC AGCTGGGGCC
TCGTCACAGT ATCAGTTCGC TTATAACGCC GTGGGGGACC TGACCAAGAT CACCCTGCCG
CTGGGCGGCT ATCTGCAATA CACCTATAAC GACGCGCGCC AGCTGACCCA GGTGGCCAAC
GACCGCGGCG AGACCGTCAC CCTGACGCCC AACGCCGTGG CGGATCCGAC CTCGCGCGTG
ATCGAGGCCG GCTCGACGAT CACCGCCCAG CAGACACTGG TCTATGACGA GCTGGGCCGG
CTGATCCAGG CGATCGGCGC GGGCAGCCAG ACCACCAACC TGGGGTACGA CAAGGTCAGC
AACCCCACCA GCTTGACCGA CGCGCGGGGC AAGCTGTTTA CCACCGCCTT CGACCCGCTG
GATCGGGTGA TCACCCAGAC CGATCCCGAG GCCCACAGCG TCCGCTACGC CTATGACGCG
GCCGACAAGG TGACCAGCCA CAAGGACGGC CGCCAGCTGG AGACGACGAT GATCGTCGAC
GGCTTTGGCC AGGTGATCCA GGAGACCAGC CCTGACCGGG GCCTGCGCAA ATACTGGTAC
GACGCGGCCG GGCGGCTGAC CAAGCTGGTC GACGGCGACA ACGAAGAGAC CGACTACGCC
TACGATAACG CCGGACGGCG CACGTCGATG AGCTTCCCCG GCGCGAGCTG GGAGACGGTC
ACCTACGGCT ATGACGCCGT CGCCGGCGGG AACAAGGGGG CCGGGCGGCT GACAAGCGTC
ACCGAAGAGT CCGGCTCGAC CAGCCTGACC TACGACGCCC AGGGCCGTCT GACCCAGGAC
GCCAAGGTCA TCCAGGGGCA AGCTTATAAT GTCGGTTACG CCTACGACGC CAACGGCAAG
GTCACCCAGA TCACCCTGCC ATCGGGCCGG ATCGTCACCT ACGCGCGCGC CGTCGATGGT
CAGGTGGTTG CAGTGTCGAC CAGGCCCTCG GCGACCGGCG CGGTGCAAAA CATCGCCACC
AGCGTGGCCT ACCAGCCGTT CGGCCCGCTC AAGGGCCTGA CCTATGGCAA CGGCCTGGCG
CTGGATCAGA CTCTTGACCA GAACTACTGG CTGACCGGGA CCAAGGTCTC CGCCACGGGG
GTCACGCGCC TGGACCTGAC CTTCGACCGC AACGAGAACG GCCAGCTGGC GGGGGTGACC
GACAACGCCG CCACCGGCCG CAGCGCCTCG TTCGGCTACT ATGACTCCGG CCGGCTGCAG
TACGGCGTGG GTCCCTGGGG CGACCATAGC TACGCCTATG ACGCCGCGGG CAACCGCACC
GACACCGGCG GCGTGGTCGC CTACGAGCTG GCCTCCAGCG CCGCGACCAA CAACCGGGTT
ACTCAGGTTC GTGACGCCAA CAGCACCGTA CTGCGCAATC TGATCTACCG AAGCGGCGGC
GATCTCTATC AGGACGCCCG GGTGGGCGGC TCGACCTACC AGTACTACTA CAACGCCAGG
AAGCGGCTGG TGGTGGCCAA CAAGGACACG GTCGACGCGG CCTACTACGG CTACGACTTC
CGCAACCAGC GAGTCTGGCG CCAGGTGCTG ACGCCGACCT ACAGCTCCAC CCACTACATC
TTCGACCAGC AGGGCCATCT CTTGGCCGAG CACAACGGCG ACACTGGCGC GGTGATCAAG
CAGTACATCT GGCTGGACGA CGCGCCGCTG GCGGTGATCG ACAAGTCGTC GGGAACCGAG
GTGGTCTACT ACATCCACAC CGGCCAGATC GGCGAACCGC TGGTGATGAC CGACGACAGC
AAGGCCAAGG TCTGGGACGC CTATGTCGAG CCGTTCGGCC GCGCCCAGGT GTTCGGCACG
GCCAGCGCCA ACATCGACCT GCGCCTGCCG GGGCAATGGG CCCAGATGGA GAGCGGCGGC
CTCAGCCAGA ACTGGAACCG CGACTACGAC CCCACCCTGG CCCGCTACGT CCAGGCCGAC
CGCATCGGTC TCGGCGGCGG GCAAAACCTC TACGCCTATG TCGATGGAAG GCCGACGGAA
TACAGCGATC CGGACGGAAG GATTCCGCTC CCCCTAATCA CCGGTGCCAT TGGCGCCGTG
ATCGGAGGAG GGTCGAACAT CCTTGGTCAG CTCTACATGA ACGGCGGTGA TTTTAGTTGT
ATCAACTGGA AGAACGTCGG TGTTGCAACG CTTGTTGGCG GTGTAACTGG CGCCCTTGCG
CCCTTTTATG GGACAACGCT CTATGGTGCG GCGGCCCTGG GAGCATATGG CAATTTTGCT
CAGTATACGG GAACCCAAAT GGTCAACGGG GACTCCCTCA GCTTGGGCGG CATGGGCTGG
AGTTTGGGAA CAGGTGCTGT CGGTGGCGCG ATTGGAGGTA AATTTGTCGC TCCGGGGATG
CGATTCAATC CGAATTCGCC TTTTCTAGAT GGTGGACTCG CTCGTGCGCT AAACGACTCT
CACAACCTGT CCAAGTTGCT TGCTCCCAGC GCTCTGGTCC GAAACTTTGG TGGCGCGGCA
CCTGGATCTG TCGATTGGCC TCCGATTCCA GGTTCGGCGC GTTGTAGCTG CCGATAG
 
Protein sequence
MSFTWRTATY MGSGGATATY PIAISEVTLP GGRSISYTYE SATTNPVANQ VLRLKKVEYK 
DAAGTVLDSE SYGYDNALFK TFVTQVYDSA NVLRWKVTYD NQGRATRSEG PNGEQLASVV
YSPKALAYSR TVTDALGRST VYNYTRSSQY VFDSKLVSIQ GQATAHCPSS AASMTYGSDK
YLKTATDEEG RVTSYVRNAK AQPTSITEGF GTPSAVTRTM TWHATFNVPT QVVEPKLTTD
YVYDTQGRLT SMTQKDTTTY TAPYATNGRT RTWTYGWSPS GQLLSVDGPL AGTGDTRSWT
YNADGYLATA TNELGQVTTV TAWDWRGSPL TIVDANGVST ALTYDIRGRM LTATLDPAGA
SSQYQFAYNA VGDLTKITLP LGGYLQYTYN DARQLTQVAN DRGETVTLTP NAVADPTSRV
IEAGSTITAQ QTLVYDELGR LIQAIGAGSQ TTNLGYDKVS NPTSLTDARG KLFTTAFDPL
DRVITQTDPE AHSVRYAYDA ADKVTSHKDG RQLETTMIVD GFGQVIQETS PDRGLRKYWY
DAAGRLTKLV DGDNEETDYA YDNAGRRTSM SFPGASWETV TYGYDAVAGG NKGAGRLTSV
TEESGSTSLT YDAQGRLTQD AKVIQGQAYN VGYAYDANGK VTQITLPSGR IVTYARAVDG
QVVAVSTRPS ATGAVQNIAT SVAYQPFGPL KGLTYGNGLA LDQTLDQNYW LTGTKVSATG
VTRLDLTFDR NENGQLAGVT DNAATGRSAS FGYYDSGRLQ YGVGPWGDHS YAYDAAGNRT
DTGGVVAYEL ASSAATNNRV TQVRDANSTV LRNLIYRSGG DLYQDARVGG STYQYYYNAR
KRLVVANKDT VDAAYYGYDF RNQRVWRQVL TPTYSSTHYI FDQQGHLLAE HNGDTGAVIK
QYIWLDDAPL AVIDKSSGTE VVYYIHTGQI GEPLVMTDDS KAKVWDAYVE PFGRAQVFGT
ASANIDLRLP GQWAQMESGG LSQNWNRDYD PTLARYVQAD RIGLGGGQNL YAYVDGRPTE
YSDPDGRIPL PLITGAIGAV IGGGSNILGQ LYMNGGDFSC INWKNVGVAT LVGGVTGALA
PFYGTTLYGA AALGAYGNFA QYTGTQMVNG DSLSLGGMGW SLGTGAVGGA IGGKFVAPGM
RFNPNSPFLD GGLARALNDS HNLSKLLAPS ALVRNFGGAA PGSVDWPPIP GSARCSCR
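
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation using only the Python standard library. The `translate` helper is hypothetical, and the amino-acid string is the one shared by the standard code and table 11; table 11 differs mainly in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = translate("ATGAGCTTTA CTTGGCGAAC GGCCACCTAC")
last_block = translate("TGTAGCTGCCGATAG")
print(first_block, last_block)  # MSFTWRTATY CSCR
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence, with or without the spacing between ten-base blocks, works the same way.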