Gene Caul_2146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2146 
Symbol 
ID5899601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2324853 
End bp2326232 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content72% 
IMG OID641562636 
Producthypothetical protein 
Protein accessionYP_001683772 
Protein GI167646109 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.107944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.048736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGTA CGGGCTCGAA AAAGACGCTG ACGACGGCCA ACCTCGCCGC CCTGGGCGCC 
GAGCGCCTGG CCGATCTGTT GATCGACGTC GCCGAGGGCC ACGCCCAGAT CAAGCGCCGG
CTGCGGCTGG AGCTGGCGGG CGAGGTGGGC GCCGCCGACC TGGCCGCCGA ACTGGCCAAG
CGCATCGATT CCATCGCCGA CAGCCGGGCC CGCGTTCACT GGCGCAAGCA CAAGGAGTTC
GTGCGCGAGC TCGACATGCA GCGGGCGTTG ATCGCGGGCC GGCTGACCGC CCTGGATCCG
GCCCTGGCCT TGCCGATGAT GCTGCGGTTC CTCGACCTAG CCGAAGGGGT GTTCCACCGC
ACGGCCGACG CCAAGGGCGA GGTCGACGCG GTGTTCGACG CCGCCGTCGA CGACGTGGCG
GCAATCGCGC CCCTGGCCAT GCCCAATCCT CGTGACCTGG CCGATCAACT TCTGAACCTG
CTGCTGACCG GCCGGGCGGG CCTGGGGCCG CGGGTGCTGA AGAACGCCCT GCCAGCCCTC
GGCGCCGAGG CGGTGGCCCA ACTGCGGGCC AGGATCGAGA CGACCATGGC CTCGCAGAAG
CGGGCCAGCG GCGCGCTGAA GGCCGCCGTC CAGGTATTGG CCGACGCCCA GGGCGACGTC
GACGGCTATA TCGCCCAGTT CACCGACTCC CAGGCCGTCC TGCCGCCGAT CGGGGCGCAG
ATCGCCCGGC GGCTGACGGC GGCGGGCCGT TTCGACGAGG CGGTGGCGGC GCTCGATCGC
TCGACGCCGG GGTCCTTCGC TCAACTGGTC GGGACGGTTC TGGGCCGACC CACCCTTCCA
GGGCCGGGCG CCCTGGACTG GGAGGACGCC TATATCGAGG TGCTGGAGGC GAGCGGGCGG
TCGGGTCTCG CGCAGGAGAT GCGCTGGGCC AGCTTCGAAC GCGGCCTGTC GGTCGAGCGG
CTGCGCGATC ACCTCAAGCG CCTGGCCGAT TTCGACGACG TCGAGGCCGA GGATCGCGCC
CTGGCCTATG CCGAGGATTT CCATGACCTG CACGCCGCCC TCGACTTCCT GATCCGCTGG
CCCGCCTGGG ACCGCGCCGC CCGGCTGGTG TTGCGCCGGC ACGGCGACCT GGACGGCGAC
CGCCCCGACC TGCTGGAGAC TGCCGCCCGG GCCATCGAGG GCCGCCATCC GCTGGCCGCC
ACCCTGCTGC TGCGGGCGCT GATTCTCGAC ACCGTCCGCT ACGCCCGCAC GACGCGCTAC
AAGGACGCCC AGCAGCAGTT GCTGGAGGCC GCTTCCCTGG CCCCGGCCAT CGCCGACTGG
CAGGGCCACG AAGACGCAAA CGCCTTCGCG GCGAAGGTGG CGGGCTATCG GCGGTGGTGA
 
Protein sequence
MKRTGSKKTL TTANLAALGA ERLADLLIDV AEGHAQIKRR LRLELAGEVG AADLAAELAK 
RIDSIADSRA RVHWRKHKEF VRELDMQRAL IAGRLTALDP ALALPMMLRF LDLAEGVFHR
TADAKGEVDA VFDAAVDDVA AIAPLAMPNP RDLADQLLNL LLTGRAGLGP RVLKNALPAL
GAEAVAQLRA RIETTMASQK RASGALKAAV QVLADAQGDV DGYIAQFTDS QAVLPPIGAQ
IARRLTAAGR FDEAVAALDR STPGSFAQLV GTVLGRPTLP GPGALDWEDA YIEVLEASGR
SGLAQEMRWA SFERGLSVER LRDHLKRLAD FDDVEAEDRA LAYAEDFHDL HAALDFLIRW
PAWDRAARLV LRRHGDLDGD RPDLLETAAR AIEGRHPLAA TLLLRALILD TVRYARTTRY
KDAQQQLLEA ASLAPAIADW QGHEDANAFA AKVAGYRRW